Zeta Global

Senior Data Reliability Engineer

Zeta Global$120K — $150K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in computer science or related field; master's preferred.
  • 10-12 years of experience in database engineering or reliability engineering.
  • At least 5 years of hands-on experience with PostgreSQL, especially in AWS RDS environments.
  • Strong expertise in cloud-native data platforms and enterprise-scale production environments.
  • AWS Certified Database - Specialty or related cloud certifications preferred.

Responsibilities

  • Own the reliability and performance of PostgreSQL RDS environments.
  • Lead monitoring initiatives for PostgreSQL RDS using CloudWatch and Prometheus.
  • Drive advanced PostgreSQL performance tuning and capacity planning.
  • Architect backup and disaster recovery strategies for business continuity.
  • Oversee Debezium and Kafka Connect for real-time data ingestion.
  • Optimize ETL workflows and data pipelines for reliability.
  • Manage Apache Airflow for efficient workflow orchestration.

Benefits

  • Health, dental, and vision insurance.
  • 401(k) plan with company match.
  • Flexibility in remote working options.
  • Professional development opportunities and training.
  • Generous vacation and paid time off policies.
Full Job Description
About the Role:

As a Senior Data Reliability Engineer, you will be responsible for architecting, scaling, and optimizing enterprise-grade data platforms, including large-scale data lakes and data warehouses built from multiple disparate data sources. This role requires deep expertise in cloud databases, data infrastructure reliability, observability, and automation, with a strong focus on operational excellence, performance, and resilience.

Responsibilities:

  • Own the reliability, availability, scalability, and performance of PostgreSQL RDS environments across production and non-production systems.
  • Lead proactive monitoring and observability initiatives for PostgreSQL RDS instances, leveraging tools such as CloudWatch, Prometheus, Grafana, and other enterprise monitoring platforms.
  • Drive advanced PostgreSQL performance tuning, including query optimization, indexing strategies, parameter tuning, and capacity planning.
  • Architect and optimize database backup, disaster recovery, and failover strategies to ensure business continuity and minimal downtime.
  • Own the reliability and operational excellence of Debezium and Kafka Connect ecosystems, ensuring robust real-time data ingestion and delivery.
  • Lead troubleshooting and optimization of ETL workflows and data pipelines, ensuring scalability, reliability, and fault tolerance across data platforms.
  • Oversee Apache Airflow workflow orchestration, ensuring high reliability, SLA adherence, and operational efficiency of production DAGs.
  • Design and implement Infrastructure as Code (IaC) solutions using tools such as Terraform, Crossplane, and automation frameworks to streamline deployments and operational tasks.
  • Lead incident response, root cause analysis, and post-incident reviews for critical production issues.
  • Define and enforce database security standards, including access controls, encryption policies, compliance adherence, and periodic security audits.
  • Partner closely with engineering, DevOps, and data platform teams to optimize data architecture and improve overall platform reliability.
  • Mentor junior engineers and drive best practices across database reliability engineering and cloud data operations.
  • Identify and lead continuous improvement initiatives focused on reliability, automation, scalability, and operational maturity.


Skills:

  • Deep expertise in PostgreSQL administration and performance tuning, preferably in AWS RDS environments.
  • Strong experience with Debezium, Kafka Connect, ETL frameworks/tools, and enterprise-grade data pipeline architectures.
  • Strong hands-on experience with Amazon Redshift, S3, and cloud-native data platforms.
  • Expertise in Apache Airflow workflow orchestration and operational management.
  • Experience with Apache Spark and large-scale distributed data processing.
  • Strong scripting and automation experience using Python, Bash, or similar languages.
  • Strong experience in Infrastructure as Code (IaC) using Terraform, Crossplane, or equivalent tools.
  • Hands-on experience with monitoring and observability tools such as CloudWatch, Prometheus, Grafana.
  • Strong understanding of cloud database security, compliance, and governance frameworks (e.g., GDPR, HIPAA).
  • Experience designing highly available, fault-tolerant, and scalable cloud database systems.


Experience and Qualifications:

  • Bachelor's degree in computer science, Information Technology, or a related field (master's preferred).
  • 10-12 years of overall experience in database engineering, cloud data infrastructure, or reliability engineering.
  • Minimum 5+ years of hands-on experience with PostgreSQL, including AWS RDS administration.
  • Strong experience in cloud-native data platforms and enterprise-scale production environments.
  • AWS Certified Database - Specialty or relevant cloud certifications preferred.

About Zeta Global

Zeta Global is a data-driven marketing technology company that combines the power of artificial intelligence with the scale of data, applying insights from over 2.4 billion user profiles to generate business outcomes. Zeta Global?s products and services include programmatic media buying, email marketing, CRM, data and analytics, and marketing automation. The company serves a wide range of industries, including financial services, insurance, automotive, telecommunications, retail, publishing, and travel. Zeta Global has offices in North America, Europe, and Asia-Pacific.
Learn more about Zeta Global
Size
1,300 employees
Market Cap
$1.7 billion
Industry
Founded
2007
NASDAQ

Similar Jobs

More Jobs at Zeta Global

  • Zeta Global
    Lead - Data Engineer
    $120K — $160K *
    Basking Ridge, NJ 07920 (Somerset County)
    Information Technology
    In-Person
  • Zeta Global
    Lead Database Administrator
    $120K — $150K *
    Basking Ridge, NJ 07920 (Somerset County)
    Information Technology
    In-Person
  • Zeta Global
    Data Engineer II
    $100K — $130K *
    Basking Ridge, NJ 07920 (Somerset County)
    Enterprise Technology
    In-Person
  • Zeta Global
    Database Administrator II
    $90K — $120K *
    Basking Ridge, NJ 07920 (Somerset County)
    Information Technology
    In-Person
  • Zeta Global
    Senior Data Reliability Engineer
    $120K — $150K *
    Basking Ridge, NJ 07920 (Somerset County)
    Information Technology
    In-Person

More Information Technology Jobs

Find similar Senior Data Reliability Engineer jobs: