Data Reliability Engineer

DCM Infotech Limited

$100K — $130K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in computer science, Information Technology, or a related field.
  • 5-7 years of experience in database administration, with a focus on PostgreSQL.
  • 2-4 years of hands-on experience with PostgreSQL RDS.
  • Familiarity with data pipeline technologies like Debezium and Kafka Connect.
  • Strong SQL scripting skills for querying and basic data manipulation.

Responsibilities

  • Proactively monitor PostgreSQL RDS instances for performance and availability using tools like CloudWatch and Prometheus.
  • Identify and troubleshoot performance bottlenecks in PostgreSQL RDS.
  • Monitor Debezium and Kafka Connect connectors and troubleshoot data capture issues.
  • Oversee ETL workflows and resolve errors or delays in data processing.
  • Develop automation scripts and IAC templates to streamline database tasks and deployments.

Benefits

  • Participation in on-call rotations to address database incidents.
  • Opportunity to work with advanced cloud database technologies like AWS Redshift and Apache Airflow.
  • Strong emphasis on developing automation and workflow orchestration skills.
  • Exposure to compliance-related practices in cloud databases.
Full Job Description
About the Role:

As a Data Reliability Engineer II, you will play a crucial role in developing, optimizing, and managing several large data lakes and data warehouses, comprising data from multiple disparate sources.

Responsibilities:
  • Proactively monitor PostgreSQL RDS instances for performance, availability, and resource utilization (CPU, memory, storage, connections) using established monitoring tools (e.g., CloudWatch, Prometheus).
  • Assist in identifying performance bottlenecks in PostgreSQL RDS. Apply basic performance tuning techniques like reviewing query execution plans, adding missing indexes, and recommending parameter adjustments.
  • Monitor the health and performance of Debezium and Kafka Connect connectors, identifying and troubleshooting basic issues related to data capture and delivery.
  • Monitor ETL workflows and data pipelines for errors, performance bottlenecks, and processing delays. Troubleshoot and resolve issues to ensure reliable and timely data movement.
  • Provide support for data related issues and participate in root cause analysis.
  • Monitor the execution of Apache Airflow DAGs, identify failed tasks, and troubleshooting and re-runs.
  • Develop and maintain automation scripts and infrastructure as code (IAC) templates (e.g., using Crossplane, Terraform) to automate routine database tasks, deployments, and updates.
  • Participate in on-call rotations to respond to database-related incidents and perform troubleshooting and root cause analysis.
  • Assist in implementing and maintaining security best practices for cloud databases, including access controls, encryption, and compliance with regulatory requirements.
  • Regularly audit and assess database security configurations.
  • Configure and manage database backup and recovery strategies to ensure data integrity and availability in case of failures or data loss.
  • Analyse database query performance and collaborate with developers to optimize SQL queries and schemas.
  • Participate in continuous improvement initiatives to enhance the reliability, scalability, and performance of cloud databases.
  • Assist in the design and optimization of database schemas for cloud environments.
  • Skills:
  • Familiarity with data pipeline concepts and technologies such as Debezium, Kafka Connect, and ETL frameworks.
  • Basic understanding of Amazon Redshift and S3.
  • Exposure to Apache Spark for data processing.
  • Basic understanding of Apache Airflow for workflow orchestration.
  • Strong SQL scripting skills for querying and basic data manipulation.
  • Familiarity with scripting languages (e.g., Python, Bash) is a plus.
  • Knowledge of database security best practices, including access controls, encryption, and compliance with regulatory requirements (e.g., GDPR, HIPAA).
  • Having 'AWS Certified Database - Specialty' certification is a plus.
  • Experience and Qualifications:
  • Bachelor's degree in computer science, Information Technology, or a related field.
  • 5-7 years of experience in database administration, with a focus on PostgreSQL.
  • 2-4 years of hands-on experience with PostgreSQL RDS.

Similar Jobs

More Jobs at DCM Infotech Limited

More Information Technology Jobs

Find similar Data Reliability Engineer jobs: