Data Engineer - GCP

Relanto

$100K — $140K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 2-4 years of experience in Data Engineering or related roles required
  • Proficient in Python and SQL for data manipulation and analysis
  • Hands-on experience with PySpark for distributed data processing
  • Familiar with BigQuery and GCP services for data warehousing
  • Experience with Apache Airflow for orchestration and workflow management
  • Practical knowledge of dbt for data transformation and modeling
  • Familiar with Git and CI/CD practices for version control and deployment

Responsibilities

  • Design, develop, and maintain scalable data pipelines using Python and PySpark
  • Build and optimize ETL/ELT workflows utilizing Apache Airflow
  • Develop data transformation models and workflows with dbt
  • Work with BigQuery for data warehousing, querying, and performance optimization
  • Write efficient SQL queries for processing large-scale data
  • Ensure data quality, integrity, and reliability across all pipelines
  • Collaborate with analysts, scientists, and stakeholders on data requirements
  • Monitor and troubleshoot production data workflows proactively
  • Implement logging, monitoring, testing, and CI/CD best practices
  • Optimize cloud resource usage and cost efficiency on GCP

Benefits

  • Collaborative work environment across teams including Data Analysts and Scientists
  • Opportunity to work with cutting-edge technologies like PySpark and GCP
  • Emphasis on best practices in data engineering and CI/CD
  • Exposure to diverse projects in data transformation and modeling
  • Encouragement for participation in code reviews and continuous learning opportunities
Full Job Description
Responsibilities:
  • Design, develop, and maintain scalable data pipelines using Python and PySpark
  • Build and optimize ETL/ELT workflows using Apache Airflow
  • Develop data transformation models and workflows using dbt
  • Work with BigQuery for data warehousing, querying, and optimization
  • Write efficient and optimized SQL queries for large-scale data processing
  • Ensure data quality, integrity, and reliability across pipelines
  • Collaborate with Data Analysts, Data Scientists, and business stakeholders for data requirements
  • Monitor and troubleshoot production data workflows and resolve issues proactively
  • Implement best practices for logging, monitoring, testing, and CI/CD
  • Optimize cloud resource usage and cost efficiency on GCP
  • Participate in code reviews and engineering best practices


Required Skills:
  • Must have 2-4 years of experience in Data Engineering or related roles
  • Strong hands-on experience with Python and SQL
  • Experience with PySpark and distributed data processing
  • Good knowledge of BigQuery and GCP services
  • Experience with Apache Airflow for orchestration
  • Hands-on experience with dbt for data transformation and modeling
  • Understanding of ETL/ELT frameworks and data warehousing concept
  • Familiarity with GCP services such as:
    • BigQuery
    • Cloud Storage
    • Dataproc
    • Composer
    • Pub/Sub
  • Experience with Git and CI/CD practices
  • Strong analytical and problem-solving skills


Required Skills

gcp data engineering

Similar Jobs

More Jobs at Relanto

More Information Technology Jobs

Find similar Data Engineer - GCP jobs: