We are seeking a
Data Engineer with expertise in SQL, Python, DBT and RisingWave to join our modern data team.
Responsibilities:- Design high-performance SQL pipelines across PostgreSQL, BigQuery, Snowflake, and MongoDB.
- Develop Python applications for data ingestion, transformation, and automation.
- Implement RisingWave streaming pipelines for real-time analytics.
- Build Apache Kafka architectures for high-throughput data processing.
- Orchestrate workflows using Apache Airflow on Google Cloud Platform.
- Optimize queries and implement data quality checks across multiple platforms.
- Mentor team members and collaborate with business stakeholders.
- Deploy CI/CD workflows using Git for reliable pipeline management.
RequirementsRequired Qualifications:- Bachelor's degree in Computer Science, Engineering, or related field.
- 5+ years of data engineering experience with SQL, Python, and RisingWave.
- Must have AlloyDB and CDC experience (DataStream/Debezium)
- Expert DBT skills across Big Query, Snowflake and AlloyDB.
- Expert SQL skills: CTEs, window functions, optimization across PostgreSQL, BigQuery, Snowflake.
- Advanced Python: pandas, sqlalchemy, API integration, streaming data processing.
- Production experience with Apache Kafka, Apache Airflow, and Google Cloud Platform.
- Experience with MongoDB, dimensional modeling, and both batch/streaming ETL pipelines.
- Strong Git and collaborative development experience.
Technical Skills:- Core: SQL (advanced), Python, RisingWave (required).
- Cloud: Google Cloud Platform, BigQuery, GCP native services.
- Streaming: Apache Kafka, real-time data processing.
- Orchestration: Apache Airflow (production experience).
- Databases: PostgreSQL, Snowflake, MongoDB.
- Tools: Git, Docker, CI/CD pipelines.
Preferred Qualifications:- GCP certifications, Terraform/CloudFormation experience.
- previous experience with RisingWave is strongly preferred
- Data visualization tools (Looker, Tableau, Power BI).
- DataOps and analytics engineering best practices.
- ClickHouse experience is preferred
What You'll Build:- Scalable SQL pipelines across multiple database systems.
- Python-based ETL/ELT solutions spanning cloud and on-premise.
- Real-time streaming pipelines using RisingWave and Kafka.
- GCP-native data solutions with automated quality checks.
- Airflow-orchestrated workflows with CI/CD deployment.
Benefits- Competitive Salary
- Healthcare Benefits Package
- Career Growth