Senior Data Engineer ID75059

AgileEngine

$100K — $140K *
Finance & Insurance
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Authorized to work in the US without sponsorship.
  • Bachelor's degree in computer science/engineering or equivalent experience.
  • 5+ years of experience with Python.
  • 5+ years of experience with data processing libraries like Pandas, Polars, PySpark, or DuckDB.
  • 2+ years of experience with Big Data technologies like Spark and Snowflake.
  • Expert knowledge of pipeline orchestration using Airflow or similar tools.
  • Deep understanding of Medallion Architecture and diverse database technologies.

Responsibilities

  • Design and implement Python Data Engineering solutions.
  • Build scalable Data Lakes and Data Warehouses.
  • Implement robust ETL/ELT processes using Python and Airflow.
  • Develop ingestion workflows from third-party APIs.
  • Manage and optimize various file formats for data retrieval.
  • Support AI development tools for machine learning and analytics initiatives.
  • Act as a technical consultant to translate business goals into data roadmaps.

Benefits

  • Professional growth opportunities, including mentorship and personalized roadmaps.
  • Competitive USD-based compensation with budgeting for education and fitness.
  • Engagement in exciting projects with Fortune 500 companies.
  • Flexible working schedule with remote and in-office options.
Full Job Description
Job Description

ABOUT THE ROLE

We are looking for a Senior Data Engineer to design and build scalable data lakes, warehouses, and lakehouse architectures supporting a thematic research platform that processes large volumes of financial data daily. You will implement Python-based ETL/ELT pipelines, orchestrate workflows with Airflow, develop ingestion workflows from third-party APIs, and work with Snowflake, Spark, and AWS to deliver high-performance data infrastructure. The role combines hands-on engineering with technical consulting responsibilities, translating business goals into data architecture roadmaps.

WHAT YOU WILL DO

- Design and implement Python Data Engineering solutions;

- Design and build scalable Data Lakes, Data Warehouses, and Data Lakehouses;

- Design and implement robust ETL/ELT processes at scale using Python, incorporating modern pipeline orchestration tools like Airflow;

- Develop sophisticated ingestion workflows from diverse 3rd party APIs and data sources;

- Manage and optimize various file formats (Parquet, Avro, ORC) and columnar storage to ensure high-performance data retrieval;

- Work with AI development tools to support and accelerate ongoing development, machine learning initiatives and advanced analytics;

- Act as a technical consultant for stakeholders and leadership to gather requirements, understand business goals, and translate them into technical roadmaps;

- Work with Terraform and other tools to build AWS and on-prem infrastructure.

MUST HAVES

- You must be authorized to work for ANY employer in the US (e.g., Green card holders, TN visa holders, GC EAD, H4 EAD, U4U with EAD), as we are unable to sponsor or take over employment visa sponsorship at this time;

- Bachelor's degree in computer science/engineering or other technical field, or equivalent experience;

- 5+ years of experience with Python;

- 5+ years of experience with data processing, manipulation, and analytics libraries like Pandas, Polars, PySpark or DuckDB;

- 2+ years of experience with Big Data technologies (Spark, Snowflake);

- Expert-level knowledge of pipeline orchestration using Airflow or similar industry-standard tools;

- Deep understanding of Medallion Architecture, columnar file formats, and diverse database technologies (SQL, NoSQL, and Lakehouse architectures);

- Proven ability to work with 3rd party APIs for complex data ingestion tasks;

- Proficiency with modern Cloud platforms (AWS, GCP, Snowflake) and advanced SQL optimization;

- Exceptional soft skills with a proven ability to gather requirements from leadership and collaborate effectively across cross-functional teams;

- Excellence in optimizing complex data pipelines and troubleshooting data latency or consistency issues in massive datasets;

- A self-starter mindset, regularly investigating more efficient data architectures and AI development tools to improve pipeline performance;

- Taking pride in data integrity and the accuracy of the end-to-end pipelines and architectures you build;

- Strong communication skills for seamless global collaboration with stakeholders and distributed teams;

- Upper-intermediate English level.

NICE TO HAVES

- Familiarity with the fintech industry, understanding of financial data, regulatory requirements, and business processes specific to the domain;

- Documentation skills to document data pipelines, architecture designs, and best practices for knowledge sharing and future reference;

- OpenSearch, Elasticsearch;

- AWS Sagemaker Studio, Jupyter for analyze data;

- Terraform;

- Scala.

PERKS AND BENEFITS

- Professional growth: Mentorship, TechTalks, and personalized growth roadmaps.

- Competitive compensation: USD-based pay with education, fitness, and team activity budgets.

- Exciting projects: Modern solutions with Fortune 500 and top product companies.

- Flextime: Flexible schedule with remote and office options.

Similar Jobs

More Jobs at AgileEngine

More Finance & Insurance Jobs

Find similar Senior Data Engineer ID75059 jobs: