Data Pipeline Engineer

Dexian

$100K — $130K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Master's degree in Computer Science, Data Engineering, or a related quantitative field
  • AWS Certified Data Engineer - Associate or Microsoft Certified Azure Data Engineer (preferred)
  • 5+ years of experience building ETL pipelines with a focus on mass data migration
  • Expert knowledge of SQL and Python
  • Hands-on experience with database migration tools and legacy Oracle systems
  • Strong problem-solving skills and attention to detail regarding data integrity

Responsibilities

  • Build automated ETL/ELT pipelines for data extraction and loading into AWS RDS and Azure ADLS Gen2
  • Program data cleansing and validation logic into pipelines per the Data Quality Rulebook
  • Optimize extraction queries to reduce performance impact on legacy systems
  • Implement physical data models for target databases
  • Document code repositories and generate migration success/failure logs
  • Collaborate with Business Analyst on technical feasibility of data transformation rules
  • Embed automated quality checks in ETL scripts

Benefits

  • Onsite work required 4 days a week in Washington, D.C.
  • Exposure to large-scale data migration projects
  • Opportunity to work with advanced cloud technologies such as AWS and Azure
  • Involvement in agile ceremonies and collaborative team environment
  • Focus on data governance and ethical practices in data management
Full Job Description
ONSITE 4 DAYS A WEEK IN WASHINGTON, D.C.

Background and Context
The Data Engineer in this role will support programs involving one or more of the following:
  • Focuses explicitly on the one-time and phased mass migration efforts, ensuring zero data loss, strict adherence to mapping rules, and the successful execution of technical cutover protocols.

2. Scope of Work
2.1 Pipeline Development and Implementation
  • Build automated ETL/ELT pipelines to extract legacy data (Oracle, SharePoint) and load it into AWS RDS and Azure ADLS Gen2.
  • Program data cleansing, standardization, and validation logic into the pipelines based on the Data Quality Rulebook.
  • Execute the physical data extraction and load during pre-production and production cutovers.

2.2 Solution Design and Optimization
  • Tune extraction queries to minimize performance impacts on legacy production systems during sync operations.
  • Implement physical data models for the target databases.
  • Code and automate the technical rollback mechanisms and data reconciliation scripts.

2.3 Stakeholder Engagement and Change Management
  • Collaborate directly with the Business Analyst to interpret and implement STTM documents.
  • Provide feedback on technical feasibility and performance implications of proposed data transformation rules.
  • Participate in daily agile ceremonies and sprint planning.

2.4 Governance, Ethics, and Risk
  • Embed automated pre- and post-migration data quality checks directly into the ETL scripts.
  • Implement encryption standards at rest (AWS KMS) and in transit (TLS 1.2+).
  • Apply role-based access control (RBAC) schemas within the database layers via IAM and Microsoft Entra ID.

2.5 Documentation and Reporting
  • Document migration code repositories, operational runbooks, and cutover execution scripts.
  • Generate automated migration success/failure logs for auditing.

3. Required Qualifications and Experience
3.1 Education
  • Bachelor's or Master's in Computer Science, Data Engineering, or a related quantitative field.

3.2 Certifications (Preferred)
  • AWS Certified Data Engineer - Associate or Microsoft Certified Azure Data Engineer.

3.3 Mandatory Experience
  • 5+ years building ETL pipelines, with heavy emphasis on one-time mass data migrations from legacy relational databases.

3.4 Technical Knowledge
  • Expert SQL and Python.
  • Hands-on experience with database migration tools (e.g., AWS DMS), Databricks, and legacy Oracle ecosystems.

3.5 Core Competencies
  • Rigorous attention to detail regarding data integrity, strong problem-solving skills for schema mismatches, and ability to work under strict cutover deadlines.

Similar Jobs

More Jobs at Dexian

More Information Technology Jobs

Find similar Data Pipeline Engineer jobs: