Sr Data Engineer

datafuelX Inc.

$120K — $160K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years in data engineering, focusing on scalable platforms
  • Strong skills in SQL, Python, and PySpark for analytics-driven models
  • Proficient with Iceberg, Delta, and Parquet formats
  • Hands-on experience with Dagster and Airflow orchestration
  • 2+ years in cloud environments (GCP & AWS)
  • Familiar with object storage platforms for large datasets
  • Solid grasp of data quality, testing, and monitoring methodologies

Responsibilities

  • Design, build, and maintain scalable data pipelines for media monetization
  • Integrate and transform high-volume media data from various sources
  • Develop lakehouse-style data models for analytics and reporting
  • Optimize ETL/ELT processes for batch and near-real-time data
  • Collaborate with cross-functional teams to support ML and insights
  • Ensure data quality, governance, and security of cloud assets
  • Participate in platform evolution and architectural improvements
  • Mentor junior engineers and assist in technical reviews

Benefits

  • Collaborative culture focused on innovation and continuous improvement
  • Opportunity to work on high-impact, revenue-critical projects
  • Hands-on role with a focus on building and optimizing rather than just maintaining
  • Involvement in modern data architecture and migration efforts
  • Mentorship opportunities to help develop junior staff skills
Full Job Description
About the Role

We're looking for a Senior Data Engineer that is innovative, curious, and collaborative to help
evolve the data platform that powers our sell-side media business. This role supports
data-driven workflows and a broad range of media monetization use cases, with an emphasis
on scalable, open data architectures.

You'll work on high-volume, revenue-critical media data while helping modernize how data is
stored, processed, and served across the organization. This is a hands-on role for engineers
who enjoy building, migrating, and improving platforms, not just maintaining them.

What You'll Do

  • Design, build, and maintain scalable, reliable data pipelines that support sell-side media
    monetization.
  • Ingest, integrate, transform, and model high-volume media and campaign data from
    multiple sources, delivering analytics-ready datasets that meet quality, accessibility, and
    business requirements.
  • Develop lakehouse-style data models that balance flexibility, performance, and cost
    efficiency, enabling reporting, analytics, operational workflows, and downstream
    consumption.
  • Build, optimize, and maintain ETL/ELT processes for both batch and near-real-time
    workloads, orchestrated with modern workflow tools (e.g., Airflow, Dagster).
  • Collaborate with machine learning, client-facing, product, application, and analytics teams to understand requirements, support ML feature productionization, and enable
    self-service insights.
  • Ensure data quality, lineage, observability, governance, and security, safeguarding cloud
    data assets and maintaining trust in revenue-critical systems.
  • Contribute to platform evolution and migration efforts, including evaluating tools,
    improving workflows, and reducing architectural complexity.
  • Continuously optimize pipelines, queries, and storage for performance, scalability, and
    cost efficiency.
  • Mentor junior engineers and participate in technical design and architecture reviews.
  • Detect, investigate, and resolve data anomalies to maintain pipeline reliability and data
    trustworthiness.

Required Experience & Skills

Core Requirements

  • 5+ years of experience in building highly scalable and reliable data engineering and
    analytics platforms
  • Strong experience building and optimizing modern data pipelines, architectures, and
    datasets using Data Lake, Data Warehouse, and Lakehouse paradigms
  • Advanced SQL, Python, and PySpark skills and experience building analytics-ready data models
  • Experience using Iceberg, Delta, and Parquet data formats
  • Experience using Dagster and Airflow orchestration tools
  • 2+ years hands-on experience with cloud platforms, such as GCP & AWS
  • Experience designing and building data pipelines on object storage-based platforms (e.g., S3, Wasabi, Dremio, Snowflake, Delta Lake) to process and manage large-scale analytical datasets
  • Solid understanding of data quality, testing, monitoring, and operational reliability
  • Strong communication skills and ability to work closely with technical and non-technical
    stakeholders
  • Experience collaborating with Software Engineers using Agile methodologies to build web applications that access, visualize, and sometimes update big data stores in a hybrid OLAP and OLTP environment.
  • Bachelor's degree or equivalent work experience (minimum 5 years) in Computer Science or related field

Preferred/Nice-to-Have

  • Experience in media industry strongly preferred including familiarity with sell-side
    concepts (linear TV or digital inventory, campaign delivery, pacing, audience-based selling and measurement)
  • 1+ years experience supporting or leading data platform migrations, hybrid architectures, or warehouse-to-lakehouse transitions using Dremio is a strong plus
  • Experience with Tableau, Metabase, or other data visualization tools
  • Experience working in an operational environment with time-sensitive customer
    commitments

Similar Jobs

More Jobs at datafuelX Inc.

  • Sr Data Engineer
    $120K — $160K *
    New York, NY 10025 (New York County)
    Information Technology
    In-Person

More Information Technology Jobs

Find similar Sr Data Engineer jobs: