Datavant

Senior Engineer - Ingestion & Streaming Frameworks

Datavant$150K — $190K *
US-AnywhereRemote in United States
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 6+ years in data engineering, platform engineering, or data-focused software engineering
  • 3+ years of hands-on AWS experience, focusing on networking and IAM
  • 2+ years writing production Terraform or equivalent Infrastructure as Code (IaC)
  • 1+ years building self-service tools or internal platform frameworks
  • Strong SQL skills and understanding of data organization in warehouses/lakehouses
  • Experience with Snowflake or equivalent cloud data warehouse and workflow orchestrators (preferably Airflow)
  • Proficiency in CI/CD practices using GitHub or similar tools
  • Daily use of AI coding assistants like Claude Code, Cursor, or Copilot

Responsibilities

  • Design, build, and operate data ingestion frameworks for various sources
  • Own and enhance the ingestion stack, creating new patterns for APIs
  • Develop self-service tools for product engineers to onboard data sources
  • Write and review Terraform for computation, networking, and IAM services
  • Collaborate with product and analytics teams to implement appropriate ingestion patterns
  • Lead production troubleshooting and convert incidents into platform enhancements
  • Mentor engineers and enhance team practices through code reviews and collaboration

Benefits

  • Professional development opportunities
  • Collaborative work environment
  • Health and wellness programs
  • Flexible work hours and remote work options
  • Employee recognition programs
Full Job Description
The Ingestion & Streaming team sits on our Data & Machine Learning Platform organization and owns the movement layer of Datavant's data platform: batch and streaming pipelines, change data capture, document intake, and the self-service frameworks product teams use to land new sources into Snowflake, our Iceberg-backed lakehouse, and Databricks. Most data moves into the platform; some moves back out. Our job is to make both safe, fast, observable, and boring.

We are looking for a Senior Engineer who thinks like a platform builder first. We are shifting from a service-oriented posture ("we'll build the pipeline for you") to a platform-oriented one ("here is the paved path to build it yourself, safely"), though the team still builds pipelines directly when the situation calls for it. You will be central to that shift, designing the frameworks, tooling, and guardrails that scale how Datavant onboards new sources, and rolling up your sleeves for hands-on ingestion work when there isn't yet a paved path. AI fluency is a baseline expectation here. You should already be using Claude Code, Cursor, Copilot, or equivalent tools as a core part of your daily engineering workflow, have opinions about how they make a team faster, and know how to apply them responsibly when PHI and other sensitive data are in scope.

What You Will Do:
  • Design, build, and operate the ingestion frameworks that pull data from operational databases, vendor APIs, document streams, and third-party feeds into Snowflake, Iceberg, and Databricks
  • Own and evolve the ingestion stack (AWS DMS, MWAA / Airflow, Fivetran, and the homegrown tooling on top) and design new patterns for API sources that don't fit a managed connector
  • Build self-service tooling so product engineers can onboard new sources without becoming experts in our infrastructure
  • Write and review the Terraform behind our ingestion infrastructure: AWS networking, IAM, compute, and data services
  • Partner with product, data, and analytics teams to pick the right ingestion pattern for each source (CDC, batch, API, streaming) and stand it up end-to-end
  • Lead production troubleshooting and incident response, and turn each incident into a durable platform fix
  • Raise the bar on engineering quality, observability, cost discipline, and security in everything the team ships
  • Mentor mid-career engineers and pull peers along through code review, pairing, and design feedback


What We're Looking For:
  • 6+ years in data engineering, platform engineering, or data-focused software engineering
  • 3+ years of hands-on AWS with real strength in networking (VPC, subnets, routing, PrivateLink, security groups), IAM (roles, policies, permission boundaries), and the data services this role touches, plus the judgment to know when to reach for what
  • 2+ years writing production Terraform or equivalent IaC, with experience owning modules, reasoning about state and blast radius, and shipping infrastructure changes safely
  • 1+ years building self-service tooling, internal platforms, or paved-path frameworks consumed by other engineers
  • Strong SQL skills and the ability to reason about how data physically lives in a warehouse or lake
  • Production experience with Snowflake (or an equivalent cloud data warehouse) and a workflow orchestrator (Airflow / MWAA preferred)
  • Hands-on experience with at least one ingestion approach: CDC tooling (e.g., DMS, Debezium), managed connectors (e.g., Fivetran, Airbyte), or rolling your own pipelines for API sources
  • Solid CI/CD discipline in GitHub or equivalent: branching, code review, automated checks, repeatable deployment
  • AI-native working style: daily use of Claude Code, Cursor, Copilot, or equivalent, with views on how they make a team faster
  • Working knowledge of Python is expected; mastery isn't the bar
  • Clear written and verbal communication, especially in async, remote settings


What Helps You Stand Out:
  • Direct production experience with Iceberg or another open table format, especially bridging Snowflake and Databricks
  • Hands-on Databricks or Spark
  • Kubernetes experience
  • Snowflake certification(s)
  • Azure experience (we're primarily AWS, but our customers and acquisitions aren't always)
  • In-depth experience integrating data systems with managed identity platforms, particularly via SCIM (SailPoint a plus)
  • Prior experience in healthcare or another highly regulated industry like Finance
  • Prior DBA, SRE, or DRE work operating production data systems under pressure


At Datavant our total rewards strategy powers a high-growth, high-performance, health technology company that rewards our employees for transforming health care through creating industry-defining data logistics products and services.

The range posted is for a given job title, which can include multiple levels. Individual rates for the same job title may differ based on their level, responsibilities, skills, and experience for a specific job.

The estimated total cash compensation range for this role is:

$150,000-$190,000 USD

To ensure the safety of patients and staff, many of our clients require post-offer health screenings and proof and/or completion of various vaccinations such as the flu shot, Tdap, COVID-19, etc. Any requests to be exempted from these requirements will be reviewed by Datavant Human Resources and determined on a case-by-case basis. Depending on the state in which you will be working, exemptions may be available on the basis of disability, medical contraindications to the vaccine or any of its components, pregnancy or pregnancy-related medical conditions, and/or religion.

This job is not eligible for employment sponsorship.

About Datavant

Datavant is a healthcare technology company that specializes in connecting and standardizing healthcare data from various sources. The company's products are used to improve patient care, accelerate drug development, and enhance clinical research. Datavant's platform, called the Datavant Platform, uses artificial intelligence and machine learning to identify and link patient data across different sources while maintaining patient privacy. The company was founded in 2017 by Travis May and is headquartered in San Francisco, California.
Learn more about Datavant
Size
100 employees
Industry
Founded
2017

Similar Jobs

More Jobs at Datavant

More Information Technology Jobs

Find similar Senior Engineer - Ingestion & Streaming Frameworks jobs: