Data Engineer

Sapiom, Inc

$120K — $160K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years transforming raw data into trustworthy, production-ready datasets
  • Hands-on experience with SQL, Python, Spark, AWS Glue, EMR, DBT, and Airflow
  • Strong command of MPP databases like Snowflake, AWS Redshift, or Teradata with production use
  • Proven collaboration with Engineering, Analytics, Data Science, and DevOps teams
  • Instincts for designing scalable schemas and systems
  • Comfortable in an on-call rotation for incident response
  • Effective communicator for both technical and non-technical stakeholders

Responsibilities

  • Build, scale, and optimize production-quality ETL pipelines
  • Design data schemas anticipating 10x growth
  • Own data quality, governance, and security across the platform
  • Develop self-serve data models for AI-powered analytics
  • Instrument pipeline observability and identify health metrics
  • Collaborate closely with Data Science, Analytics, and DevOps teams

Benefits

  • Opportunity to shape foundational data architecture
  • Engagement in a critical role that influences company-wide success
  • Collaborative culture that fosters cross-functional teamwork
  • Potential for professional growth with early-stage data initiatives
  • Access to cutting-edge data technologies and practices
Full Job Description
About the Role

This is a foundational infrastructure role at a company where the data layer isn't a back-office function - it's the nervous system of a payments platform processing every agent transaction, policy decision, and risk signal in real time. The right person thrives on ownership, has strong opinions about data quality and governance, and moves with the urgency of someone who knows that bad data costs more than bad code. As an early data engineer, you'll define not just the pipelines but the standards, architecture, and culture of data at Sapiom.

What You Will Do

You'll own Sapiom's data infrastructure end-to-end - designing and scaling ETL pipelines, defining schemas that survive 10x growth, and building the governance and quality frameworks that make data trustworthy across the company. You'll architect standardized data models that enable self-serve AI-powered insights, giving Analytics, Data Science, and product teams the visibility they need to move fast without coming to you for every query. The mandate is broad: pipelines, quality, security, observability, and the cross-functional partnerships that keep it all running.

Responsibilities
  • Build, scale, and optimize production-quality ETL pipelines - owning the full lifecycle from ingestion through availability, with clear quality and SLA standards
  • Design data schemas and architect for scale - anticipating 10x data growth and building models that don't require rework when it arrives
  • Own data quality, governance, security, and schema design across the platform - setting the standards and making sure they hold
  • Develop standardized, self-serve data models that enable AI-powered analytics - reducing friction for partner teams and eliminating one-off data pulls
  • Instrument pipeline observability and surface key health metrics to Analytics, Data Science, and DevOps - proactively surfacing issues before they become incidents
  • Partner closely with Data Science, Analytics, and DevOps - operating as a force multiplier across teams, not a bottleneck

Requirements
  • Demonstrated track record - 5+ years - transforming raw data into governed, well-documented, production-ready datasets that business teams can trust and use
  • Deep hands-on experience building and deploying production data pipelines using SQL, Python, Spark, AWS Glue, EMR, DBT, and Airflow
  • Strong command of MPP databases - Snowflake, AWS Redshift, or Teradata - with 3+ years of hands-on production use
  • Proven partnership record with Engineering, Analytics, Data Science, and DevOps teams - someone who treats cross-functional relationships as core to the job, not peripheral to it
  • Architectural instincts - able to design schemas and systems that scale gracefully, not just handle today's load
  • Comfort operating in an on-call rotation - including incident response outside regular working hours when the pipeline demands it
  • Clear communicator who can translate complex data infrastructure decisions into plain-language insights for both technical and non-technical stakeholders

Similar Jobs

More Jobs at Sapiom, Inc

  • Data Engineer
    $120K — $160K *
    San Francisco, CA 94112 (San Francisco County)
    Information Technology
    In-Person
  • Growth Marketing Manager
    $100K — $150K *
    San Francisco, CA 94112 (San Francisco County)
    Consumer Technology
    In-Person
  • Developer Relations Lead
    $120K — $160K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person
  • Member of Technical Staff
    $130K — $180K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person
  • Chief of Staff
    $150K — $200K *
    San Francisco, CA 94112 (San Francisco County)
    Business Services
    In-Person

More Information Technology Jobs

Find similar Data Engineer jobs: