Senior Data Engineer

Fuel Cycle

• $185K — $200K *

Los Angeles, CA 90011In-Person

Enterprise Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of hands-on experience building production lakehouses on Databricks
Proficient in clean PySpark and Python coding
Deep understanding of multi-tenant SaaS architecture
Experience with Databricks components including Unity Catalog and Delta Live Tables
Strong background in batch ingestion pipeline design from relational databases
Familiarity with data tools and governance like Unity Catalog
Comfortable with AWS-native services and data integration

Responsibilities

Design and build a multi-tenant data lake that unifies diverse research data
Establish tenant-isolated data architecture for enterprise clients
Create provenance-aware data models with full traceability
Develop batch ingestion pipelines for existing relational databases
Implement nightly profile enrichment pipelines for user profiles
Build data access layers for AI agents and qualitative searches
Collaborate closely with engineering and product teams on data infrastructure

Benefits

Comprehensive health coverage including medical, dental, and vision
401(k) plan with company match for retirement savings
Equity purchase option to participate in company success
Flexible work schedule to maintain work-life balance
Generous time off including vacation, sick days, and holidays
Paid parental leave for family bonding
Monthly internet and phone stipends for remote work
Access to wellness and lifestyle perks such as fitness tools and mental wellness resources
Team connection perks including community lunches and a pet-friendly office environment

Full Job Description

Overview:

We are building the next generation of our platform - an AI-native data foundation that will power intelligent agents, living user profiles, digital twins, and conversational research analytics. This is a greenfield, foundational build that will define the future of the company.

We are building a new data engineering team reporting directly to the VP of Engineering. You will be a founding member of this team, responsible for designing and building the Databricks-first data lake and pipeline infrastructure that every future AI product will depend on.

This is not a maintenance role. We need engineers who are productive from day one - experienced enough to make architectural decisions independently and drive the build without step-by-step direction.

Key Responsibilities:

Data Lake & Warehouse Architecture

A multi-tenant data lake and warehouse that unifies all research data - surveys, qualitative feedback, CRM, media transcripts, and more - in a structured, AI-consumable format
Tenant-isolated data architecture where enterprise client data is structurally separated at the storage and query layer
Provenance-aware data models where every data point carries full traceability back to its source

Data Ingestion & Pipelines

Batch ingestion pipelines that migrate and continuously sync data from existing relational databases and cloud storage into the new lake architecture
A nightly profile enrichment pipeline that rebuilds living user profiles from all data sources within each client account

Data Access & AI Enablement

Data access layers serving AI agents via MCP, qualitative search via RAG pipelines, statistical computation tools, REST APIs, and bulk export

Your Success Metrics:

Quickly integrates with the engineering team and contributes meaningfully to the data platform build
Takes ownership of assigned pipeline and infrastructure work end-to-end, from design through production
Brings architectural recommendations and solutions proactively, rather than waiting for direction
Demonstrates strong collaboration and communication across engineering and product teams

Who you'll work with?

VP of Engineering - your direct manager and the data engineering team's founding sponsor
Peer Senior Data Engineers on the founding data engineering team
AI product and platform engineers consuming the data foundation you build
Product and engineering stakeholders across the Fuel Cycle platform

Core Skills, Competencies & Attributes:

Proactive Ownership: You bring recommendations and solutions to your manager - you don't wait to be told what to do.
Architectural Judgment: You have the judgment to make the right foundational decisions and defend them.
Greenfield Builder: You thrive on greenfield builds and take full ownership from design through to production.
Comfort with Ambiguity: You are comfortable with ambiguity and can translate high-level vision into a concrete engineering plan.
Outsized Impact: You understand that on a small team your decisions have outsized and lasting impact.

What you'll bring:

You have 5+ years of deep, hands-on experience building production lakehouses on Databricks. You write clean PySpark and Python, model data thoughtfully, and know how to build for a multi-tenant SaaS environment.
Deep production experience across the Databricks platform including Unity Catalog, Delta Live Tables, Databricks SQL, and Workflows
Delta Lake as a production table format - ACID transactions, schema evolution, performance optimization, and multi-tenant governance via Unity Catalog
Experience building and maintaining dbt transformation projects using the Databricks adapter in a production environment
PySpark for large-scale data transformation and batch pipeline authoring
Strong understanding of batch ingestion pipeline design - migrating from relational sources like MySQL and PostgreSQL into a lakehouse architecture
Experience with a modern pipeline orchestrator such as Dagster, Prefect, or Databricks Workflows; Dagster experience is a strong positive
Familiarity with vector databases, embedding pipelines, and RAG patterns for AI workloads - using tools such as Databricks Vector Search, pgvector, or Amazon OpenSearch
Exposure to AI agent and LLM-serving infrastructure including Amazon Bedrock, AgentCore, and Strands
Experience with data cataloging and governance tools such as Unity Catalog or OpenMetadata
Data modeling for multi-tenant analytical workloads - partitioning strategy, schema design, and tenant isolation patterns
Databricks on AWS - workspace configuration, S3 integration, IAM, and cost governance
Infrastructure as code using Databricks Asset Bundles or Terraform
Strong Python and SQL skills

Preferred, but Not Required:

Databricks certifications - Data Engineer Associate or Professional
Salesforce or CRM data integration experience
Prior experience in a multi-tenant SaaS environment with strict data isolation requirements
Experience migrating from OLTP to a lakehouse architecture

AWS-Native Experience - A Strong Positive:

Candidates with experience in AWS-native data services are strongly valued. Engineers who understand both Databricks and AWS-native approaches bring a broader architectural perspective that helps the team make better long-term platform decisions.
Apache Iceberg, AWS Glue, Athena, and DynamoDB experience

Benefits & Perks:

Fuel Cycle is committed to supporting the well-being, flexibility, and growth of our team. We offer a competitive and inclusive benefits package that includes:

Comprehensive Health Coverage: Medical, dental, and vision insurance plans
401(k) with Company Match: Plan for your future with our retirement savings program
Equity Purchase Option: Participate in Fuel Cycle's long-term success
Flexible Work Schedule: Empowering you to balance life and work
Generous Time Off:
- 15 vacation days and 7 sick days per year
- 12 company holidays
- 4 floating holidays/recharge days to rest or celebrate what matters to you
Paid Parental Leave: Time to bond with your growing family
Monthly Internet & Phone Stipend: Support for remote work setup
Wellness & Lifestyle Perks: Access to tools like Rightway (healthcare navigation), Headspace (mental wellness), Peloton (fitness), and more
Team Connection Perks:
- Weekly community lunches, refreshments, and snacks at our LA & NY headquarters
- Pet-friendly office environments

Compensation Overview:

The expected starting salary range for this position is $185,000 - $200,000. This range represents the typical starting compensation offered to candidates hired into this role. Final base salary will be determined based on a variety of factors, including location, work experience, skills, knowledge, education, and certifications.

This role may also be eligible for an equity grant or purchase option. These components make up your total compensation package, which will be reviewed in greater detail during your initial recruiter conversation.

* Ladders Estimates

Similar Jobs

Senior AI Data Engineer
$160K — $200K *
Cortica
San Diego, CA 92154 (San Diego County)
Today
Lead Data Platform Engineer
$155K — $208K *
The Walt Disney Company
Burbank, CA 91505 (Los Angeles County)
Reposted Today
Lead Data Platform Engineer
$155K — $208K *
The Walt Disney Company
Burbank, CA 91505 (Los Angeles County)
Reposted Today
Lead Data Platform Engineer
$155K — $208K *
The Walt Disney Company
Burbank, CA 91505 (Los Angeles County)
Reposted Today
Data Quality Engineer (Remote)
$175K — $195K *
GovCIO
Remote
Today
Senior Cloud Data Engineer (Remote)
$195K — $225K *
GovCIO
Remote
Today

Get Ready For Your
Next Interview

More Jobs at Fuel Cycle

Senior Data Engineer
$185K — $200K *
Los Angeles, CA 90011 (Los Angeles County)
Today
Enterprise Technology
In-Person
IT Systems Engineer - Tier 1
$75K — $90K *
Los Angeles, CA 90011 (Los Angeles County)
1 week ago
Information Technology
In-Person
Research Manager
$70K — $95K *
New York, NY 10025 (New York County)
1 month ago
Business Services
In-Person
Research Manager
$75K — $95K *
Los Angeles, CA 90011 (Los Angeles County)
1 month ago
Business Services
In-Person

More Enterprise Technology Jobs

Principal Project/Program Manager (Customer Care Ops - AI Transformation)
$130K — $196K *
AT&T
Dallas, TX 75217 (Dallas County)
Reposted Today
HPC Product Development Engineer
$95K — $161K *
KLA Tencor
Milpitas, CA 95035 (Santa Clara County)
Reposted Today
Algorithmic Developer
$200K — $300K *
Seven Research
New York, NY 10025 (New York County)
Today
Vice President, Global AI Sales Specialists
$231K — $407K *
Genesys
Virginia, MN 55792 (Saint Louis County)
Reposted Today
NA Mid-Market Expansion Account Executive
$80K — $120K *
Lucid Software Inc
Raleigh, NC 27610 (Wake County)
Reposted Today

Find similar Senior Data Engineer jobs:

Nationwide Los Angeles, CA

Senior Data Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Data Engineer jobs:

Get Ready For Your
Next Interview