Senior Autonomy Data Engineer

Torc Robotics • $160K — $193K *

US-Anywhere • Remote in Blacksburg, VA

Transportation

Less than 5 years of experience

Today

Job Overview by Ladders

Qualifications

Bachelor's degree in Computer Science, Computer Engineering, Software Engineering, Electrical Engineering or a related field (6+ years of experience or 4+ years with a Master's degree).
Strong proficiency in Python and SQL, capable of building production-quality data pipelines.
Deep experience with cloud data infrastructure (preferably AWS: S3, Glue, Athena, Redshift) and infrastructure-as-code tools (e.g. Terraform, CloudFormation).
Solid understanding of data partitioning strategies and columnar storage formats (Parquet, ORC, etc.).
Experience with data pipelines that process time-series and binary data.

Responsibilities

Own design and organization of the program's data lake, including schema definitions and metadata indexing.
Design and maintain reliable end-to-end pipelines for ingesting high-bandwidth sensor logs.
Develop data validation checks to detect corrupted information and inconsistent calibration.
Build tooling for querying raw logs to produce curated training datasets.
Deploy data visualization tooling to support log review and autonomy debugging workflows.
Establish data contracts between data services and model training consumers.
Contribute to the data roadmap and mentor junior engineers in data infrastructure best practices.

Benefits

100% paid medical, dental, and vision premiums for full-time employees.
401K plan with 6% employer match.
Flexible schedule with generous paid vacation available immediately after start date.
Company-wide holiday office closures.
AD&D and Life Insurance.

Full Job Description

Meet The Team:

Torc is hiring a Senior Autonomy Data Engineer to design, build and operate the data infrastructure that powers our autonomy program. You will build the pipelines, storage systems, and tooling that turn raw vehicle sensor logs into the curated, structured datasets that our perception, planning and simulation engineers depend on.

This is a high-ownership role on a lean team. Moving large-scale sensor data reliably from vehicles operating in demanding environments and making it quickly available for model training is a difficult, high-impact problem. You will work directly with ML engineers, autonomy developers and platform engineers to close this data loop.

What You'll Do

Data Lake and Ingestion Pipeline
- Own the design and organization of the program's data lake, including schema definitions, partitioning strategy and metadata indexing.
- Design and maintain highly reliable end-to-end pipelines that ingest high-bandwidth sensor logs from vehicles into cloud storage and tolerate ad hoc and intermittent connectivity.
- Develop data validation and integrity checks that detect corrupted information, missing sensors, and inconsistent calibration before downstream systems process the data.
- Implement retention, tiering and lifecycle policies for data to balance storage costs with development value.
Dataset Curation and Labeling Infrastructure
- Build tooling to query raw logs to produce curated training and evaluation datasets.
- Build automation to run cost-effective pseudo-labeling workflows at the scale of data ingest.
- Implement data quality and model performance metrics that are used to direct labeling effort toward the highest-value examples.
Autonomy Data Visualization
- Deploy and maintain data visualization tooling to support log review, annotation QA, and autonomy debugging workflows.
- Build integrations between the visualization tooling and the data lake so engineers can navigate from a dataset entry or model failure directly to the origin log data.
- Work with autonomy engineers to define and surface custom visualization panels and implement metrics for analyzing unstructured operating environments.
- Build dashboards that provide the autonomy engineers visibility into data coverage by terrain type, operating environment and geographic region.
Cross-functional Collaboration
- Establish and document data contracts between the data services and model training consumers.
- Partner with perception, planning and embedded engineers across the data lifecycle: from shaping the logging schemas and collection triggers to defining the dataset interfaces that supply model training and evaluation.
- Define data engineering standards, best practices, and tooling choices for an innovative and fast-paced team.
- Contribute to the data roadmap and provide input to technical leadership on investment priorities.
- Mentor junior engineers and raise the team's capabilities in data infrastructure scalability and operational hygiene.

What You'll Need to Succeed:

Bachelor's degree in Computer Science, Computer Engineering, Software Engineering, Electrical Engineering or a related field with 6+ years of data engineering experience or a Master's with 4+ years.
Strong proficiency in Python and SQL, with demonstrated ability to build production-quality data pipelines.
Deep experience with cloud data infrastructure (AWS preferred: S3, Glue, Athena, Redshift, or equivalent) and infrastructure-as-code tools (Terraform, CloudFormation).
Solid understanding of data partitioning strategies and columnar storage formats (Parquet, ORC, etc.).
Experience building and operating data pipelines that process time-series and binary data.
Proven ability to evaluate and integrate open-source tooling when appropriate versus building from scratch.
Strong instincts for delivering data quality through first-class implementations of monitoring, validation and lineage tracking.

Bonus points!

Experience with autonomous vehicles, robotics, or other sensor-driven autonomous systems.
Deep experience with Foxglove or Rerun beyond basic playback, e.g. building custom extensions or integrating them into a structured log review or annotation QA workflow.
Familiarity with the MCAP CLI and/or Python library and experience converting MCAP data to columnar data formats for further querying and processing.
Experience with data curation for ML training, e.g. diversity sampling, pseudo-labeling, and dataset versioning.

Perks of Being a Torc'r

Torc cares about our team members and we strive to provide benefits and resources to support their health, work/life balance, and future. Our culture is collaborative, energetic, and team focused. Torc offers:

A competitive compensation package that includes a bonus component and stock options
100% paid medical, dental, and vision premiums for full-time employees
401K plan with a 6% employer match
Flexibility in schedule and generous paid vacation (available immediately after start date)
Company-wide holiday office closures
AD&D and Life Insurance

Job ID: R-102765

Hiring Range for Job Opening

US Pay Range

$160,800-$193,000 USD

About Torc Robotics

Torc Robotics is a company that develops autonomous vehicle technology. It was founded in 2005 in Blacksburg, Virginia, and has since become a leader in the field of self-driving vehicles. Torc Robotics has developed autonomous technology for a variety of applications, including military vehicles, mining trucks, and consumer cars. The company has partnerships with major automotive manufacturers, including Daimler Trucks North America and Caterpillar. In 2019, Torc Robotics was acquired by Daimler Trucks North America, and it continues to operate as a subsidiary of the company.

Size

200 employees

Industry

Enterprise Technology

Founded

2005

* Ladders Estimates
