Data Operations Engineer

Specter

• $100K — $140K *

San Francisco, CA 94112In-Person

Information Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

1-3+ years in data operations or project management, ideally in ML or engineering contexts.
Proficient in Python with skills in building tools, scripts, and dashboards.
Strong communication skills for effectively managing external vendors or cross-functional teams.
Familiar with machine learning workflows and their impact on model performance.
Highly organized and capable of managing multiple priorities simultaneously.
Self-directed and able to work independently.
Bonus: experience with computer vision data and labeling operations.

Responsibilities

Own the relationship with data labeling providers, managing tasks and issues.
Build and maintain tools for data labeling, including interfaces and pipelines.
Define and implement quality control standards for labeled data.
Collaborate with researchers to develop data collection strategies and identify coverage gaps.
Create dashboards to monitor dataset diversity and class balance.
Track the flow of data from labeling to model training, identifying failure modes for improvement.
Integrate new data sources and define labeling taxonomies.

Benefits

Flexible work environment and schedule
Opportunity to work at the intersection of engineering and research
Role has a significant impact on model effectiveness and research outcomes
Collaborative team atmosphere with cross-functional partners
Access to the latest in machine learning technologies and methodologies

Full Job Description

Role:

Specter is hiring a data operations engineer to build our research data operation. This individual will own the full pipeline from defining what data we need, to getting it labeled at high quality, to ensuring it meets the needs of our research team and ultimately improves our models. The role sits at the intersection of engineering and research, with a focus on building systems and tooling.

Responsibilities:

Own the end-to-end relationship with our data labeling provider, including task scoping, timeline management, and issue resolution
Build and maintain internal tooling for labelers, including annotation interfaces, task pipelines, and dataset browsers
Define and enforce quality control standards across all labeled data, implementing automated checks and audit workflows
Partner with researchers to translate perception model needs into data collection strategies, identifying gaps in coverage across object types, scenes, lighting conditions, and sensor modalities
Build dashboards and metrics to monitor dataset diversity, class balance, and domain coverage
Close the loop on the data flywheel: track how labeled data flows into training, surface failure modes, and drive iteration on the pipeline from collection through to model improvement
Evaluate and integrate new data sources
Define labeling taxonomies and annotation specifications

Qualifications:

1-3+ years of experience in data operations, project management, or a technical coordination role, ideally supporting ML or engineering teams
Proficiency in Python and comfort building lightweight tools, scripts, and dashboards
Strong written and verbal communication skills, with experience managing external vendors or cross-functional stakeholders
Familiarity with ML workflows and how training data impacts model performance
Highly organized, with a track record of managing multiple concurrent workstreams
Self-directed and autonomous
Bonus: experience with computer vision data, annotation platforms, or labeling operations

* Ladders Estimates

Similar Jobs

Database Reliability Engineer
$100K — $150K *
Fuse Energy
Remote
Today
Data Quality Engineer
$80K — $100K *
QualiTest Group
Santa Clara, CA 95051 (Santa Clara County)
Today
Cloud Data Engineer
$136K — $276K *
Hewlett Packard Enterprise Development LP
San Jose, CA 95123 (Santa Clara County)
Reposted Yesterday
Software Engineer, Growth Data Platform
$120K — $160K *
Omni Analytics
San Francisco, CA 94112 (San Francisco County)
Yesterday
Software Engineer, Growth Data Platform
$120K — $150K *
Omni Analytics
Remote
Yesterday
Software Engineer - VE
$120K — $160K *
Apple
Cupertino, CA 95014 (Santa Clara County)
Yesterday