Data Machine Learning Engineer

techire ai

• $100K — $150K *

US-AnywhereRemote in United States

Information Technology

Less than 5 years of experience

1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5-7 years of experience in building ML data pipelines at scale
Hands-on expertise with speech or audio data
Strong understanding of various speech representations
Familiarity with multi-channel audio data processing including diarisation and alignment
Experience with multilingual data is a plus

Responsibilities

Own end-to-end data pipelines from audio ingestion to training-ready datasets
Build quality assurance systems to identify annotation errors before training
Maintain training infrastructure to optimize GPU usage
Develop and refine tools for various speech representations
Handle complex audio pipeline tasks like two-channel alignment

Benefits

Remote-friendly work environment
Competitive base salary
Stock options available

Full Job Description

Job Description

Want to own the data infrastructure behind some of the most naturalistic voice models in production?

You'll be joining a well-funded speech AI startup - just closed their Series A - with strong enterprise traction and revenue that more than doubled last quarter. They're building ultra-realistic voice technology that handles natural laughter, breathing, seamless language switching, and accurate pronunciation across languages and accents. Their models are powering hundreds of millions of conversations monthly.

Before training a single model, they built their own corpus - full-duplex, studio-quality conversational speech annotated by PhD linguists. As their MLE, you'll own the pipelines that turn that raw material into clean, training-ready data.

What you'll do

Own end-to-end data pipelines from raw audio ingestion through to versioned, training-ready datasets
Build quality systems that catch annotation errors and alignment issues before they reach a training run
Maintain the training infrastructure that keeps GPUs fed - dataloaders, streaming datasets, multi-modal batching
Build and iterate on tooling across speech representations including neural codecs, semantic tokens and mel features
Handle full- and half-duplex pipeline work including two-channel alignment and overlap handling

What you'll bring

Strong engineering fundamentals with experience building ML data pipelines at scale
Hands-on experience with speech or audio data
Solid understanding of speech representations and the tradeoffs between them
Experience with multi-channel audio data including diarisation and alignment

Nice to have

Experience with multilingual data pipelines
Large-scale training infrastructure experience - FSDP, DeepSpeed, Ray
Annotation tooling and human-in-the-loop systems

Remote-friendly. Competitive base plus stock.

* Ladders Estimates

Similar Jobs

GenAI Python Systems Engineer – Experienced Associate
$61K — $100K *
PWC
New York, NY 10025 (New York County)
Today
GenAI Python Systems Engineer – Experienced Associate
$61K — $100K *
PWC
Minneapolis, MN 55407 (Hennepin County)
Today
GenAI Python Systems Engineer – Experienced Associate
$61K — $100K *
PWC
Little Rock, AR 72204 (Pulaski County)
Today
GenAI Python Systems Engineer – Experienced Associate
$61K — $100K *
PWC
Indianapolis, IN 46227 (Marion County)
Today
GenAI Python Systems Engineer – Experienced Associate
$61K — $100K *
PWC
Denver, CO 80219 (Denver County)
Today
GenAI Python Systems Engineer – Experienced Associate
$61K — $100K *
PWC
Tulsa, OK 74133 (Tulsa County)
Today

Get Ready For Your
Next Interview

More Jobs at techire ai

Inference Engineer
$130K — $180K *
San Francisco, CA 94112 (San Francisco County)
2 weeks ago
Consumer Technology
In-Person
Full Stack Software Engineer
New York, NY 10025 (New York County)
2 weeks ago
Information Technology
In-Person
Full Stack Engineer
$160K — $280K *
San Francisco, CA 94112 (San Francisco County)
2 weeks ago
Information Technology
In-Person
Frontend Engineer
New York, NY 10025 (New York County)
2 weeks ago
Enterprise Technology
In-Person
Data Machine Learning Engineer
$100K — $150K *
Remote
1 month ago
Information Technology
Remote in United States

More Information Technology Jobs

Sales Operations Specialist
Dotcomteam LLC
Salem, NH 03079 (Rockingham County)
1 week ago
Data Analysis Manager - Analytics & Transformation
$164K — $188K *
Capital One Financial Corporation
Mclean, VA 22101 (Fairfax County)
Today
Senior Data Analyst - Analytics & Transformation
$111K — $126K *
Capital One Financial Corporation
Mclean, VA 22101 (Fairfax County)
Today
Senior Lead Systems Operations Engineer
$159K — $305K *
Wells Fargo
Iselin, NJ 08830 (Middlesex County)
Today
Sr. Full Stack Engineer (SaaS Applications)
$106K — $159K *
Strategic Education, Inc.
Remote
Today

Find similar Data Machine Learning Engineer jobs:

Nationwide Remote

Data Machine Learning Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Data Machine Learning Engineer jobs:

Get Ready For Your
Next Interview