Model Distillation Engineer

Hark

$120K–$300K *

San Jose, CA 95123

In-Person

Information Technology

Less than 5 years of experience

Today

Job Overview by Ladders

Qualifications

3+ years of professional experience in model compression, distillation, or efficient deep learning.
Strong fluency in PyTorch or TensorFlow with modern compression libraries.
Hands-on experience transitioning models to fixed-point or int8 formats.
Comfortable working with hardware constraints like compute and memory bandwidth.
Track record of delivering models for constrained devices.
Solid foundation in audio or sequence model architectures.

Responsibilities

Design and execute distillation strategies for compressing teacher models.
Apply quantization, pruning, and architecture search to meet product specifications.
Build a reusable distillation and compression toolchain for the team.
Collaborate with audio ML and runtime teams on training and deployment.
Define and track accuracy retention and resource KPIs during the release cycle.
Profile compressed models on hardware and work with engineers to resolve bottlenecks.

Benefits

Opportunity to work at the intersection of research and production.
Collaboration with high-caliber teams in audio machine learning.
Ability to influence model architecture and performance directly.
Exposure to cutting-edge hardware technologies and techniques.

Full Job Description

About the Role

We are looking for a Model Distillation Engineer to compress large audio and multimodal models into student models that meet the size, latency, and power budgets of our shipping hardware. This role sits between training and production. You will take teacher models from our research pipeline and produce student models that run on DSP, NPU, and microcontroller targets across our product line. You will own distillation, quantization, and architecture-aware compression as a first-class workstream.

Responsibilities

Design and execute distillation strategies (response, feature, and self-distillation) to compress teacher models into deployable students
Apply quantization (PTQ and QAT), pruning, and architecture search to hit per-product size, latency, and power budgets
Build a reusable distillation and compression toolchain that the broader audio ML team can adopt across model families
Partner with the broader audio ML team on training pipelines and with the runtime team on deployment targets
Define accuracy retention and resource KPIs per product and track them through the release cycle
Profile compressed models on target hardware and iterate with DSP and runtime engineers on bottlenecks

Requirements

3+ years of professional experience in model compression, distillation, quantization, or efficient deep learning
Strong fluency in PyTorch or TensorFlow and modern compression libraries
Hands-on experience taking models from full precision to fixed-point or int8 with controlled accuracy loss
Comfort working close to hardware and reasoning about compute, memory bandwidth, and power as design constraints
Track record of producing models that have shipped to constrained devices
Solid foundation in audio or sequence model architectures (CNNs, transformers, RNN-T, conformers)

Bonus Qualifications

Experience with Hexagon DSPs, NPUs, Ambiq-class MCUs, or similar targets
Experience with knowledge distillation at scale, including teacher-ensemble or multi-stage distillation
Familiarity with neural architecture search and hardware-aware NAS
Background shipping voice-first or far-field audio products
Contributions to open-source compression toolchains (TFLite, ONNX Runtime, AIMET, and similar)

Compensation

The US base salary range for this full-time position is $120,000 to $300,000 annually.

The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.

* Ladders Estimates
