Research Scientist / Engineer - Efficient Modeling

Rhoda AI

$120K to $160K *

Mountain View, CA 94040 (In-Person)

Enterprise Technology

Less than 5 years of experience

Today

Job Overview by Ladders

Qualifications

Strong understanding of model compression and efficient architectures for large models.
Hands-on experience with quantization, distillation, or pruning applied to transformers or large neural networks.
Deep knowledge of where efficiency gains are possible in modern architectures.
Proficiency with PyTorch and familiarity with hardware-aware optimization (CUDA, TensorRT, or similar).
Ability to run principled experiments that characterize capability-efficiency tradeoffs.

Responsibilities

Research and implement model compression techniques: quantization, pruning, structured sparsity, distillation, and low-rank approximation.
Design efficient architectures and attention mechanisms suited to real-time inference on edge and robot hardware.
Develop training strategies that produce better accuracy-efficiency tradeoffs from the start.
Profile and benchmark models across hardware targets to identify and resolve efficiency bottlenecks.
Build evaluation frameworks that measure capability retention after compression or architecture changes.
Collaborate with training systems and deployment teams to ensure efficient models translate to faster real-world inference.
Publish and present work at top-tier venues.

Benefits

Opportunity to influence real-time robot deployments with advanced model capabilities.
High-impact role that improves the efficiency of every model the team develops.
Unique blend of deep learning research with practical systems implementation.

Full Job Description

We're looking for a Research Scientist or Research Engineer focused on model efficiency - making our foundation world models faster, smaller, and more deployable without sacrificing capability. This work is critical to closing the gap between research-scale models and real-time operation on robot hardware.

**What You'll Do**

- Research and implement model compression techniques: quantization, pruning, structured sparsity, distillation, and low-rank approximation
- Design efficient architectures and attention mechanisms suited to real-time inference on edge and robot hardware
- Develop training strategies that produce better accuracy-efficiency tradeoffs from the start
- Profile and benchmark models across hardware targets to identify and resolve efficiency bottlenecks
- Build evaluation frameworks that measure capability retention after compression or architecture changes
- Collaborate with training systems and deployment teams to ensure efficient models translate to faster real-world inference
- Publish and present work at top-tier venues

**What We're Looking For**

- Strong understanding of model compression and efficient architectures for large models
- Hands-on experience with quantization, distillation, or pruning applied to transformers or large neural networks
- Deep knowledge of where efficiency gains are possible in modern architectures
- Proficiency with PyTorch and familiarity with hardware-aware optimization (CUDA, TensorRT, or similar)
- Ability to run principled experiments that characterize capability-efficiency tradeoffs

**Nice to Have (But Not Required)**

- PhD in ML, CS, or a related field - or equivalent research/engineering experience
- Publication record at NeurIPS, ICML, ICLR, MLSys, or related venues
- Experience with efficient video or multimodal model architectures
- Familiarity with edge deployment targets (Jetson, custom ASICs, or mobile hardware)
- Prior work on speculative decoding, early exit, or adaptive compute
- Experience deploying compressed models on physical robots or latency-constrained systems

**Why This Role**

- Bridge the gap between large-scale research models and real-time robot deployments
- Your work determines whether frontier capabilities actually run on our hardware
- High leverage: efficiency improvements benefit every model the team trains and deploys
- Work at a rare intersection of deep learning research and systems

* Ladders Estimates
