Research Scientist: Pretraining

Generalist AI, Inc

• $100K — $150K *

Boston, MA 02115In-Person

Consumer Technology

Less than 5 years of experience

2 months ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5-7 years of experience in large model training (transformer or diffusion)
Demonstrated leadership in multi-node, multi-GPU distributed training
Expertise in scaling laws and optimizations
Proficient in PyTorch with troubleshooting at all stack layers
Strong emphasis on empirical rigor and rapid iterations
Passion for developing foundational robotic intelligence

Responsibilities

Design and execute large-scale pretraining runs for robot models
Define model architectures and training goals using multimodal data
Develop scalable datasets and strategic sampling methods
Lead data collection initiatives and identify new datasets
Conduct ablation studies to analyze scaling laws and data quality
Collaborate with ML infrastructure teams to enhance system performance
Convert raw robotic data into actionable model capabilities

Benefits

Flexible work hours
Opportunity to work at the forefront of robotics and AI
Collaborative and innovative team environment
Access to cutting-edge technology and resources
Professional development opportunities

Full Job Description

About the Role

You will build the base intelligence layer for robotics. We train large-scale robot foundation models from massive multimodal datasets spanning video, proprioception, action traces, language, and more. You will design and run the core large-scale training efforts that give our models fundamentally new general capabilities across embodiments, tasks, and environments. You will "live and breathe" all forms of robot data.

You'll be responsible for:

Designing and executing large-scale pretraining runs for robot foundation models (transformer- and diffusion-based architectures)
Defining model architectures, objectives, and training curricula across multimodal robotic data (vision, action, state, language)
Developing scalable data mixtures and sampling strategies across petabyte-scale datasets
Guiding data collection operations towards new directions, as well as sourcing new datasets
Running ablations to understand scaling laws, data quality effects, and architecture tradeoffs
Collaborating closely with ML Infra and Systems to push cluster utilization, throughput, and reliability
Turning raw robotic interaction data into generalizable model capabilities

You might thrive in this role if you:

Have deep experience training large transformer or diffusion models at scale (for generative models e.g. including language models, audio models, or video models)
Have led or significantly contributed to multi-node, multi-GPU distributed training efforts
Have worked on scaling laws, optimization dynamics, and large-model failure modes
Have strong PyTorch fundamentals and comfort debugging at every layer of the stack
Care about both empirical rigor and raw iteration speed
Are excited about building general-purpose robot intelligence from first principles

* Ladders Estimates

Similar Jobs

Principal ML/AI Cheminformatics Scientist
$106K — $176K *
Pfizer
Groton, CT 06340 (Southeastern Ct County)
Today
Scientist-Vaccine Immunology
$79K — $132K *
Pfizer
Pearl River, NY 10965 (Rockland County)
Today
Associate Director of Real World Evidence (Pharma Co Experience Required) - Remote US
$114K — $210K *
Syneos Health Careers
Remote
Reposted Today
Research Scientist 1
$90K — $120K *
Lincoln Laboratory
Cambridge, MA 02139 (Middlesex County)
Today
Junior Machine Learning Researcher
$80K — $110K *
Straumann
Andover, MA 01810 (Essex County)
Today
Materials Scientist
$75K — $115K *
Formlabs
Boston, MA 02115 (Suffolk County)
Today

Get Ready For Your
Next Interview

More Consumer Technology Jobs

Manager, Product Marketing - New Content Experiences & Product Innovation
$240K — $362K *
Netflix
Los Gatos, CA 95032 (Santa Clara County)
Today
Lead Product Technology
$128K — $215K *
AT&T
Dallas, TX 75217 (Dallas County)
Today
Lead Manager Communications & PR
$118K — $178K *
AT&T
Dallas, TX 75217 (Dallas County)
Today
Direct-Response Copywriter
$120K — $180K *
Growth Partner & Consultancy Limited
Remote
Reposted Today
Head of Growth Strategy & Operations
$251K — $377K *
Snap Inc
San Francisco, CA 94112 (San Francisco County)
Today

Find similar Research Scientist: Pretraining jobs:

Nationwide Boston, MA

Research Scientist: Pretraining

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Research Scientist: Pretraining jobs:

Get Ready For Your
Next Interview