5+ years in foundation model research and engineering
Proven track record in pioneering machine learning methods or enhancing existing ones
First-author publications or significant project contributions
High attention to detail with a focus on data and code quality
Expertise in modern Python programming
Experience with Vision-Language-Action models or video generation is a plus
Responsibilities
Develop and optimize training for Vision-Language-Action foundation models in robotics
Collaborate with teams to curate diverse datasets
Aggregate Internet-scale datasets for embodied perception and robot video generation
Design novel simulation techniques for data expansion and diversity
Train and evaluate generative models of 3D objects and environments
Work with a dedicated team to advance Physical AI
Benefits
Collaborative environment with skilled professionals
Opportunity to work on cutting-edge technology in robotics
Engagement in projects with significant real-world applications
Support for independent problem-solving and initiative
Potential for personal and professional growth in the AI field
Full Job Description
What You'll Do
Develop and optimize the training and inference stack for Vision-Language-Action foundation models in robotics
Collaborate with simulation and real-world robotics teams to curate high-quality, diverse, and large-scale datasets
Curate the world's best Internet-scale datasets for embodied perception and first-person robot video generation
Design new generative simulation techniques to expand simulation data scale and diversity, training and evaluating generative models of 3D objects and environments, and language/code models to generate tasks and reward functions
Collaborate with a team of driven individuals committed to building general-purpose Physical AI
What You'll Bring
Passion for your craft and demonstrated excellence in foundation model research and engineering
Exceptional ownership and initiative-finding and solving problems independently
Extensive experience pioneering new machine learning ideas or refining existing methods, supported by first-author publications or impactful projects (5+ years)
A relentless commitment to data and code quality, rigorous evaluation, and meticulous attention to detail
Production-level expertise in modern Python
Bonus: Experience with Vision-Language-Action models for robotics or web agents, embodied perception, or video generation