5+ years of experience in machine learning research or engineering
Demonstrated excellence in foundation model research, backed by first-author publications
Strong problem-solving skills with a track record of independent initiative
Proven commitment to high standards of data and code quality
Advanced proficiency in Python programming
Bonus: Familiarity with Vision-Language-Action models for robotics or video generation
Responsibilities
Develop and optimize training stacks for Vision-Language-Action models
Collaborate with robotics teams to create large-scale datasets
Curate high-quality datasets for embodied perception
Design generative simulation techniques for enhanced data diversity
Train and evaluate generative models for 3D environments
Work with a team focused on building general-purpose Physical AI
Benefits
Collaborative team environment with a focus on innovation
Opportunities to work on cutting-edge technology and research
Engagement in meaningful projects contributing to the field of robotics
Support for professional development and continuous learning
Flexible work arrangements promoting work-life balance
Full Job Description
What You'll Do
Develop and optimize the training and inference stack for Vision-Language-Action foundation models in robotics
Collaborate with simulation and real-world robotics teams to curate high-quality, diverse, and large-scale datasets
Curate the world's best Internet-scale datasets for embodied perception and first-person robot video generation
Design new generative simulation techniques to expand simulation data scale and diversity, training and evaluating generative models of 3D objects and environments, and language/code models to generate tasks and reward functions
Collaborate with a team of driven individuals committed to building general-purpose Physical AI
What You'll Bring
Passion for your craft and demonstrated excellence in foundation model research and engineering
Exceptional ownership and initiative-finding and solving problems independently
Extensive experience pioneering new machine learning ideas or refining existing methods, supported by first-author publications or impactful projects (5+ years)
A relentless commitment to data and code quality, rigorous evaluation, and meticulous attention to detail
Production-level expertise in modern Python
Bonus: Experience with Vision-Language-Action models for robotics or web agents, embodied perception, or video generation