What we're seekingA visionary Machine Learning Engineer to join our founding team who will help bridge the gap between high-level AI research and real-world physical actuation for our next-generation autonomous transport platforms. We are actively hiring across three core specialized subcategories: AI Research, Post-Training Optimization, and Data Engineering.
AI Researcher (World Models & VLA)What you'll do- Research and develop cutting edge RL and distillation techniques for trajectory planning
- Integrate emerging research from the broader AI community, identifying and prototyping the most promising solutions
- Design and deploy end-to-end multimodal models that translate real-time visual perception and high-level behavioral goals into physical vehicle actuation
- Develop interactive world models from raw multi-sensor logs, allowing the team to re-simulate events and query what a vehicle would see if it altered its trajectory
- Ensure core autonomous driving models can seamlessly adapt to novel urban environments and edge cases
- Partner with validation and QA teams to run model releases through rigorous simulated scenarios, detecting regressions and identifying systemic performance bottlenecks.
What we're looking for- 4+ years of non-internship professional MLE experience.
- Deep expertise in applying AI Transformers to robotics, physical actuation, or spatial-temporal data.
- Proven track record designing or training multimodal systems, large-scale VLA models, or generative Diffusion models.
- Strong background in Sensor Fusion, combining inputs from Cameras, LiDAR, and Radar.
- Fluency in PyTorch or JAX for training large-scale models.
- Experience with multi-task learning, Birds-Eye-View (BEV) frameworks, representation learning, or data tokenization is highly preferred.
- Proficiency in Python and familiarity with C++.
Post-Training & OptimizationWhat you'll do- Own the post-training lifecycle by distilling, quantizing, and optimizing massive models to run with low latency on vehicle edge hardware.
- Profile real-time inference pipelines to identify and eliminate CPU, GPU, and memory bandwidth bottlenecks on the vehicle.
- Work with low-level hardware, electrical, and firmware teams to iterate on custom carrier boards, sensor interfaces, and GPUs on edge devices.
- Benchmark and deploy models utilizing hardware-accelerated runtimes (e.g., TensorRT, CUDA) to minimize inference times under strict constraints.
What we're looking for- 4+ years of non-internship professional MLE experience.
- Strong background in machine learning engineering with a focus on model optimization, distillation, and deployment.
- Hands-on experience optimizing models for edge deployment or custom embedded GPU targets.
- Deep understanding of profiling tools and debugging resource constraints across CPU/GPU boundaries.
- Experience with modern deep learning frameworks (PyTorch or JAX) and runtime compilation.
- Robust programming skills in Python and C++.
- Familiarity with low-level camera/sensor interfaces and robotics hardware is a significant plus.
Data & Long-Tail ScenariosWhat you'll do- Architect automated pipelines to ingest, filter, and identify rare, high-value, and long-tail scenarios out of multi-petabyte multi-sensor datasets.
- Target and extract complex structural corner cases from real-world driving logs to continuously feed, challenge, and improve our end-to-end behavior models.
- Iterate closely with QA, testing, and simulation teams to transform ambiguous real-world anomalies into concrete data blocks for simulation testing.
- Implement programmatic data curation, active learning strategies, and statistical quality metrics to optimize the signal-to-noise ratio of our training pipelines.
What we're looking for- 4+ years of non-internship professional MLE experience.
- Professional experience building data curation pipelines, active learning workflows, or data mining architectures for massive physical datasets.
- Strong familiarity with robotics data structures and spatial frameworks, including Birds-Eye-View (BEV) or spatial tokenization.
- Experience processing and structuring raw data from Cameras, LiDAR, and Radar.
- Expert-level proficiency in Python, data engineering frameworks, and PyTorch/JAX.
- Exceptional ability to navigate, structure, and derive signal from highly ambiguous, messy, or undefined real-world data distributions.
What else you need to knowThis role is based in our San Francisco office. Atoms is a company driven by invention and continuous change - we are constantly reimagining our industries, building new products, and refining how we operate. We do our best work together. That's why all of our office-based teams work onsite, five days a week.
The base salary range for this role is
$208,000 - $263,500Actual compensation will be determined on an individual basis and may vary depending on experience, skills, and qualifications.
Base salary is just one part of your total rewards package. You may also be eligible for equity awards and an annual performance-based bonus.
Benefits Summary (USA Full-Time Exempt Employees):- Medical, Dental, Vision, Disability, and Life Insurance
- Flexible Spending Account / Health Savings Account Options
- 401(k)
- Equity
- Sick Time, Unlimited Flexible Time Off, and Paid Holidays
- Paid Parental Leave
- Pre-Tax Commuter Benefit Plan
- Team lunch in our SoMa office every Tuesday and Thursday
Benefits are subject to change at the company's discretion.
Atoms accepts applications on an ongoing basis.