ML/RL Engineer, Behavior Planning

Bot Auto

• $120K — $160K *

San Francisco, CA 94112In-Person

Transportation

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Proven experience in training and deploying deep RL algorithms (e.g., PPO, SAC).
Strong proficiency in Python and PyTorch with a solid grasp of deep learning techniques.
MS or PhD in Computer Science, Robotics, or a related quantitative discipline.
Ability to troubleshoot RL training challenges like variance and distribution shift.

Responsibilities

Develop and train realistic conditioned policies for stress-testing autonomous driving behaviors.
Lead research and implement safety-constrained reinforcement learning algorithms.
Collaborate to design reward functions balancing safety, progress, and comfort.
Optimize large-scale training environments for complex multi-agent scenarios.
Enhance neural architectures for better spatial reasoning and planning.
Work with teams to integrate research-grade models into production software.

Benefits

Comprehensive health insurance.
Paid time off.
Work at the forefront of the autonomous trucking industry.

Full Job Description

Role Overview

We are seeking a ML/RL Engineer to join our Algo team and drive the development of our unified behavioral architecture. In this role, you will help bridge the gap between simulation and the real world by developing a scalable policy framework that represents both our L4 ego-policy and a diverse population of simulated agents. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure our autonomous semi-trucks navigate highways with superhuman safety and precision.
Key Responsibilities

Behavioral Modeling: Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate our autonomous driving stack.
Safety-Constrained Learning: Lead the research and implementation of advanced RL algorithms to ensure safety metrics are treated as primary constraints in the learning process.
Reward & Objective Design: Collaborate with cross-functional teams to design robust reward functions and evaluation metrics that balance safety, progress, and comfort.
Scalable Training Pipelines: Contribute to the optimization of our large-scale, high-throughput training environments to enable rapid iteration on complex multi-agent scenarios.
Model Architecture: Advance our state-of-the-art neural architectures to improve spatial reasoning, long-horizon planning, and interaction modeling.
Cross-Team Collaboration: Work closely with Simulation and Planning teams to integrate research-grade models into production-quality, safety-critical software.

Required Qualifications

Professional RL Experience: Proven track record of training and deploying deep RL algorithms (e.g., PPO, SAC) for complex, real-world robotic or autonomous systems.
Technical Mastery: Expertise in Python and PyTorch; strong understanding of modern deep learning architectures and optimization techniques.
Academic Background: MS or PhD in Computer Science, Robotics, or a related quantitative field.
Scientific Intuition: Ability to diagnose and solve fundamental challenges in RL training, such as variance management and distribution shift.

Preferred Qualifications

Safe RL Specialization: Experience with constrained optimization or safety-critical learning frameworks.
Multi-Agent Systems: Background in MARL training stability, including self-play and decentralized execution strategies.
Autonomous Driving Domain: Familiarity with vehicle dynamics and behavior planning, particularly for long-haul highway environments.

Additional Information

Compensation: Competitive salary based on experience, with opportunities for performance bonuses and equity.
Benefits: Comprehensive health insurance, paid time off, and the opportunity to work at the forefront of the autonomous trucking industry.

* Ladders Estimates

Similar Jobs

Machine Learning Engineer
$150K — $180K *
BetterHelp
Remote
Today
CVML Engineer, See and Spray
$100K — $176K *
Blue River Technology
Remote
Today
Member of Technical Staff, Model Training
$130K — $180K *
Parallel Web Systems Inc
San Francisco, CA 94112 (San Francisco County)
Today
Member of Technical Staff, Search Ranking
$130K — $180K *
Parallel Web Systems Inc
San Francisco, CA 94112 (San Francisco County)
Today
Member of Technical Staff, Search Ranking
$130K — $180K *
Parallel Web Systems Inc
Palo Alto, CA 94303 (Santa Clara County)
Today
Member of Technical Staff, Model Training
$130K — $180K *
Parallel Web Systems Inc
Palo Alto, CA 94303 (Santa Clara County)
Today

Get Ready For Your
Next Interview

More Jobs at Bot Auto

ML/RL Engineer, Behavior Planning
$100K — $150K *
Houston, TX 77084 (Harris County)
Today
Transportation
In-Person
ML/RL Engineer, Behavior Planning
$120K — $160K *
San Francisco, CA 94112 (San Francisco County)
Today
Transportation
In-Person
Senior Software Engineer, Simulation Systems
$120K — $150K *
Houston, TX 77084 (Harris County)
2 weeks ago
Transportation
In-Person
Senior Software Engineer, Simulation Systems
$130K — $180K *
San Francisco, CA 94112 (San Francisco County)
2 weeks ago
Consumer Technology
In-Person
Senior Software Engineer, Operation Platforms
$130K — $180K *
San Francisco, CA 94112 (San Francisco County)
2 weeks ago
Transportation
In-Person

More Transportation Jobs

Logistics Manager
$70K — $95K *
Triple-s Steel
Houston, TX 77084 (Harris County)
Today
General Manager
$70K — $95K *
Americold
Zumbrota, MN 55992 (Goodhue County)
Today
Manager-Logistics
$96K — $144K *
AT&T
Anchorage, AK 99504 (Anchorage County)
Today
Lead Logistics Vendor Management - Reverse Logistics
$118K — $178K *
AT&T
Dallas, TX 75217 (Dallas County)
Reposted Today
Director, SaaS Sales ShipperGuide
$157K — $200K *
Loadsmart
Remote
Today

Find similar ML/RL Engineer, Behavior Planning jobs:

Nationwide San Francisco, CA

ML/RL Engineer, Behavior Planning

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar ML/RL Engineer, Behavior Planning jobs:

Get Ready For Your
Next Interview