Senior Staff Research Engineer - Reinforcement Learning for AI Agents

XPENG

• $244K — $413K *

Santa Clara, CA 95051In-Person

Information Technology

Less than 5 years of experience

More than 3 months ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

MS or PhD in Computer Science, AI, Machine Learning, Robotics, or related field
Strong background in reinforcement learning or machine learning
Experience implementing RL algorithms such as PPO, Actor-Critic, or policy gradient methods
Strong programming skills in Python with PyTorch or JAX
Experience building ML training systems or infrastructure

Responsibilities

Develop reinforcement learning methods for LLM-driven agents and decision systems
Optimize policies for long-horizon reasoning and planning
Learn from human or AI feedback (RLHF / RLAIF)
Build agent training pipelines on the agent infrastructure platform
Evaluate and benchmark agent capabilities
Create learning loops integrating real-world and simulation data
Contribute to AI systems that continuously improve post-deployment

Benefits

A fun, supportive and engaging environment
Opportunity to significantly impact the transportation revolution through autonomous driving
Work on cutting-edge technologies with top talent in the field
Competitive compensation package
Snacks, lunches, and fun activities

Full Job Description

We are looking for exceptional Research Engineers / Scientists to design learning systems that allow agents to plan over long horizons, learn effective strategies, and improve through experience.

This role sits at the intersection of reinforcement learning, large language models, and real-world autonomous systems. Autonomous systems must operate reliably in complex, dynamic environments. We believe the next generation of autonomy will involve learning agents that continuously improve through interaction, feedback, and large-scale data. You will help build the learning systems that power these agents.

Key Responsibilities:

Reinforcement learning methods for LLM-driven agents and decision systems.
Policy optimization for long-horizon reasoning and planning.
Learning from human or AI feedback (RLHF / RLAIF).
Agent training pipelines built on top of our agent infrastructure platform.
Evaluation and benchmarking systems for agent capabilities.
Learning loops that integrate real-world and simulation data.
Contribute to AI systems that continuously improve after deployment.

Basic Qualifications

MS or PhD in Computer Science, AI, Machine Learning, Robotics, or a related field.
Strong background in reinforcement learning or machine learning.
Experience implementing RL algorithms such as PPO, Actor-Critic, or policy gradient methods.
Strong programming skills in Python with PyTorch or JAX.
Experience building ML training systems or infrastructure.

Preferred Qualifications

Experience with RLHF or preference learning.
Experience with LLM agents or tool-using AI systems.
Multi-agent systems or long-horizon planning.
Simulation environments for RL.
Publications in NeurIPS, ICML, ICLR, ACL, or related venues.

What do we provide:

A fun, supportive and engaging environment.
Opportunity to make significant impact on transportation revolution by the means of advancing autonomous driving.
Opportunity to work on cutting edge technologies with the top talent in the field.
Competitive compensation package.
Snacks, lunches and fun activities.

The base salary range for this full-time position is $244,140 - $413,160, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

* Ladders Estimates

Similar Jobs

Research Engineer / Research Scientist, Vision
$350K — $500K+*
Anthropic
San Francisco, CA 94112 (San Francisco County)
Reposted 2 days ago
Senior Researcher/ Principal Researcher
$160K — $250K *
Fujitsu
Santa Clara, CA 95051 (Santa Clara County)
2 days ago
Research Engineer, Rule of Law
$320K — $485K *
Anthropic
San Francisco, CA 94112 (San Francisco County)
4 days ago
Sr. Research Data Scientist
$330K — $375K *
Roku
San Jose, CA 95123 (Santa Clara County)
4 days ago
Senior research Scientist - Machine Learning Systems & Efficiency Engineer
$187K — $270K *
Adobe Inc.
San Jose, CA 95123 (Santa Clara County)
4 days ago
Member of Technical Staff, Post-Training, RL
$350K — $500K *
Mirendil
San Francisco, CA 94112 (San Francisco County)
5 days ago

Get Ready For Your
Next Interview

More Jobs at XPENG

Staff Machine Learning Engineer
$215K — $364K *
Santa Clara, CA 95051 (Santa Clara County)
4 days ago
Transportation
In-Person
Senior Staff Physical AI Data Algorithm Engineer
$203K — $344K *
Santa Clara, CA 95051 (Santa Clara County)
1 month ago
Manufacturing & Automotive
In-Person
Staff Robotics Engineer / Tech Lead - Whole-Body Control & Robot Learning
$215K — $364K *
Santa Clara, CA 95051 (Santa Clara County)
1 month ago
Consumer Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
1 week ago
Platform Testing Specialist
$90K — $130K *
Amyx, Inc.
Remote
Today
Principal Machine Learning Engineer
$171K — $269K *
Atlassian
Remote
Today
Manager, Infrastructure Security
$100K — $160K *
Applied Systems
Remote
Today
Access Management Platform Engineer IV
$100K — $130K *
Airlines Reporting Corporation
Louisville, KY 40214 (Jefferson County)
Today

Find similar Senior Staff Research Engineer - Reinforcement Learning for AI Agents jobs:

Nationwide Santa Clara, CA

Senior Staff Research Engineer - Reinforcement Learning for AI Agents

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Staff Research Engineer - Reinforcement Learning for AI Agents jobs:

Get Ready For Your
Next Interview