Member of Technical Staff, Post-Training, RL Infra

Mirendil

• $350K — $500K *

San Francisco, CA 94112In-Person

Information Technology

Less than 5 years of experience

4 days ago

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of experience in engineering, specifically with reinforcement learning (RL) systems
Proven expertise in building scalable infrastructure for machine learning models
Strong programming skills, preferably in Python or similar languages
Familiarity with performance optimization techniques in large-scale systems
Experience collaborating with cross-functional teams and translating research into production systems

Responsibilities

Design and build robust infrastructure for large-scale RL training
Implement performance optimizations across the training stack
Develop evaluation and benchmarking systems for model assessment
Create data collection and feedback pipelines for iterative training improvements
Collaborate with various teams to accelerate the deployment of RL experiments

Benefits

Meaningful equity grant based on experience
Competitive benefits package

Full Job Description

The Role

We are looking for engineers to help build the post-training stack for frontier reasoning models. This role sits at the intersection of research and infrastructure. You will work to push the scale of our RL stack, whether it is novel recipe ideas, reliability, or performance. Some example areas you might work on (not limited to):

Design and build reliable infrastructure for large-scale RL training
Implement novel performance optimizations across the training stack
Develop evaluation and benchmarking infrastructure to measure model progress, throughput, and uptime
Build data collection and feedback pipelines that close the loop between human signal, reward modeling, and training
Collaborate with multiple teams to rapidly iterate on RL algorithms and get experiments into production training runs

If you're excited about building the infrastructure that makes frontier RL research possible at scale, we'd love to hear from you.

We offer a base salary of $350,000-$500,000 USD and a meaningful equity grant, depending on experience and background, along with competitive benefits.

* Ladders Estimates

Similar Jobs

Reliability Engineer, Supercomputing
$350K — $475K *
Thinking Machines Lab
San Francisco, CA 94112 (San Francisco County)
2 days ago
Member of Technical Staff, Agent Harness
$350K — $500K *
Mirendil
San Francisco, CA 94112 (San Francisco County)
4 days ago
Member of Technical Staff, Inference
$350K — $500K *
Mirendil
San Francisco, CA 94112 (San Francisco County)
4 days ago
Member of Technical Staff, Pretraining
$350K — $500K *
Mirendil
San Francisco, CA 94112 (San Francisco County)
4 days ago

Get Ready For Your
Next Interview

More Jobs at Mirendil

Member of Technical Staff, Product Development
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
3 days ago
Enterprise Technology
In-Person
Member of Technical Staff, Kernels
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Information Technology
In-Person
Member of Technical Staff, Agent Harness
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Enterprise Technology
In-Person
Member of Technical Staff, Enterprise Platform Engineer
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Enterprise Technology
In-Person
Member of Technical Staff, Security Engineer
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Information Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
1 week ago
ServiceNow Developer
$113K — $188K *
Guidehouse
Huntsville, AL 35810 (Madison County)
Reposted Today
Senior Full-Stack Security/GRC Platform Engineer
$86K — $129K *
Guidehouse
Remote
Today
Full Stack Developer
$80K — $133K *
Guidehouse
Remote
Today
Data Infrastructure Engineer
$98K — $163K *
Guidehouse
Remote
Reposted Today

Find similar Member of Technical Staff, Post-Training, RL Infra jobs:

Nationwide San Francisco, CA

Member of Technical Staff, Post-Training, RL Infra

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Member of Technical Staff, Post-Training, RL Infra jobs:

Get Ready For Your
Next Interview