Member of Technical Staff, Post-Training, RL Environments

Mirendil

• $350K — $500K *

San Francisco, CA 94112In-Person

Information Technology

Less than 5 years of experience

4 days ago

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of experience in research engineering or a related field
Strong background in machine learning, particularly reinforcement learning
Proficiency in designing and building data pipelines
Experience with system optimization and infrastructure development
Familiarity with collaborative development environments and tools
Ability to work in a team-oriented and cross-functional setting

Responsibilities

Build and automate data collection pipelines for complex RL tasks
Develop systems to identify and prevent reward hacking
Create scalable and sandboxed execution environments for multi-agent tasks
Design evaluation systems for training environments' effects on model behavior
Collaborate across teams to enhance production model performance
Drive initiatives to continuously improve data and environment quality

Benefits

Meaningful equity grant based on experience
Comprehensive health, dental, and vision insurance
Generous vacation policies
Opportunities for professional development and training
Flexible work arrangements

Full Job Description

The Role

We are looking for a research engineer to build the data systems and execution environments that power reinforcement learning at Mirendil. The quality of our models depends directly on the quality of the data and environments we train on; you will own those systems end-to-end. Some example areas you might work on (not limited to):

Build and automate data collection pipelines for complex, long-horizon RL tasks.
Build robust systems to identify and prevent reward hacking.
Build scalable sandboxed execution environments for realistic tasks involving potentially multiple agents, nodes, and users.
Design systems to estimate the influence of training environments on production model behavior.
Collaborate with teams across the stack to identify potential axes of improvements in production model behavior, and develop training environments to push these axes.

If you're excited about building the data and environment infrastructure that determine what our models learn, we'd love to hear from you.

We offer a base salary of $350,000-$500,000 USD and a meaningful equity grant, depending on experience and background, along with competitive benefits.

* Ladders Estimates

Similar Jobs

Staff Machine Learning Engineer - Search
$192K — $357K *
Warner Bros. Entertainment Inc.
San Francisco, CA 94112 (San Francisco County)
1 week ago
Sr. Staff Machine Learning Engineer, Agentic Ads
$227K — $469K *
Pinterest
San Francisco, CA 94112 (San Francisco County)
2 weeks ago
Sr. Staff Machine Learning Engineer, Agentic Ads
$227K — $469K *
Pinterest
Remote
2 weeks ago
Staff Machine Learning Systems Engineer, Embeddings Platform
$253K — $354K *
Reddit
Remote
2 weeks ago
Senior Staff Machine Learning Engineer, Notifications
$266K — $372K *
Reddit
Remote
3 weeks ago
Staff Machine Learning Engineer
$189K — $389K *
Pinterest
Remote
3 weeks ago

Get Ready For Your
Next Interview

More Jobs at Mirendil

Member of Technical Staff, Product Development
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
3 days ago
Enterprise Technology
In-Person
Member of Technical Staff, Kernels
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Information Technology
In-Person
Member of Technical Staff, Agent Harness
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Enterprise Technology
In-Person
Member of Technical Staff, Enterprise Platform Engineer
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Enterprise Technology
In-Person
Member of Technical Staff, Security Engineer
$350K — $500K *
San Francisco, CA 94112 (San Francisco County)
4 days ago
Information Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
1 week ago
Full Stack Developer
$92K — $112K *
Fortinet
Burnaby, BC V3J 1A1
Today
Security Operations Analyst
$79K — $88K *
GrubHub
Remote
Today
IT Manager
$90K — $120K *
CareOne
Houston, TX 77043 (Harris County)
Today
IT Security Supervisor - Information Technology
$118K — $147K *
Fort Bend County
Richmond, TX 77469 (Fort Bend County)
Today

Find similar Member of Technical Staff, Post-Training, RL Environments jobs:

Nationwide San Francisco, CA

Member of Technical Staff, Post-Training, RL Environments

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Member of Technical Staff, Post-Training, RL Environments jobs:

Get Ready For Your
Next Interview