Research Engineer - Post training & RL

techire ai

San Francisco, CA 94112

In-Person

Information Technology

Less than 5 years of experience

1 month ago

Job Overview by Ladders

Qualifications

Research experience in post-training, reinforcement learning, or evaluation for LLMs.
Strong understanding of transformer models and experimental design.
Publication record at leading academic venues (NeurIPS, ICLR, ICML, ACL, EMNLP).
PhD or equivalent research experience in Computer Science, Machine Learning, Natural Language Processing, or Reinforcement Learning.

Responsibilities

Build reinforcement learning environments that challenge reasoning and planning abilities.
Create dynamic simulations that assess real intelligence beyond accuracy metrics.
Design novel post-training algorithms such as RLHF, DPO, and GRPO.
Develop advanced reward models that improve upon exact-match scoring.
Establish evaluation frameworks to enhance next-generation AI training and understanding.
Integrate deep research with implementation, from writing academic papers to deploying methods in active systems.

Benefits

Up to $300K base salary depending on experience.
Meaningful equity opportunities.
Comprehensive benefits package including 401k.
Unlimited paid time off (PTO).
Relocation assistance and sponsorship available.

Full Job Description

Job Description

Want to build the simulated worlds that test what frontier models are really capable of?

This is a chance to join a team advancing the science of post-training and scalable evaluation - building reinforcement learning environments that push reasoning, planning, and long-horizon behaviour to their limits.

Instead of static benchmarks, you'll create dynamic simulations that measure real intelligence - not just accuracy. You'll design new post-training algorithms (RLHF, DPO, GRPO and beyond), develop richer reward models that move past exact-match scoring, and build evaluation frameworks that define how next-generation AI is trained, aligned, and understood.

The work combines deep research with hands-on implementation - from writing papers to seeing your methods deployed in live systems. It's ideal for researchers who care about bridging academic insight and practical impact, helping AI progress beyond metrics that no longer tell the whole story.

You'll bring:

Research experience in post-training, reinforcement learning, or evaluation for LLMs.
Strong understanding of transformer models and experimental design.
Publication record at leading venues (NeurIPS, ICLR, ICML, ACL, EMNLP).
PhD or equivalent research experience in CS, ML, NLP, or RL.

Package: Up to $300K base (DOE) + meaningful equity + comprehensive benefits (401k, unlimited PTO, relocation and sponsorship available).
Location: On-site or hybrid in San Francisco.

If you want to shape how AI is trained, tested, and trusted - this is the place to do it.

* Ladders Estimates

Similar Jobs

Multimodal LLM Researcher
$300K — $400K *
DEEPREC.AI
Palo Alto, CA 94303 (Santa Clara County)
Today
Research Engineer - Evaluations
$120K — $160K *
Gem.com
San Francisco, CA 94112 (San Francisco County)
Today
Research Engineer - Evaluations
$120K — $150K *
Gem.com
Redwood City, CA 94061 (San Mateo County)
Today
RE/RS, Data Understanding (MM)
$120K — $180K *
OpenAI
San Francisco, CA 94112 (San Francisco County)
3 days ago
Principal AI Research Scientist Post-Training - Alignment - Reinforcement Learning Autodesk AI Lab: London - San Francisco - Toronto - Remote (US/CA/EU
$150K — $200K *
Autodesk, Inc
San Francisco, CA 94112 (San Francisco County)
5 days ago
AI Research Scientist - World Model
$165K — $185K *
Bosch Group
Sunnyvale, CA 94087 (Santa Clara County)
6 days ago

More Jobs at techire ai

Senior Research Scientist
$400K — $500K *
San Francisco, CA 94112 (San Francisco County)
1 month ago
Information Technology
In-Person
Engineer, Inference & Model serving
$220K — $320K *
San Francisco, CA 94112 (San Francisco County)
1 month ago
Technical Services
In-Person
Research Engineer - Post training & RL
San Francisco, CA 94112 (San Francisco County)
1 month ago
Information Technology
In-Person
Backend Engineer
$190K — $250K *
New York, NY 10025 (New York County)
1 month ago
Enterprise Technology
In-Person
ML Engineer
$250K — $400K *
San Francisco, CA 94112 (San Francisco County)
1 month ago
Enterprise Technology
In-Person

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Development - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Customer Support
Confidential Company
Austin, TX 78701 (Travis County)
2 weeks ago
Sr Assoc, Cyber Sec ThreatMgmt - Detection Engineer
$88K — $151K *
Northern Trust
Naperville, IL 60540 (Dupage County)
Today
Global Director – Vulnerability Management & Security Configuration
$164K — $288K *
Northern Trust
Chicago, IL 60629 (Cook County)
Today
