Research Engineer - Post-training & RL

techire ai

Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Research experience in post-training, reinforcement learning, or evaluation for LLMs.
  • Strong understanding of transformer models and experimental design.
  • Publication record at leading academic venues (NeurIPS, ICLR, ICML, ACL, EMNLP).
  • PhD or equivalent research experience in Computer Science, Machine Learning, Natural Language Processing, or Reinforcement Learning.

Responsibilities

  • Build reinforcement learning environments that challenge reasoning and planning abilities.
  • Create dynamic simulations that assess real intelligence beyond accuracy metrics.
  • Design new post-training algorithms (RLHF, DPO, GRPO, and beyond).
  • Develop advanced reward models that improve upon exact-match scoring.
  • Build evaluation frameworks that define how next-generation AI is trained, aligned, and understood.
  • Integrate deep research with implementation, from writing academic papers to deploying methods in active systems.

Benefits

  • Up to $300K base salary depending on experience.
  • Meaningful equity opportunities.
  • Comprehensive benefits package including 401k.
  • Unlimited paid time off (PTO).
  • Relocation assistance and sponsorship available.

Full Job Description

Want to build the simulated worlds that test what frontier models are really capable of?

This is a chance to join a team advancing the science of post-training and scalable evaluation by building reinforcement learning environments that push reasoning, planning, and long-horizon behaviour to their limits.

Instead of static benchmarks, you'll create dynamic simulations that measure real intelligence, not just accuracy. You'll design new post-training algorithms (RLHF, DPO, GRPO, and beyond), develop richer reward models that move past exact-match scoring, and build evaluation frameworks that define how next-generation AI is trained, aligned, and understood.

The work combines deep research with hands-on implementation, from writing papers to seeing your methods deployed in live systems. It's ideal for researchers who care about bridging academic insight and practical impact, helping AI progress beyond metrics that no longer tell the whole story.

You'll bring:
  • Research experience in post-training, reinforcement learning, or evaluation for LLMs.
  • Strong understanding of transformer models and experimental design.
  • Publication record at leading venues (NeurIPS, ICLR, ICML, ACL, EMNLP).
  • PhD or equivalent research experience in CS, ML, NLP, or RL.

Package: Up to $300K base (DOE) + meaningful equity + comprehensive benefits (401k, unlimited PTO, relocation and sponsorship available).
Location: On-site or hybrid in San Francisco.

If you want to shape how AI is trained, tested, and trusted, this is the place to do it.
