Research Scientist

Anysphere, Inc

$100K — $150K *
Information Technology
Less than 5 years of experience

Job Overview by Ladders

Qualifications

  • Deep background in Reinforcement Learning (RL) and strong machine learning fundamentals
  • Strong programming and software engineering skills
  • Ability to navigate and resolve ambiguous research problems independently
  • Commitment to data quality, with a willingness to dig into the data directly when appropriate
  • A mindset focused on scientific exploration rather than merely validating personal hypotheses

Responsibilities

  • Deepen understanding of RL, including what it takes to handle longer-horizon tasks and to train with less compute
  • Train graders to improve performance on coding tasks with non-verifiable rewards
  • Develop high-quality and challenging data points for model training
  • Conduct real-time reinforcement learning experiments for coding agents

Benefits

  • Significantly more scope and autonomy than at other research labs
  • Opportunity to work at the cutting edge of coding technology
  • Chance to contribute to frontier coding agents trained with RL on real user data
  • Collaborative team environment with focus on innovative research

Full Job Description
Engineering • Full-time • San Francisco; New York

Research Scientist

Cursor is building the future of coding. We train frontier coding agents and scale RL on real user data to make them increasingly effective.

About the role

We're looking for Research Scientists who can drive effective RL or mid-training research in a small-team setting. You'll own ambiguous, hard research problems end-to-end: forming hypotheses, designing experiments, building the training/eval/data needed to test them, and pushing results into the next model. You should expect significantly more scope and autonomy than in other research labs.

What you'll do
  • Improve our understanding of RL, including what it takes to handle longer-horizon tasks and to train with less compute
  • Train graders to improve performance on coding tasks with non-verifiable reward
  • Improve the quality and difficulty of datapoints we use for training our models
  • Work on real-time RL for coding agents

You may be a fit if
  • You have a deep background in RL and strong machine learning fundamentals
  • You're an excellent programmer and software engineer
  • You can handle ambiguous research tasks with little guidance
  • You care a lot about data quality, and can dive into the data when appropriate
  • You are truth-seeking: you aim to learn more about the science rather than to prove your ideas correct
