Research Scientist - Agency and Reasoning

Zyphra Technologies Inc

$120K — $150K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Experience with reinforcement learning, in language model reasoning or classical RL tasks
  • Proficient in supervised fine-tuning of language models and preference-learning methods such as DPO and SimPO
  • Familiarity with context-length extension methods
  • Ability to intuitively understand and correct model behaviors through iterative fine-tuning
  • Experience in data engineering and synthetic data generation
  • Postgraduate degree in Computer Science, Engineering, Mathematics, or Physics
  • Published machine learning research in respected venues
  • Highly proficient in PyTorch and Python
  • Eager to learn new fields and implement innovative ideas
  • Strong communication skills for effective collaboration in research and engineering

Responsibilities

  • Conduct groundbreaking research in reinforcement learning and human preference learning
  • Execute research projects from concept through experimentation and publication
  • Develop and prototype new ideas quickly
  • Collaborate effectively with team members in a fast-paced environment
  • Engage with the details of data and data engineering, including data generation techniques

Benefits

  • Comprehensive medical, dental, vision, and FSA plans
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a culinary team
  • Inclusive team culture with Thursday Happy Hours
  • Collaborative, high-energy environment in San Francisco

Full Job Description

Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:

As a Research Scientist, you will be a core contributor to Zyphra's Agency and Reasoning Team. You will perform novel research in reinforcement learning, post-training, and human preference learning, and apply your ideas at scale to our next generation of language models.

What We're Looking For:
  • Strong research taste and intuition
  • The ability to work through a research project from conception to execution to write-up
  • Strong implementation and prototyping skills
  • A researcher who can take an idea from conception to experimentation extremely quickly
  • The ability to work well and cooperate with others in a fast-paced research setting
  • Curiosity, interest, and joy in understanding intelligence


Qualifications:
  • Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks
  • Experience with supervised fine-tuning of language models and preference-learning methods, such as DPO and SimPO
  • Experience with context-length extension methods
  • A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning
  • Interest in grappling with data in detail and spending significant time on data engineering and synthetic data generation
  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics)
  • Previously published machine learning research in well-respected venues
  • Highly proficient with PyTorch and Python
  • Excited and able to rapidly learn new fields and implement new ideas
  • Excellent communication and collaboration skills, with the ability to work effectively on both research and engineering implementation at scale


Why Work at Zyphra:
  • We strongly value new and crazy ideas and are very willing to bet big on them
  • We move as quickly as we can; we aim to keep the bar to impact as low as possible
  • We all enjoy what we do and love discussing AI


Benefits and Perks:
  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k)
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a dedicated culinary team; Thursday Happy Hours
  • In-person team in San Francisco, California with a collaborative, high-energy environment
