Advanced Micro Devices, Inc

Research Scientist, Reinforcement Learning (LLM) and Post-training

Advanced Micro Devices, Inc$130K — $180K *
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • PhD in Computer Science, Machine Learning, or a related field preferred.
  • Strong publication record in reinforcement learning or related areas.
  • Hands-on experience training RL or preference-optimized models at scale.
  • Experience with LLM post-training and RLHF/RLAIF methods.
  • Familiarity with compilers, kernels, EDA workflows, or large codebases is a plus.

Responsibilities

  • Research and develop reinforcement learning methods for post-training LLMs and code models.
  • Design reward models, curricula, and training recipes for sparse or expensive labels.
  • Characterize failure modes and propose mitigations grounded in experiments.
  • Collaborate with infra engineers to scale training processes.
  • Publish research at top venues and provide internal leadership on the RL roadmap.

Benefits

  • Comprehensive health and wellness plans.
  • Paid time off and flexible working hours.
  • Opportunities for professional development and training.
  • Collaborative and innovative work environment.
Full Job Description
THE ROLE:

We are hiring a Research Scientist, Reinforcement Learning (LLM) and Post-Training, specializing in reinforcement learning to advance post-training and interactive learning for large generative models applied to demanding engineering and hardware-adjacent tasks (code, optimization, tool use, and long-horizon decision making). You will invent and analyze RL algorithms-policy optimization, preference-based methods, exploration, credit assignment, and reward modeling-run rigorous empirical studies, and partner with infra and product teams to land methods that improve measurable task success without sacrificing stability or safety.

THE PERSON:

You publish and ship. You are fluent in both RL theory and the practical path from ablation to production-scale training. You care about reward misspecification, variance reduction, and evaluation that reflects real constraints-not only toy environments.

KEY RESPONSIBILITIES:
  • Research and develop RL methods for post-training LLMs and code models on structured engineering tasks with verifiable or preference-based feedback
  • Design reward models, curricula, and off-policy or on-policy training recipes suited to sparse, noisy, or expensive labels from experts and simulators
  • Characterize failure modes (reward hacking, degenerate policies, instability) and propose mitigations grounded in experiments
  • Collaborate with RL infra engineers to scale training; define interfaces for rollout generation, logging, and reproducibility
  • Publish at top venues (e.g. NeurIPS, ICML, ICLR) and contribute internal technical leadership on the RL roadmap

PREFERRED EXPERIENCE:
  • Strong publication record in reinforcement learning or closely related machine learning areas.
  • Hands-on experience training RL or preference-optimized models at non-trivial scale (GPUs, distributed jobs)
  • Experience with LLM post-training, RLHF/RLAIF, or policy optimization for language or code agents
  • Familiarity with compilers, kernels, EDA-style workflows, or large-scale codebases is a plus

ACADEMIC CREDENTIALS:
  • PhD in Computer Science, Machine Learning, or related field strongly preferred.


#LI-BM1

#LI-Hybrid

Benefits offered are described: AMD benefits at a glance.

About Advanced Micro Devices, Inc

Advanced Micro Devices, Inc. Careers

Join the innovative forefront of technology with a career at Advanced Micro Devices, Inc. (AMD), a leader in semiconductor development. As part of our global team, you will contribute to an organization renowned for its dedication to innovation, leadership, and diversity in the tech industry.

Work You’ll Do

At AMD, we offer job opportunities that push the boundaries of what is possible. Our team is composed of professionals who lead the way in microprocessor and graphics technology, driving industry standards and innovation. With AMD, you will be part of a culture that values growth and professional development, ensuring that every team member has the opportunity to excel.

Transform Your Career

AMD is not just about advancing technology, but also about advancing careers. Whether you are looking for an internship, a full-time position, or leadership roles, AMD provides the platform to propel your career to new heights. Our commitment to professional growth is matched by our dedication to diversity and inclusion, making AMD a place where everyone can thrive.

Innovative Work Environment

Join a team of over 12,000 dedicated professionals at the intersection of technology, industry expertise, and digital innovation. At AMD, you will work on groundbreaking projects that shape the future of computing and graphics. Our collaborative environment encourages networking and the sharing of ideas across teams and disciplines.

Career Development and Benefits

AMD is committed to the development of its employees. We offer robust training programs, including leadership development and diversity training, to ensure our team is equipped for both current challenges and future opportunities. Our benefits package is designed to support the well-being and financial security of our employees and their families.

Explore Job Opportunities

From engineering to marketing, AMD offers a range of career paths that cater to diverse skills and interests. Our hiring process is designed to be transparent and engaging, helping you to understand where you fit within our team and how you can contribute to our collective goals.

Stay Connected

Join Our Team Search open positions that match your skills and interest. We look for passionate, curious, creative, and solution-driven team players. Explore the opportunities to join a company that’s committed to your career growth and to innovation in the technology sector.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who work here.

Job Alert Emails

Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding career opportunities that await at Advanced Micro Devices, Inc.

Interview and Resume Tips

Prepare for your future with AMD by accessing resources that help you craft your resume and excel in interviews. Our goal is to help you showcase your best professional self and align your skills with the needs of our dynamic team. At Advanced Micro Devices, Inc., we empower our employees to innovate, lead, and grow. Join us in driving the future of technology while building a rewarding and sustainable career.
Learn more about Advanced Micro Devices, Inc
Size
15,500 employees
Market Cap
$100.9 billion
Industry
Net Income
$2.4 billion
Founded
1969
5 Year Trend
+30.9%
Revenue
$9.7 billion
NASDAQ

Similar Jobs

More Jobs at Advanced Micro Devices, Inc

More Consumer Technology Jobs

Find similar Research Scientist, Reinforcement Learning (LLM) and Post-training jobs: