Reinforcement Learning Engineer

Code Metal

• $120K — $160K *

Boston, MA 02115In-Person

Information Technology

Less than 5 years of experience

More than 3 months ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

2+ years experience in distributed training with PyTorch
Strong background in reinforcement learning, specifically recent RLHF experience
Proven ability to build data curation and quality assurance pipelines
Experience in developing evaluation frameworks
Ideally, expertise in both data pipeline and orchestration methods
Eligibility for TS/SCI clearance

Responsibilities

Build and maintain robust distributed training systems using PyTorch
Design and implement scalable data curation and quality assurance pipelines
Develop orchestration tools for complex workflows in AI model training
Drive innovation by creating evaluation frameworks and RL solutions
Engage with frontier research through open-source contributions and potential publications

Benefits

100% premium coverage for health care including medical, dental, and vision
401k plan with 5% matching
Uncapped Paid Time Off, plus Sick and Public Holidays
Flexible hybrid work arrangement
Relocation assistance for qualifying employees

Full Job Description

At Code Metal AI, you'll be part of a world-class team with talent from MIT, OpenAI and other top companies, focused on pioneering work in large language models (LLMs) and code generation. Our projects directly involve leading chip manufacturers, applying advanced AI to solve meaningful, practical challenges with real-world impact.

This role bridges two critical areas:

Production

Build and maintain robust distributed training systems using PyTorch (2+ years experience required).
Design and implement scalable data curation and quality assurance pipelines to ensure top-tier training datasets.
Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.

Research

Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning with Human Feedback (RLHF).
Engage with frontier research through open-source projects and potential publications, applying RLHF to Large Language Models (LLMs), ideally focusing on code generation tasks.

Requirements

2+ years experience in distributed training, preferably with PyTorch.
Strong background in reinforcement learning, with recent RLHF experience highly preferred.
Proven ability to build data curation and quality assurance pipelines.
Experience with evaluation framework development.
Ideally, experience across both data pipeline and orchestration sides.
Eligible for TS/SCI clearance.

Nice to have:

Contributions to open-source AI or ML projects.
Published work or demonstrable research experience in related fields.
Hands-on experience applying RLHF to LLMs, especially for code generation.
Experience with large-scale synthetic data generation.

Benefits

Health care plan with 100% premium coverage, including medical, dental, and vision.
401k with 5% matching.
Paid Time Off (Uncapped Vacation, plus Sick & Public Holidays).
Flexible hybrid work arrangement.
Relocation assistance for qualifying employees.

* Ladders Estimates

Similar Jobs

Manager Financial Planning and Cost Analysis
$102K — $164K *
AHN Saint Vincent
New York, NY 10025 (New York County)
Today
Product Manager - Utilization Management
$86K — $138K *
AHN Saint Vincent
New York, NY 10025 (New York County)
Today
Senior AI Security Engineer
$94K — $151K *
AHN Saint Vincent
Remote
Today
Product Manager - Health Plan Capabilities
$102K — $164K *
AHN Saint Vincent
New York, NY 10025 (New York County)
Today
Manager Decision Support Analytics
$102K — $164K *
AHN Saint Vincent
New York, NY 10025 (New York County)
Today
Senior Pharmacist - Strategy
$118K — $196K *
AHN Saint Vincent
New York, NY 10025 (New York County)
Today

Get Ready For Your
Next Interview

More Jobs at Code Metal

IT Security Analyst
$90K — $120K *
Boston, MA 02115 (Suffolk County)
3 weeks ago
Information Technology
In-Person
IT Support Engineer
$75K — $95K *
Boston, MA 02115 (Suffolk County)
4 weeks ago
Information Technology
In-Person
Senior Recruiter
$90K — $120K *
Boston, MA 02115 (Suffolk County)
1 month ago
Staffing
In-Person
Forward Deployed Engineer
$130K — $180K *
San Francisco, CA 94112 (San Francisco County)
1 month ago
Technical Services
In-Person

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Customer Support
Confidential Company
Austin, TX 78701 (Travis County)
2 weeks ago
Sr Assoc, Cyber Sec ThreatMgmt - Detection Engineer
$88K — $151K *
Northern Trust
Naperville, IL 60540 (Dupage County)
Today
Global Director – Vulnerability Management & Security Configuration
$164K — $288K *
Northern Trust
Chicago, IL 60629 (Cook County)
Today

Find similar Reinforcement Learning Engineer jobs:

Nationwide Boston, MA

Reinforcement Learning Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Reinforcement Learning Engineer jobs:

Get Ready For Your
Next Interview