Principal Research Engineer, Post-Training

Character.ai

• $130K — $180K *

Redwood City, CA 94061In-Person

Information Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

PhD in Computer Science, Machine Learning, AI, or equivalent industry experience.
Experience leading technical projects or teams in machine learning, AI research, or distributed systems.
Deep knowledge of modern machine learning techniques, especially transformers and reinforcement learning.
Proven record of delivering impactful research or applied ML systems in production.
Expertise in designing and maintaining production-quality ML infrastructure.
Experience with training and optimizing large-scale models on GPU systems.
Strong software engineering skills for writing clean, maintainable code.

Responsibilities

Define and drive the technical roadmap for mid- and post-training systems.
Mentor and grow a team of researchers and engineers through technical guidance.
Lead the development of alignment algorithms and optimization techniques.
Drive advancements in mid- and post-training methodologies such as reinforcement learning.
Design efficient training and inference systems for large-scale generative models.
Architect scalable data pipelines for high-quality training datasets.
Collaborate with infrastructure teams to optimize distributed training and serving efficiency.

Benefits

Opportunity to directly influence AI entertainment systems.
Collaboration with cross-functional teams including researchers and product teams.
Opportunity for career development through mentorship roles.
Engagement in cutting-edge research and model development.
Potential to develop leadership skills in a dynamic environment.

Full Job Description

About the Role and Team As a Principal Research Engineer on the Post-Training team, you will drive the technical vision, execution, and evolution of the systems that transform foundation models into intelligent, engaging, and aligned products. Specifically, your team focuses on post-training of top-tier OSS LLMs (such as Mistral and Qwen) to power the highly immersive role-playing chat features of Character.AI.

You will lead initiatives spanning data, algorithms, infrastructure, and evaluation, helping define how our models learn from feedback and improve over time. This is a highly cross-functional role that combines deep technical expertise with organizational leadership. You will partner closely with researchers, engineers, product teams, and infrastructure teams to identify the highest-leverage opportunities for improving model performance and user experience. Your work will directly shape the conversational experiences of millions of users every day. At Character.AI, you will have the opportunity to influence both the direction of our research and the systems that bring it into production, helping build the next generation of AI entertainment.

What You'll Do

Technical Leadership & Mentorship: Define and drive the technical roadmap for mid- and post-training systems, balancing research innovation with production reliability and scalability. You will mentor and grow a team of researchers and engineers through technical guidance, design reviews, and career development. Establish best practices for experimentation, model development, and deployment.
Research & Model Development: Lead the development of alignment algorithms, optimization techniques, and training objectives to improve model capabilities and data efficiency. Drive advances in mid- and post-training methodologies including reinforcement learning, preference optimization, supervised fine-tuning, and emerging alignment approaches. Identify and execute high-impact research opportunities that improve model behavior, safety, and user engagement. Develop robust evaluation frameworks and quality signals to measure real-world model performance.
Systems & Infrastructure: Lead the design of efficient training and inference systems for large-scale generative models. Architect scalable data pipelines that transform diverse data sources into high-quality training datasets. Partner with infrastructure teams to optimize distributed training, GPU utilization, and serving efficiency. Drive improvements in experimentation platforms, data quality systems, and model observability.

Who You Are (Required Qualifications)

PhD in Computer Science, Machine Learning, AI, or a related field, or equivalent industry experience.
Significant experience leading technical projects or teams in machine learning, AI research, or large-scale distributed systems. Experience scaling and mentoring high-performing research and engineering teams.
Deep understanding of modern machine learning techniques, including transformers, reinforcement learning, alignment methods, and large language models.
Strong track record of delivering impactful research or applied ML systems in production environments.
Expertise in designing, building, and maintaining production-quality ML systems and infrastructure.
Experience training, serving, debugging, and optimizing large-scale models on GPU-based systems.
Experience leading teams working on large language model training, mid-training, or post-training.
Experience with product experimentation, online evaluation, and A/B testing frameworks.
Strong software engineering skills with the ability to write clean, maintainable, and scalable code.
Excellent communication skills and the ability to influence technical direction across teams. Lead complex, cross-functional initiatives across data, training infrastructure, evaluation, and model serving.

Nice to Have

Hands-on experience working directly with open-source models like Mistral and Qwen, particularly adapting them via mid- and post-training for specific personas, creative writing, or role-playing applications.
Familiarity with cloud-native ML infrastructure, including Kubernetes, Docker, and modern orchestration platforms.
Publications in leading machine learning conferences or demonstrated contributions to the broader AI community.

* Ladders Estimates

Similar Jobs

Distinguished Machine Learning Engineer, Customer Intelligence and Recommendation
$150K — $300K *
Geico
Palo Alto, CA 94303 (Santa Clara County)
Reposted Yesterday
Principal Machine Learning Engineer
$130K — $180K *
Acclaim
Remote
4 days ago
Principal AI Platform Engineer
$130K — $180K *
NextData
San Francisco, CA 94112 (San Francisco County)
4 days ago
Machine Learning Architect, Platform Architecture
$130K — $180K *
Apple
Cupertino, CA 95014 (Santa Clara County)
5 days ago
Senior/Principal Machine Learning Scientist, Perturbation Biology, AI Biology & Translation (AIBT)
$147K — $320K *
Roche
South San Francisco, CA 94080 (San Mateo County)
6 days ago
Principal Data Engineer, Data Platform
$170K — $195K *
A Place For Mom Inc
Remote
1 week ago

Get Ready For Your
Next Interview

More Jobs at Character.ai

Principal Research Engineer, Post-Training
$130K — $180K *
Redwood City, CA 94061 (San Mateo County)
Today
Information Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
6 days ago
Software Engineer III
$100K — $130K *
Bank of America Corporation
Jacksonville, FL 32210 (Duval County)
Today
Associate Director, Product Software Engineering
$159K — $284K *
Wolters Kluwer
Minneapolis, MN 55407 (Hennepin County)
Today
Software Engineer (Member of Technical Staff) - Contract Lifecycle Management
$100K — $130K *
Salesforce
Dallas, TX 75217 (Dallas County)
Reposted Today
Principal Research Engineer, Post-Training
$130K — $180K *
Character.ai
Redwood City, CA 94061 (San Mateo County)
Today

Find similar Principal Research Engineer, Post-Training jobs:

Nationwide Redwood City, CA

Principal Research Engineer, Post-Training

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Principal Research Engineer, Post-Training jobs:

Get Ready For Your
Next Interview