Senior Machine Learning Research Scientist - Frontier Lab

Carnegie Mellon University • $120K — $160K *

Arlington, VA 22204In-Person

Education, Government & Non-Profit

5 - 7 years of experience

Reposted 3 weeks ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

BS in Computer Science, Electrical Engineering, Statistics, or related field with 10 years of experience; OR MS with 8 years of experience; OR PhD with 5 years of experience.
Deep expertise in frontier AI areas such as agentic systems or LLM reliability.
Strong engineering skills for building and maintaining high-quality prototypes.
Excellent verbal and written communication skills for effective stakeholder engagement.
Proven track record of leading technical workstreams successfully.

Responsibilities

Execute projects within operational contexts to align with mission needs.
Lead technical execution, setting realistic milestones and ensuring delivery quality.
Design and conduct applied research and prototyping efforts.
Establish evaluation strategies for assessing ML models in mission scenarios.
Serve as the primary technical interface for communicating with stakeholders.
Mentor junior staff to elevate research and engineering standards.
Maintain awareness of frontier research developments to guide lab direction.

Benefits

Comprehensive medical, prescription, dental, and vision insurance.
Generous retirement savings program with employer contributions.
Tuition benefits for eligible employees.
Ample paid time off and observed holidays.
Access to a fitness center and a Family Concierge Team for childcare needs.

Full Job Description

Position Summary

As a Senior Machine Learning Research Scientist in the Frontier Lab, you will serve as a senior individual contributor and technical leader, shaping and executing applied research and prototype capability development for government and DoW missions. This role spans the research-engineering spectrum: some SR MLRS hires may lean more research-heavy and others more engineering-heavy, but successful candidates collaborate effectively across both.

You will operate with high autonomy, represent technical work with customers and stakeholders, and help guide Frontier Lab research direction—while remaining hands-on in development, evaluation, and delivery. Your work may span Frontier Lab focus areas such as:

Agentic AI for mission workflows (e.g., planning, analysis, decision support) where autonomous and human-guided agents interact with tools, data systems, and operators.

AI test, evaluation, verification, and validation (TEVV) to improve confidence in performance, robustness, uncertainty, and trustworthiness of ML-enabled systems.

Mission-tailored language models, including techniques to improve accuracy and reliability, reduce hallucinations, and integrate structured knowledge for operational tasks.

Mission modalities and multimodal learning, including sensor fusion and learning under noisy, sparse, or constrained data conditions (including synthetic data and weakly-/self-supervised approaches).

AI at the tactical edge, enabling capability under constrained compute/connectivity through efficient inference, compression, rapid adaptation, and update/redeploy patterns.

Key Responsibilities / Duties

Senior MLRS staff are expected to operate with a high degree of autonomy and technical ownership while remaining hands-on in development, evaluation, and delivery.

Mission-context execution: Execute work within the operational context—understanding users, workflows, constraints, success criteria, and outcomes—so technical decisions are grounded in real mission needs.

Technical leadership / Tech lead: Lead technical execution by defining technical tasking, sequencing work into realistic milestones, maintaining delivery quality, and delegating appropriately across the team.

Applied research and prototyping: Design and run studies, build convincing prototypes and reference implementations, and produce evidence-backed insights that can be matured and transitioned into operational settings.

Evaluation, assurance, and evidence: Establish credible evaluation strategies and test pipelines that assess performance, robustness, reliability, and trustworthiness in mission-representative scenarios.

Customer-facing technical ownership: Serve as the primary technical interface whenappropriate; translate mission goals into measurable technical outcomes; communicate progress, decisions, and risks clearly to stakeholders.

Mentorship and talent development: Proactively mentor junior staff and teammates, raising the bar for research rigor, engineering practice, and delivery habits across project teams.

State-of-the-artawareness and agenda shaping:Maintainstrong awareness of frontier developments aligned to the Frontier Lab, share insights with the lab, and help shape research directions and future work selection.

Self-direction and time management: Manage multiple priorities effectively, sustain steady execution cadence, and resolve blockers with minimal oversight.

Community building (internal and external): Build a strong research culture through internal talks, reading groups, and workshops; and engage with external AI/ML communities (professional societies, consortiums, working groups, and conferences) to strengthen collaboration pathways and keep the lab connected to emerging practice.

Requirements

Education / Experience

BSin Computer Science, Electrical Engineering, Statistics, or related field with 10 yearsof relevant experience; OR MSwith 8 yearsof relevant experience; OR PhDwith 5 yearsof relevant experience.

Deepexpertisein one or more Frontier Lab-aligned areas (agentic systems, LLM reliability/evaluation, CV evaluation, robustness/assurance, TEVV pipelines, multimodal learning, edge ML).

Strong engineering capability– can build andmaintainhigh-quality prototypes, evaluation infrastructure, and repeatable experimentation workflows.

Strong written and verbal communication skills; able torepresenttechnical work credibly to senior stakeholders.

Demonstrated ability to lead technical workstreams and coordinate multi-person execution.

Knowledge, Skills, & Abilities (KSAs)

Technical judgment: Makes sound architectural and methodological decisions; balances ambition with mission constraints.

Customer translation: Converts mission needs into tractable technical plans, measurable success criteria, and credible evaluation evidence.

Scientific leadership: Maintains rigor; identifies flawed assumptions; improves evaluation quality and research practices.

Mentorship & influence: Elevates team performance through hands-on guidance and strong technical standards.

Initiative:Proactively identifiesrisks/opportunities, proposes new work, and creates alignment without directive management.

Self-direction and time management: Plans work effectively under ambiguity,maintainsexecution cadence, and escalates risks early.

Desired Experience

Leading applied research projects resulting ineffectiveprototypes, mission-relevant evaluation outcomes, or transitioned methods.

Publications at strong venues (e.g.,NeurIPS/ ICLR / ICML, relevant workshops, MLCON), and/or demonstrable impact through applied research artifacts (benchmarks, evaluation suites, open-source, technical reports).

Designing and operating TEVV efforts including evaluation pipelines, robustness analysis, calibration/uncertainty work, regression suites, and scenario-based evaluation protocols.

Building agentic capabilities integrated with tools, data systems, and human workflows (decision support, planning, analytic contexts).

Experience with secure or operational environments and delivery constraints typical of government settings.

Experience shaping a technical roadmap or research portfolio aligned to sponsor priorities and lab strategy.

Other Requirements

Flexible to travel to SEI offices inPittsburgh, PAand Washington, DC / Arlington, VA, sponsor sites, conferences, and offsite meetings (~10% travel).
You must be able and willing to work onsite at an SEI office in Pittsburgh, PA or Arlington, VA 5 days per week.

You will be subject to a background investigation and must be eligible to obtain andmaintaina Department of War)security clearance.

About Carnegie Mellon University

Carnegie Mellon University is a private research university that was founded in 1900. The university is located in Pittsburgh, Pennsylvania and is known for its programs in computer science, engineering, and the arts. Carnegie Mellon has a diverse student body and offers undergraduate and graduate programs in a variety of fields. The university has a strong focus on research and has partnerships with a number of companies and organizations. Carnegie Mellon is consistently ranked among the top universities in the United States.

Learn more about Carnegie Mellon University

Size

14,000 employees

Industry

Education, Government & Non-Profit

* Ladders Estimates

Similar Jobs

Senior Engineer, Systematic Research Technology
$130K — $180K *
Balyasny Asset Management L.P.
New York, NY 10025 (New York County)
Yesterday
Senior AI Research Scientist (NLP)
$120K — $160K *
hireVouch
Remote
Yesterday
Research Scientist, Foundational Data Science
$120K — $150K *
Prior Labs
New York, NY 10025 (New York County)
Yesterday
AI Research Scientist, CoreML - Monetization
$130K — $180K *
Meta
New York, NY 10025 (New York County)
1 week ago
Research Scientist - Reality Labs
$130K — $180K *
Meta
New York, NY 10025 (New York County)
1 week ago
Head of Research
$130K — $180K *
Neon Mobile, Inc
Remote
1 week ago

Get Ready For Your
Next Interview

More Jobs at Carnegie Mellon University

Low Voltage Power Distribution Electrical Engineer
$88K — $154K *
Pasadena, CA 91104 (Los Angeles County)
Reposted Today
Technical Services
In-Person
Low Voltage Power Distribution Electrical Engineer
$88K — $154K *
San Antonio, TX 78228 (Bexar County)
Reposted Today
Technical Services
In-Person
Technology Sector Pricing Manager
$99K — $232K *
Chicago, IL 60629 (Cook County)
Reposted Today
Enterprise Technology
Hybrid
Manager, GWAM Risk Center
$88K — $138K *
Waterloo, ON N2J 1A1
Reposted Today
Finance & Insurance
In-Person
Manager, GWAM Risk Center
$88K — $138K *
Montreal, QC H1A 0A1
Reposted Today
Finance & Insurance
In-Person

More Education, Government & Non-Profit Jobs

Director of Facilities Operations
$80K — $150K *
The Vernon Staffing Group
Vincennes, IN 47591 (Knox County)
2 days ago
Director of Somatic Medical Services
$150K — $250K *
State of Maryland
Jessup, MD 20794 (Howard County)
4 days ago
Controller
Maryland State Education Association
Annapolis, MD 21401 (Anne Arundel County)
5 days ago
Account Executive, Government
$78K — $140K *
T-Mobile
Remote
Reposted Today
All Source Analyst
$62K — $141K *
TeleTech
Washington, DC 20011 (District Of Columbia County)
Today

Find similar Senior Machine Learning Research Scientist - Frontier Lab jobs:

Nationwide Arlington, VA

Senior Machine Learning Research Scientist - Frontier Lab

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Machine Learning Research Scientist - Frontier Lab jobs:

Get Ready For Your
Next Interview