Senior Machine Learning Engineer, Agentic AI

Robinhood • $209K — $245K *

Menlo Park, CA 94025 • In-Person

Information Technology

5 - 7 years of experience

Today

Job Overview by Ladders

Qualifications

5+ years of experience in production AI systems with LLMs or autonomous agents.
Expertise in modern agent architectures and multi-agent systems.
Proficiency in designing evaluation frameworks for agentic AI.
Strong debugging skills in complex agent interactions and reasoning.
Leadership experience in mentoring engineers and influencing technical direction.

Responsibilities

Lead the development of agentic AI systems for enhanced customer experiences.
Set technical benchmarks for evaluating autonomous agents' performance.
Create scalable evaluation frameworks using automated and human feedback methods.
Drive model selection and optimization for various AI systems.
Collaborate with cross-functional teams to set quality and success standards for agent deployments.
Enhance agent reliability by diagnosing and addressing production failures.
Mentor peers while establishing best practices for agentic AI implementations.

Benefits

100% paid health insurance for employees; 90% for dependents.
Flexible benefits account for wellness and educational expenses.
Employer-paid life, disability insurance, and mental health support.
Generous time off including holidays, paid leave, and sick days.
Engaging office atmosphere with catered meals and team events.

Full Job Description

The Agentic AI team builds agentic AI systems that power intelligent, reliable customer experiences across Robinhood products. The team focuses on reducing the time to ship agents with fine-tuned models and, in doing so, enables other teams to build, evaluate, and improve their own agents. You will contribute to a culture grounded in first-principles thinking, high performance, and a strong focus on customer outcomes!

As a Senior Machine Learning Engineer (IC5), you will define and uphold the quality bar for agentic systems across the organization. You will design evaluation frameworks, guide model selection, and partner with product, data science, and engineering teams to ensure systems meet clear standards for correctness, safety, latency, and user satisfaction. Your work will shape how agentic systems are built, evaluated, and improved across Robinhood!

At Robinhood, we believe in the power of in-person work to accelerate progress, spark innovation, and strengthen community. Our office experience is intentional, energizing, and designed to fully support high-performing teams. This role is based in our Bellevue, WA; New York, NY; or Menlo Park, CA office, with in-person attendance expected at least 3 days per week.

What you'll do

• Lead the design and evolution of agentic AI systems that power intelligent customer experiences across Robinhood.
• Define the technical direction for evaluating autonomous agents, including reasoning quality, planning, tool selection, memory, task completion, safety, latency, and overall user experience.
• Design and build scalable evaluation frameworks for agentic systems using automated evals, benchmark datasets, LLM-as-a-Judge techniques, and human feedback to continuously improve agent performance.
• Drive model selection and optimization across frontier foundation models, fine-tuned models, retrieval systems, and tool-using agents, balancing quality, latency, cost, and reliability.
• Partner closely with Product, Data Science, and Engineering to establish launch criteria, quality standards, and measurable success metrics for production agentic systems.
• Improve agent reliability by investigating production failures, identifying root causes across reasoning, planning, retrieval, and tool execution, and driving architectural improvements.
• Mentor engineers and influence technical direction across teams while helping establish best practices for building reliable, production-ready agentic AI systems.

What you bring

• Significant experience building and deploying production AI systems powered by large language models, autonomous agents, or multi-step reasoning workflows.
• Deep understanding of modern agent architectures, including tool calling, planning, memory, retrieval-augmented generation (RAG), orchestration, and multi-agent systems.
• Experience designing evaluation frameworks for agentic AI, including automated evals, benchmark datasets, LLM-as-a-Judge methodologies, human evaluation pipelines, and continuous quality measurement.
• Strong understanding of the tradeoffs between prompting, fine-tuning, retrieval, and agent orchestration, and when to apply each approach.
• Experience evaluating frontier foundation models across quality, latency, safety, cost, robustness, and production readiness.
• Proven ability to debug complex agent behaviors, identify failure modes, and improve reasoning, reliability, and overall system performance.
• Strong software engineering skills with experience building scalable distributed systems and production ML infrastructure.
• Demonstrated technical leadership through architecture design, mentorship, and influencing engineering direction across multiple teams.
• Experience with agent frameworks, AI observability platforms, model evaluation tooling, or regulated AI applications is a strong plus.

What we offer

• Challenging, high-impact work to grow your career
• Performance-driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching
• Best-in-class benefits to fuel your work, including 100% paid health insurance for employees with 90% coverage for dependents
• Lifestyle wallet: a highly flexible benefits spending account for wellness, learning, and more
• Employer-paid life & disability insurance, fertility benefits, and mental health benefits
• Time off to recharge, including company holidays, paid time off, sick time, parental leave, and more!
• Exceptional office experience with catered meals, events, and comfortable workspaces

In addition to the base pay range listed below, this role is also eligible for bonus opportunities + equity + benefits. Base pay for the successful applicant will depend on a variety of job-related factors, which may include education, training, experience, location, business needs, or market demands. The expected base pay range for this role is based on the location where the work will be performed and is aligned to one of 3 compensation zones. For other locations not listed, compensation can be discussed with your recruiter during the interview process.

Base Pay Range

• Zone 1 (Menlo Park, CA; New York, NY; Bellevue, WA; Washington, DC): $209,000-$245,000 USD
• Zone 2 (Denver, CO; Westlake, TX; Chicago, IL): $184,000-$216,000 USD
• Zone 3 (Lake Mary, FL; Clearwater, FL; Gainesville, FL): $163,000-$191,000 USD

Total Rewards vary by region and entity.

If our mission energizes you and you're ready to build the future of finance, we look forward to seeing your application.

About Robinhood

Robinhood is a financial services company that offers commission-free trading through its website and mobile app. The company was founded in 2013 by Vladimir Tenev and Baiju Bhatt, and is headquartered in Menlo Park, California. Robinhood's mission is to democratize finance for all by making investing accessible to everyone. The company has raised over $5 billion in funding and has over 13 million users. Robinhood has faced criticism for its business model, which relies on selling order flow to market makers, and for its handling of the GameStop trading frenzy in early 2021.

Size: 2,000 employees
Industry: Finance & Insurance
Founded: 2013

* Ladders Estimates
