Robinhood

Senior Machine Learning Engineer, Agentic AI

Robinhood$209K — $245K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in production AI systems with LLMs or autonomous agents.
  • Expertise in modern agent architectures and multi-agent systems.
  • Proficiency in designing evaluation frameworks for agentic AI.
  • Strong debugging skills in complex agent interactions and reasoning.
  • Leadership experience in mentoring engineers and influencing technical direction.

Responsibilities

  • Lead the development of agentic AI systems for enhanced customer experiences.
  • Set technical benchmarks for evaluating autonomous agents' performance.
  • Create scalable evaluation frameworks using automated and human feedback methods.
  • Drive model selection and optimization for various AI systems.
  • Collaborate with cross-functional teams to set quality and success standards for agent deployments.
  • Enhance agent reliability by diagnosing and addressing production failures.
  • Mentor peers while establishing best practices for agentic AI implementations.

Benefits

  • 100% paid health insurance for employees; 90% for dependents.
  • Flexible benefits account for wellness and educational expenses.
  • Employer-paid life, disability insurance, and mental health support.
  • Generous time off including holidays, paid leave, and sick days.
  • Engaging office atmosphere with catered meals and team events.
Full Job Description
The Agentic AI team builds agentic AI systems that power intelligent, reliable customer experiences across Robinhood products. The team focuses on reducing the time to ship agents with fine-tuned models and while doing so enables other teams to build, evaluate, and improve their own agents. You will contribute to a culture grounded in first-principles thinking, high performance, and strong focus on customer outcomes! As a Senior Machine Learning Engineer (IC5), you will define and uphold the quality bar for agentic systems across the organization. You will design evaluation frameworks, guide model selection, and partner with product, data science, and engineering teams to ensure systems meet clear standards for correctness, safety, latency, and user satisfaction. Your work will shape how agentic systems are built, evaluated, and improved across Robinhood! At Robinhood, we believe in the power of in-person work to accelerate progress, spark innovation, and strengthen community. Our office experience is intentional, energizing, and designed to fully support high-performing teams. This role is based in our Bellevue, WA, New York, NY, or Menlo Park, CA office, with in-person attendance expected at least 3 days per week. What you'll do • Lead the design and evolution of agentic AI systems that power intelligent customer experiences across Robinhood. • Define the technical direction for evaluating autonomous agents, including reasoning quality, planning, tool selection, memory, task completion, safety, latency, and overall user experience. • Design and build scalable evaluation frameworks for agentic systems using automated evals, benchmark datasets, LLM-as-a-Judge techniques, and human feedback to continuously improve agent performance. • Drive model selection and optimization across frontier foundation models, fine-tuned models, retrieval systems, and tool-using agents, balancing quality, latency, cost, and reliability. • Partner closely with Product, Data Science, and Engineering to establish launch criteria, quality standards, and measurable success metrics for production agentic systems. • Improve agent reliability by investigating production failures, identifying root causes across reasoning, planning, retrieval, and tool execution, and driving architectural improvements. • Mentor engineers and influence technical direction across teams while helping establish best practices for building reliable, production-ready agentic AI systems. What you bring • Significant experience building and deploying production AI systems powered by large language models, autonomous agents, or multi-step reasoning workflows. • Deep understanding of modern agent architectures, including tool calling, planning, memory, retrieval-augmented generation (RAG), orchestration, and multi-agent systems. • Experience designing evaluation frameworks for agentic AI, including automated evals, benchmark datasets, LLM-as-a-Judge methodologies, human evaluation pipelines, and continuous quality measurement. • Strong understanding of the tradeoffs between prompting, fine-tuning, retrieval, and agent orchestration, and when to apply each approach. • Experience evaluating frontier foundation models across quality, latency, safety, cost, robustness, and production readiness. • Proven ability to debug complex agent behaviors, identify failure modes, and improve reasoning, reliability, and overall system performance. • Strong software engineering skills with experience building scalable distributed systems and production ML infrastructure. • Demonstrated technical leadership through architecture design, mentorship, and influencing engineering direction across multiple teams. • Experience with agent frameworks, AI observability platforms, model evaluation tooling, or regulated AI applications is a strong plus. What we offer • Challenging, high-impact work to grow your career • Performance driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching • Best in class benefits to fuel your work, including 100% paid health insurance for employees with 90% coverage for dependents • Lifestyle wallet - a highly flexible benefits spending account for wellness, learning, and more • Employer-paid life & disability insurance, fertility benefits, and mental health benefits • Time off to recharge including company holidays, paid time off, sick time, parental leave, and more! • Exceptional office experience with catered meals, events, and comfortable workspaces In addition to the base pay range listed below, this role is also eligible for bonus opportunities + equity + benefits. Base pay for the successful applicant will depend on a variety of job-related factors, which may include education, training, experience, location, business needs, or market demands. The expected base pay range for this role is based on the location where the work will be performed and is aligned to one of 3 compensation zones. For other locations not listed, compensation can be discussed with your recruiter during the interview process. Base Pay Range: Zone 1 (Menlo Park, CA; New York, NY; Bellevue, WA; Washington, DC) $209,000-$245,000 USD Zone 2 (Denver, CO; Westlake, TX; Chicago, IL) $184,000-$216,000 USD Zone 3 (Lake Mary, FL; Clearwater, FL; Gainesville, FL) $163,000-$191,000 USD Click here to learn more about our Total Rewards, which vary by region and entity. If our mission energizes you and you're ready to build the future of finance, we look forward to seeing your application.

About Robinhood

Robinhood is a financial services company that offers commission-free trading through its website and mobile app. The company was founded in 2013 by Vladimir Tenev and Baiju Bhatt, and is headquartered in Menlo Park, California. Robinhood's mission is to democratize finance for all by making investing accessible to everyone. The company has raised over $5 billion in funding and has over 13 million users. Robinhood has faced criticism for its business model, which relies on selling order flow to market makers, and for its handling of the GameStop trading frenzy in early 2021.
Learn more about Robinhood
Size
2,000 employees
Industry
Founded
2013

Similar Jobs

More Jobs at Robinhood

More Information Technology Jobs

Find similar Senior Machine Learning Engineer, Agentic AI jobs: