Full Job Description
We are looking for a Senior Product Manager, Technical to define and drive the product vision for AI Agent Evaluations within our Core Services AI Foundations team. You will own the end-to-end evaluation framework that enables application development teams to measure, benchmark, and continuously improve the quality, safety, and reliability of their AI-powered agents. This includes defining the product strategy for evaluation tooling, quality scoring methodologies, regression testing frameworks, and human-in-the-loop review workflows that give builders confidence their agents perform as intended before and after deployment. You will work backward from the needs of development teams in Applied AI Solutions and AWS building agentic AI applications and define how they assess agent behavior across dimensions including correctness, safety, groundedness, and customer satisfaction. You will partner closely with engineering and applied science teams to translate complex evaluation methodologies into intuitive, self-service products that scale across diverse use cases and agent architectures.
The Core Services AI Foundations team within AWS Applied AI Solutions builds the foundational platform that enables application development teams to ship production-grade AI agents and applications with confidence. We provide the shared infrastructure, tooling, and guardrails that handle the hardest cross-cutting concerns in agentic AI: evaluation, identity and access management, observability and analytics, data and knowledge management, foundational agents, and user experience.
Key job responsibilities
Own the product vision, strategy, and roadmap for AI agent evaluation capabilities, including automated evaluation pipelines, human review workflows, and quality benchmarking tools
Write detailed product requirements that articulate evaluation framework APIs, scoring algorithms, and integration patterns
Partner with applied scientists and engineers to translate research into production-grade product features
Drive customer discovery to generalize evaluation requirements from diverse agent architectures, domains, and deployment patterns
Define success metrics and instrumentation strategies that make evaluation product adoption, coverage, and impact visible to stakeholders
Own go-to-market strategy, including positioning evaluation tooling relative to existing AWS services and third-party alternatives
Drive alignment across dependent teams (identity, observability, data platform) to ensure evaluation signals integrate seamlessly with the broader Core Services AI Foundations platform
A day in the life
You split your time between customer discovery, technical design, and cross-team alignment. On any given day you might be meeting with an application development team to understand how they assess agent quality, writing the narrative for a new evaluation capability, reviewing a technical design document with engineers, probing assumptions about scale, and defining acceptance criteria. You operate independently, make trade-offs across competing priorities, and own outcomes from requirements through launch and post-launch performance.
BASIC QUALIFICATIONS
- Bachelor's degree
- Experience owning/driving roadmap strategy and definition
- Experience with feature delivery and tradeoffs of a product
- Experience contributing to engineering discussions around technology decisions and strategy related to a product
- Experience managing technical products or online services
- Experience in representing and advocating for a variety of critical customers and stakeholders during executive-level prioritization and planning
PREFERRED QUALIFICATIONS
- Experience in using analytical tools, such as Tableau, Qlikview, QuickSight
- Experience in building and driving adoption of new tools
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.
USA, WA, Seattle - 151,200.00 - 204,600.00 USD annually