Senior Staff Machine Learning Engineer, AI Agent Platform

Geico • $130K — $300K *

Palo Alto, CA 94303Hybrid

Information Technology

8 - 10 years of experience

1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

8+ years of software development experience in at least two programming languages (Java, C++, Python, Go, C#)
6+ years experience designing AI/ML platforms with open-source/cloud-agnostic tools
5+ years managing end-to-end software development life cycles
4+ years in building training and inferencing systems for large language models (LLMs)
3+ years in production environments dealing with multi-agent AI systems
Strong grasp of context engineering for managing resources in LLMs
Proven track record in technical leadership and mentoring.

Responsibilities

Define the long-term technical strategy for the AI agent platform
Architect an enterprise skill ecosystem and manage the skills marketplace
Lead the design of reliable AI agent harnesses for long-running workflows
Oversee high-performance platform components for agentic workflows
Establish governance frameworks for AI agent autonomy and security
Collaborate with cross-functional teams and mentor engineers
Promote best practices in AI engineering throughout the organization.

Benefits

Support for further education and professional development
Flexible work arrangements
Opportunities for career advancement within a leading insurance company
Participation in an innovative and impactful AI project
Access to cutting-edge technology and resources.

Full Job Description

Sr. Staff Machine Learning Engineer – AI Agent Platform Position Description GEICO is seeking an exceptional Sr. Staff ML Engineer to join our AI organization. You will serve as a technical leader and key architect for GEICO's virtual assistant platform that elevates productivity for 30K+ internal associates and the customer experience for millions of policyholders. Sr. Staff AI Agent Platform Engineers set the technical vision and drive the architecture of multi-tenant services that power the building, testing, deployment, and hosting of LLM-based AI agents. This includes multi-agent orchestration, standardized interoperability protocols (MCP, A2A), AI agent skill ecosystems with marketplace and governance capabilities, production-grade harness & context engineering, and guardrail frameworks for safe autonomous operation at enterprise scale. Responsibilities

Technical Vision & Architecture: Define the long-term technical strategy for GEICO's AI agent platform — including multi-agent orchestration, AI agent lifecycle management, evaluation frameworks, skill registries and marketplace, and workflow orchestration.
AI Agent Skills & Marketplace: Architect an enterprise skill ecosystem — reusable capability packages that encode domain expertise and workflows into portable, discoverable modules. Build and govern an internal skill marketplace with versioning, security vetting, approval workflows, progressive disclosure loading, and usage analytics.
Harness & Context Engineering: Lead design of production-grade AI agent harnesses (tool dispatch, context management, error recovery, session state, fine-grained Authn/AuthZ) that makes AI agents reliable for long-running workflows. Apply feedforward guides (linters, architecture constraints, spec-driven validation) and feedback sensors (test execution, LLM-as-judge) mixing computational and inferential controls. Design context engineering systems that treat the LLM context window as a managed resource — memory hierarchies, RAG pipelines, context compaction, scratchpads, and dynamic skill/tool loading.
Platform & Interoperability: Own high-performance platform components powering end-to-end agentic workflows: MCP server/registry management, A2A communication infrastructure, prompt management, workflow orchestration, guardrail enforcement, and observability pipelines.
AI Safety & Governance: Establish AI agent governance frameworks including bounded autonomy, human-in-the-loop escalation, audit trails, prompt guardrails, and RBAC/ABAC access controls. Extend governance to skill-level security — vetting published skills for hidden payloads, injection vectors, and data exfiltration risks.
Leadership: Collaborate cross-functionally with data scientists, engineers, product managers, and designers. Mentor engineers at all levels. Elevate AI engineering best practices — including harness engineering patterns and agentic coding tools — across the company.

Basic Qualifications

8+ years of professional software development experience with at least two languages (Java, C++, Python, Go, or C#).
6+ years designing and building AI/ML platforms using open-source/cloud-agnostic components (Elasticsearch, Qdrant, Kafka, PostgreSQL, MongoDB, Spark, Ray, Temporal, Redis, Neo4j, etc.).
5+ years managing end-to-end SDLCs (CI/CD, Kubernetes, testing, monitoring, production support).
4+ years building training, fine-tuning, and inferencing systems for LLMs, especially on GPU infrastructure.
3+ years designing and operating multi-agent or agentic AI systems in production.
Strong understanding of context engineering — memory architectures, RAG, context compaction, and dynamic information management for LLMs.
Demonstrated track record leading technical initiatives, setting architectural direction, and mentoring across teams.
Bachelor's degree in CS, Engineering, or related field; advanced degree highly desirable.

Preferred Qualifications

6+ years with cloud providers (Azure, AWS), including container orchestration and GPU compute.
3+ years building agentic workflows with open-source and proprietary LLMs (Llama, Qwen, Claude, Gpt, etc.).
Hands-on experience with MCP and A2A protocols — MCP server development, AI agent card discovery, task delegation patterns.
Experience with harness engineering. (tool dispatch, error recovery, session state, sub-agent coordination, planning & reasoning)
Experience designing AI agent skill systems: building and governing reusable skill packages, skill marketplaces with discovery, versioning, security vetting, and progressive disclosure.
Experience with context engineering at scale: memory hierarchies, RAG optimization, compaction/summarization, state isolation, etc.
Experience with multi-agent orchestration frameworks (LangGraph, AutoGen, CrewAI).
Experience with LLM observability & evaluation platforms (LangSmith, Arize Phoenix, Langfuse).
Experience building guardrail systems (prompt injection defense, PII detection, skill-level security auditing).
Understanding of AI safety, model governance, and regulatory compliance in regulated industries.

If you are passionate about pushing the boundaries of generative AI platforms, thrive in a hands-on technical leadership role, and enjoy solving complex, large-scale problems, we encourage you to apply! Annual Salary $130,000.00 - $300,000.00

The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate/ annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role, the selected candidate’s work experience, education and training, the work location as well as market and business considerations.

GEICO will consider sponsoring a new qualified applicant for employment authorization for this position.

About Geico

GEICO (Government Employees Insurance Company) is an American auto insurance company with headquarters in Chevy Chase, Maryland. It is the second largest auto insurer in the United States, after State Farm. GEICO is a wholly owned subsidiary of Berkshire Hathaway that provides coverage for more than 24 million motor vehicles owned by more than 15 million policy holders as of 2017. GEICO writes private passenger automobile insurance in all 50 U.S. states and the District of Columbia. The insurance agency sells policies through local agents, called GEICO Field Representatives, and over the phone directly to the consumer, and through their website.

Learn more about Geico

Size

40,000 employees

Industry

Finance & Insurance

Founded

1936

* Ladders Estimates

Similar Jobs

Software Development Engineer, Quality Platform & AI Test Automation
$156K — $316K *
TikTok
San Jose, CA 95123 (Santa Clara County)
Today
AI Software Engineer, Full Stack, NotebookLM
$147K — $211K *
Google
Mountain View, CA 94040 (Santa Clara County)
Today
LLM AIOps Development Engineer - Data Center Networking
$150K — $387K *
TikTok
San Jose, CA 95123 (Santa Clara County)
Reposted Today
AI Software Engineer III
$87K — $202K *
LightSpeed Retail
Palo Alto, CA 94303 (Santa Clara County)
Today
Software Engineer in Natural Language Processing (NLP) and Machine Learning (ML)
$130K — $180K *
Apple
Cupertino, CA 95014 (Santa Clara County)
Today
AI Engineer - Remote
$100K — $150K *
Huzzle
Remote
Reposted Today

Get Ready For Your
Next Interview

More Jobs at Geico

Insurance Products - Senior Counsel
$148K — $260K *
Beachwood, OH 44122 (Cuyahoga County)
Reposted Today
Legal & Accounting
Hybrid
Senior Software Engineer (Full Stack Java/React) - Commissions Platform - HYBRID
$100K — $215K *
Richardson, TX 75080 (Dallas County)
Reposted Today
Finance & Insurance
Hybrid
Senior Software Engineer (Full Stack Java/React) - Commissions Platform - HYBRID
$100K — $215K *
Bethesda, MD 20817 (Montgomery County)
Reposted Today
Finance & Insurance
Hybrid
Assistant Insurance Product Manager
$97K — $151K *
Remote
Reposted Today
Finance & Insurance
In-Person
Assistant Insurance Product Manager
$97K — $151K *
Jacksonville, FL 32210 (Duval County)
Reposted Today
Finance & Insurance
In-Person

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
2 days ago
Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
1 week ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Backend Java Engineer - Post Trade Accounting, Associate
$132K — $162K *
BlackRock, Inc
New York, NY 10025 (New York County)
Today
Site Reliability Engineer
$100K — $125K *
Broadridge
Toronto, ON M3C 0E3
Today

Find similar Senior Staff Machine Learning Engineer, AI Agent Platform jobs:

Nationwide Palo Alto, CA

Senior Staff Machine Learning Engineer, AI Agent Platform

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Staff Machine Learning Engineer, AI Agent Platform jobs:

Get Ready For Your
Next Interview