Senior AI Engineer (Agent OS Platform)

ServiceTitan • $168K — $224K *

US-AnywhereRemote in California, US

Information Technology

5 - 7 years of experience

1 week ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of production software engineering experience
Strong hands-on coding ability in Python, Java, C#, or another backend language, with a preference for Python
Experience building AI, ML, data, platform, infrastructure, workflow, automation, or developer-platform systems in production
Practical understanding of modern LLM application architecture
Experience with distributed systems and event-driven architectures
Strong data and context instincts including SQL and unstructured data management
Good engineering judgment across APIs, reliability, security, and multi-tenant SaaS constraints

Responsibilities

Design and implement core Agent OS platform services
Write production code and review implementation from other engineers
Build reliable APIs, workflows, tools, and services for agent execution
Inspect traces, debug failures, and improve production behavior
Work through real agent failure modes
Partner with domain teams to turn agent use cases into reusable platform patterns
Contribute to technical direction while assuring safe and timely delivery

Benefits

Flexible time off and support for autonomous work
Comprehensive health benefits including medical, dental, and vision plans
Parental leave and various fertility services support
401k matching and financial planning tools
Recognition programs and learning opportunities

Full Job Description

What You’ll Build

Agent runtime and workflow execution: Build the runtime for role-specific agents, tool use, delegation, pause/resume, durable checkpoints, retries, and failure recovery. Agents must resume safely without losing state or duplicating side effects.
Typed tools and action contracts: Build deterministic controls around non-deterministic reasoning: governed reads, proposed writes, precondition checks, business invariants, scoped permissions, idempotency, audit trails, and rollback.
Context and memory systems: Build tenant-scoped context assembly, retrieval, freshness controls, provenance, transcripts, artifacts, tool results, and replayable evidence. ServiceTitan systems of record stay authoritative; memory provides context and coordination.
Trust and approval infrastructure: Build human-in-the-loop gates, approval thresholds, reversibility, tenant policy enforcement, and audit history for financial, contractual, dispatch, warranty, and compliance-sensitive workflows.
Evaluation and observability: Build offline and online evals, scenario libraries, simulation, trajectory review, regression detection, cost and latency telemetry, and autonomy promotion gates.
Reusable capability platform: Help product teams package prompts, tools, context requirements, policies, evals, rollout controls, ownership, and rollback into governed capabilities for owners, CSRs, dispatchers, technicians, managers, and back-office teams.
Model and inference architecture: Make practical tradeoffs across latency, cost, quality, structured outputs, caching, fallback behavior, provider choice, and model routing behind a shared platform layer.

What You’ll Do

Design and implement core Agent OS platform services.
Write production code and review implementation details from other engineers.
Build reliable APIs, workflows, tools, and services for agent execution.
Inspect traces, debug failures, and improve production behavior.
Design evaluation scenarios and regression suites for agent workflows.
Work through real agent failure modes: stale context, wrong tool calls, missing permissions, unsafe actions, poor retrieval, latency spikes, and cost regressions.
Partner with domain teams to turn agent use cases into reusable platform patterns.
Help define platform contracts for tools, actions, approvals, context, memory, evidence, and evaluation.
Contribute to technical direction while staying grounded in what can ship quickly and safely.
Communicate clearly with engineers, product managers, architects, security partners, and engineering leadership.

What You’ll Bring

5+ years of production software engineering experience.
Strong hands-on coding ability in Python, Java, C#, or another backend language. Python experience is strongly preferred.
Experience building AI, ML, data, platform, infrastructure, workflow, automation, or developer-platform systems in production.
Practical understanding of modern LLM application architecture: model gateways, prompt and context assembly, retrieval, tool calling, structured outputs, memory, agent workflows, and human approval patterns.
Experience with distributed systems, event-driven systems, async workflows, queues, durable execution, or message-driven architectures.
Strong production-safety instincts for non-deterministic systems: typed contracts, scoped permissions, precondition checks, idempotency, audit trails, rollback, and monitoring.
Experience designing or operating evaluation systems: behavioral evals, regression suites, scenario tests, trajectory review, simulation, online metrics, or production monitoring.
Strong data and context instincts: SQL, unstructured data, vector search, metadata, provenance, freshness, source authority, and privacy boundaries.
Experience with databases, warehouses, or search systems such as PostgreSQL, SQL Server, Snowflake, BigQuery, Elasticsearch, or vector stores.
Experience building services on public cloud infrastructure such as Azure, AWS, or GCP.
Good engineering judgment across APIs, reliability, security, observability, and multi-tenant SaaS constraints.

Bonus points

Experience building or operating agent runtimes, workflow engines, model gateways, ML platforms, evaluation platforms, developer platforms, or internal control planes.
Experience with LangGraph, LangChain, LlamaIndex, Semantic Kernel, OpenAI Agents SDK, Anthropic tooling, or similar frameworks.
Experience with MCP, A2A, tool protocols, agent interoperability, or agent-commerce patterns.
Experience with Kubernetes, Docker, serverless platforms, or cloud-native infrastructure.
Experience with compliance-sensitive workflows, approval-gated automation, audit trails, policy engines, or governed writes to systems of record.
Experience in SaaS, vertical software, fintech, ERP, CRM, marketplace, field service, or other domains where software decisions affect real business operations.
Experience with graph-based data models, knowledge graphs, entity resolution, or cross-domain operational context systems.

Why this role matters

Most AI products fail when the demo becomes a production workflow. The hard problems show up in the platform: context freshness, tool reliability, permissions, evaluation, traceability, rollback, and trust.

That is what this team is building.

At ServiceTitan, agents need to work inside real contractor operations. They need to understand the job, the customer, the technician, the equipment, the agreement, the invoice, the warranty, and the business policy. They need to explain what evidence they used. They need to know when to ask for approval. They need to recover when something fails.

The engineer in this role helps set the technical standard for every AI surface ServiceTitan builds next. This is a high-leverage engineering role for someone who wants to build the platform underneath production agents, not just another agent demo.

Remote Location (US and Canada only)- Candidates based in PST highly preferred.

What We Offer:
When you join our team, you’re not just accepting a job. You’re making a career move. Here’s how we’ll support you in doing some of the most impactful work of your career:

Flextime, recognition, and support for autonomous work: Flexible time off with ample learning and development opportunities to continue growing your career. We offer a comprehensive onboarding program, leadership training for Titans at all levels, and other programs and events. Great work is rewarded through Bonusly, peer-nominated awards, and more.
Holistic health and wellness benefits: Company-paid medical, dental, and vision (with 100% employer paid options and 90% coverage for dependents), FSA and HSA, 401k match, and telehealth options including memberships to One Medical.
Support for Titans at all stages of life: Parental leave and support, up to $20k in fertility services (i.e. IUI and IVF), surrogacy, and adoption reimbursement, on demand maternity support through Maven Maternity, free breast milk shipping through Maven Milk, pet insurance, legal advisory services, financial planning tools, and more.

About ServiceTitan

Learn more about ServiceTitan

Industry

Information Technology

Founded

2013

* Ladders Estimates

Similar Jobs

Senior AI Software Engineer
$160K — $180K *
Latham & Watkins LLP
Los Angeles, CA 90011 (Los Angeles County)
Reposted Today
Senior AI/ML Engineer - Engineering Excellence (Full Stack Developer) Vice President
$125K — $188K *
Citigroup, Inc
Jacksonville, FL 32210 (Duval County)
Reposted Today
Applied AI Engineer
$115K — $192K *
Relx Group
Raleigh, NC 27610 (Wake County)
Today
Senior Machine Learning Platform Engineer
$148K — $247K *
Guidewire Software
San Mateo, CA 94403 (San Mateo County)
Reposted Today
Machine Learning Engineer III, Core Agents
$175K — $219K *
Box Inc
Redwood City, CA 94061 (San Mateo County)
Today
Software Engineer, Computer Vision and Deep Learning
$180K — $260K *
Mashgin
Palo Alto, CA 94303 (Santa Clara County)
Today

Get Ready For Your
Next Interview

More Jobs at ServiceTitan

Director, Success Engineer (Strategic)
$204K — $327K *
Remote
4 days ago
Enterprise Technology
Remote in United States
Senior Manager, AI Engineering
$239K — $358K *
Remote
4 days ago
Enterprise Technology
Remote in United States
Senior AI Engineer (Agent OS Platform)
$168K — $224K *
Remote
1 week ago
Information Technology
Remote in California, US
Manager, Global Mobility
$114K — $152K *
Remote
3 weeks ago
Business Services
Remote in United States
Sales Enablement Manager
$86K — $115K *
Remote
3 weeks ago
Business Services
Remote

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
Yesterday
Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
1 week ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Wiki Platform Engineer
$80K — $100K *
Valnet Inc.
Saint-laurent, QC H4K 1H9
Reposted Today
Data Center Technician
$62K — $112K *
Amazon
Hermiston, OR 97838 (Umatilla County)
Reposted Today

Find similar Senior AI Engineer (Agent OS Platform) jobs:

Nationwide Remote

Senior AI Engineer (Agent OS Platform)

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior AI Engineer (Agent OS Platform) jobs:

Get Ready For Your
Next Interview