Zywave

AgenticOps Engineer

Zywave$100K — $140K *
US-AnywhereRemote in Milwaukee, WI
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of software engineering experience with solid systems thinking and debugging skills.
  • Hands-on experience with LLM APIs, including prompt design and function calling.
  • Proven ability to address complex, cross-functional technical problems.
  • Strong analytical skills for building dashboards and analyzing output quality patterns.
  • Excellent written and verbal communication skills for effective documentation and knowledge sharing.

Responsibilities

  • Monitor agent output quality, latency, and failure rates across product teams.
  • Detect patterns of issues related to agent performance and prompt structures.
  • Develop and manage dashboards and alert systems for tracking agent health.
  • Collaborate on prompt engineering and maintain a playbook documenting effective practices.
  • Assist Product Engineers by diagnosing persistent agent quality issues and testing solutions.
  • Build evaluation tools and benchmarks for agent workflows.
  • Share insights and best practices through cross-team collaboration.

Benefits

  • Flexible work hours and remote work options.
  • Opportunities for professional development and training.
  • Access to the latest AI tools and technologies.
  • Collaborative and innovative work environment.
  • Health and wellness benefits, including mental health support.
Full Job Description
Description

Role Summary

The Agentic Ops Engineer operates as a cross-team specialist responsible for the health, reliability, and continuous improvement of AI agent output across the organization. They are not embedded in any single team's daily sprint cycle. Instead, they maintain a bird's-eye view of how agents perform across a handful of product engineering teams, identifying systemic patterns, diagnosing recurring failure modes, and tuning the shared prompt and tooling infrastructure that every team depends on.

When a Product Engineer hits a wall with agent output quality-whether it's a spec the agent can't parse, a prompt structure that produces inconsistent results, or a workflow that degrades over time-the Agentic Ops Engineer is who they pull in.

Core Responsibilities
  1. Agent Performance Monitoring & Pattern Detection
    • Continuously monitor agent output quality, latency, and failure rates across all three product teams.
    • Identify recurring patterns such as "agents keep struggling with this type of spec" or "this prompt structure consistently produces higher-quality output.
    • Build and maintain dashboards and alerting systems that surface degradation before teams feel it.
    • Conduct periodic reviews of agent interaction logs to flag systemic issues and emerging trends.
  2. Prompt Engineering & System Tuning
    • Own the shared prompt infrastructure: templates, system prompts, few-shot libraries, and chain-of-thought scaffolding used across teams.
    • Iterate on prompt structures based on observed failure modes and A/B performance data.
    • Develop and maintain a prompt playbook documenting what works, what doesn't, and why.
    • Evaluate and integrate new model capabilities, versioning changes, and API updates as they roll out from providers.
  3. Escalation Support & Embedded Problem-Solving
    • Serve as the on-call specialist when a Product Engineer encounters persistent agent output quality issues.
    • Diagnose root causes: Is it the prompt? The spec format? The model's limitations? A context window issue?
    • Pair with Product Engineers to rapidly prototype and test fixes, then roll improvements back into shared systems.
    • Maintain a knowledge base of resolved issues and their solutions to reduce repeat escalations.
  4. Tooling, Evaluation & Infrastructure
    • Build and maintain evaluation harnesses, benchmarks, and regression test suites for agent workflows.
    • Develop internal tooling for prompt version control, output comparison, and automated quality scoring.
    • Collaborate with platform/infra teams to optimize agent execution pipelines (caching, context management, token budgets).
    • Establish and track key metrics: output acceptance rate, revision frequency, time-to-resolution on escalations.
  5. Knowledge Sharing & Team Enablement
    • Run regular cross-team syncs sharing findings, patterns, and updated best practices.
    • Produce internal documentation, guidelines, and training materials on working effectively with agents.
    • Coach Product Engineers on prompt construction, spec formatting, and debugging agent behavior.
    • Serve as the organizational point of contact for agent-related decisions (model selection, provider evaluation, capability assessments).


Requirements

Qualifications

Required
  • 5+ years of software engineering experience with strong fundamentals in systems thinking and debugging.
  • Hands-on experience building with LLM APIs (prompt design, chain-of-thought, tool use, function calling).
  • Demonstrated ability to diagnose and resolve complex, cross-cutting technical issues.
  • Strong analytical skills: comfortable building dashboards, writing queries, and reasoning about statistical patterns in output quality.
  • Excellent written and verbal communication-this role lives on documentation, cross-team clarity, and knowledge transfer.

Preferred
  • Experience with prompt evaluation frameworks, LLM observability tools (e.g., LangSmith, Braintrust, Humanloop), or building internal evaluation harnesses.
  • Background in developer tooling, platform engineering, or SRE/DevOps with an understanding of reliability principles applied to non-deterministic systems.
  • Familiarity with multiple LLM providers and models; able to reason about trade-offs in capability, cost, and latency.
  • Experience working cross-functionally across multiple product teams without direct authority.

Success Metrics (First 6 Months)
  1. Agent output acceptance rate increases measurably across all three teams (baseline established in month 1).
  2. Median escalation resolution time drops by 40%+ as patterns are documented and systemic fixes are applied.
  3. Prompt playbook and evaluation harness are live and actively used by all three teams.
  4. Repeat escalation rate for known failure modes trends toward zero as systemic fixes propagate.
  5. Cross-team visibility into agent performance is self-serve via dashboards and a monthly report cadence.

What This Role Is Not
  • Not a team lead or manager - this is an IC role with cross-cutting influence, not authority.
  • Not a data scientist - although analytical skills are essential, the focus is operational, not research.
  • Not an ML engineer - you're tuning how the organization uses models, not training or fine-tuning them.
  • Not a project manager - you don't own team roadmaps; you own the health of the agent layer beneath them.

About Zywave

Zywave is a leading provider of software solutions for the insurance industry. The company's products are designed to help insurance brokers and carriers streamline their operations, improve their sales and marketing efforts, and provide better service to their clients. Zywave's software solutions include agency management systems, client portals, and marketing automation tools. The company has a strong focus on innovation and is constantly developing new products and features to meet the evolving needs of the insurance industry. Zywave is headquartered in Waukesha, Wisconsin, and has offices in Canada, the UK, and India.
Learn more about Zywave
Size
500 employees
Industry
Founded
1995

Similar Jobs

More Jobs at Zywave

More Information Technology Jobs

Find similar AgenticOps Engineer jobs: