Manager, Production Support & Service Reliability

Accordion

$150K — $190K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years in production support, application support, or service reliability with leadership experience.
  • Strong expertise in incident management and service operations; proficient in creating reliable processes.
  • Experience collaborating with engineering and product teams in modern software deployment environments.
  • Ability to communicate effectively with technical and business stakeholders in high-pressure situations.
  • Familiarity with finance-sensitive and business-critical applications; adept at handling complex issues.
  • Commitment to trust and quality in service delivery; preferred bachelor's degree.

Responsibilities

  • Define production support operating model, incident lifecycle, and severity framework.
  • Establish service levels and communication standards for managing production issues.
  • Create visibility into live-service health and incident trends for proactive issue management.
  • Lead and mentor production support engineers and analysts as the team expands.
  • Develop scalable support routines and issue triage practices across products.
  • Manage incident governance for significant issues, ensuring stakeholder awareness and effective escalation.
  • Oversee support for AI workflows, including classification of issues and workflow-sensitive processes.

Benefits

  • Hybrid work flexibility with remote options available.
  • Opportunity to be part of a growing team focused on future growth.
  • Chance to mentor and lead within a collaborative environment.
  • Engagement in meaningful work that impacts client service.
Full Job Description
Please note: This position is not eligible for immigration sponsorship.
Manager, Production Support & Service Reliability

The Manager of Production Support & Service Reliability owns the production support operating model, incident management, service levels, release-watch support, escalation governance, and overall live-service quality across pods. This impactful role is critical in ensuring Production Support is proactive, disciplined, and tightly connected to Engineering, Product, QA, and business stakeholders.

This position will ideally be based in one of our US offices and is a hybrid role with the flexibility to work remotely 2 days per week (also eligible for remote candidates).

This position is not eligible for immigration sponsorship.
What You'll Do:

Service Reliability Leadership
  • Define the production support operating model, incident lifecycle, severity framework, and support expectations across pods
  • Establish service levels, escalation paths, release-watch routines, and communication standards for production issues
  • Create visibility into live-service health, incident trends, and recurring support risks

Team & Process Leadership
  • Lead production support engineers and AI operations analysts as the team continues to grow
  • Build repeatable support routines, runbook discipline, and issue triage practices that scale across products and workflows
  • Partner with Product, Engineering, and QA to improve supportability and reduce recurring production issues

Incident & Stakeholder Management
  • Own incident governance for material issues, including severity calls, stakeholder updates, escalation management, and stabilization plans
  • Ensure business and technical stakeholders have clear visibility into impact, next steps, and resolution progress
  • Drive post-incident review practices that improve resilience and reduce repeat failures

Finance & AI Production Readiness
  • Ensure finance-sensitive workflows receive the right level of production support, issue classification, and escalation handling
  • Oversee support patterns for AI-enabled workflows, including degraded outputs, fallback scenarios, trust issues, and human-review triggers
  • Work with business and technical teams to distinguish software defects from data, process, training, or model-behavior issues
You Have:
  • 8+ years of production support, application support, service reliability, or engineering operations experience, including team leadership
  • Strong knowledge of incident management, service operations, support processes, release support, and escalation discipline - you are a builder of reliable operating processes, not just a responder to tickets
  • Experience working closely with engineering and product teams in modern software delivery environments - you are structured, pragmatic, and highly credible across teams
  • Ability to communicate clearly with both technical and business stakeholders during high-pressure situations - you are calm under pressure and comfortable making judgment calls with incomplete information
  • Comfort operating in finance-sensitive, workflow-heavy, or business-critical application environments
  • Consistent commitment to trust, transparency, and quality
  • Bachelor's degree preferred
You Are:
  • Excited to be part of a growing team, with a focus on driving future growth
  • Passionate about delivering exceptional client service
  • Knowledgeable about the relationship between Private Equity sponsors and their portfolio companies, particularly within the office of the CFO
  • Willing to get your hands dirty in the details of a project while simultaneously seeing the whole picture
  • Comfortable managing projects with multiple complex workstreams while also focused on your single workstream (you are managing and doing)
  • A self-starter with a strong work ethic
  • Full of entrepreneurial spirit and comfortable in a fluid, flat organization
  • Able to effectively communicate complex issues and solutions, and raise issues to senior team members when necessary
  • A team player, able to work with team members across all levels
  • Someone who enjoys mentoring others and doing meaningful work
  • A leader of others; you lead by example

The annual salary for this role ranges from: $150,000 to $190,000 + benefits + bonus.

Actual compensation packages are determined by evaluating a wide array of factors unique to each candidate, including but not limited to geographic location, skill set, years and depth of experience, education, certifications, cost of labor and internal equity.

#LI-HA1

Similar Jobs

More Jobs at Accordion

  • Director, Data Platform
    $170K — $225K *
    Charlotte, NC 28269 (Mecklenburg County)
    Enterprise Technology
    In-Person
  • Director, Data Platform
    $170K — $225K *
    Los Angeles, CA 90011 (Los Angeles County)
    Enterprise Technology
    In-Person
  • Director, Data Platform
    $170K — $225K *
    Dallas, TX 75217 (Dallas County)
    Information Technology
    In-Person
  • Director, Data Platform
    $170K — $225K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person
  • Director, Data Platform
    $170K — $225K *
    New York, NY 10025 (New York County)
    Enterprise Technology
    In-Person

More Information Technology Jobs

Find similar Manager, Production Support & Service Reliability jobs: