Ad Hoc

Monitoring & Observability Lead

Ad Hoc$140K — $155K *
Enterprise Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree and 9+ years of experience, or equivalent relevant experience.
  • Proven leadership in monitoring, observability, or operations for production systems.
  • Hands-on experience with observability tools covering metrics, logging, and alerting.
  • Familiarity with cloud infrastructure such as AWS and Kubernetes, especially Amazon EKS.
  • Ability to obtain and maintain U.S. Public Trust clearance.

Responsibilities

  • Lead the development of monitoring and observability strategy for a large-scale federal cloud platform.
  • Establish and maintain service level objectives (SLOs), dashboards, and alerting protocols.
  • Coordinate incident detection, triage, and escalation to ensure continuous operational monitoring.
  • Drive initiatives to enhance mean time to detection and recovery metrics.
  • Mentor engineering teams and establish operational excellence standards.
  • Implement observability tools and automation using infrastructure as code, specifically Terraform.
  • Collaborate with reliability, platform, and security teams to close monitoring gaps.
  • Engage with government partners to meet compliance and performance requirements.

Benefits

  • Company-subsidized health, dental, and vision insurance.
  • Flexible PTO policy allowing adaptable time off.
  • 401K with employer matching contributions.
  • Paid parental leave available after one year of service.
  • Access to Employee Assistance Programs for support.
Full Job Description
Monitoring & Observability Lead
Job number: 886

This is a remote position.

The Veterans Affairs business unit helps transform the VA into a modern digital services organization where Veteran outcomes are at the center of every effort. We partner with the VA to design and deliver seamless user experiences for Veterans, their families and caregivers, and VA employees. By applying better practices in service design, product management, and technology, we enable the VA to increase the use, quality, and reliability of services and decrease the time Veterans spend waiting for outcomes.

Primary Responsibilities:

As a Monitoring & Observability Lead, you will lead the monitoring, alerting, and observability practice for a large federal enterprise cloud platform that operates around the clock. Working with leadership, you will help shape the technical direction of platform operations and ensure issues are detected and resolved quickly. Primary expectations of a Monitoring & Observability Lead include:
  • Leading the platform's monitoring and observability strategy across metrics, logging, tracing, and alerting
  • Establishing and maintaining service level objectives (SLOs), dashboards, and alerting standards
  • Leading around-the-clock operational monitoring and coordinating incident detection, triage, and escalation
  • Driving continuous improvement of mean time to detection and mean time to recovery
  • Mentoring engineers and setting standards for operational excellence
  • Implementing observability tooling and automation as infrastructure as code (Terraform)
  • Partnering with reliability, platform, and security teams to close monitoring gaps
  • Working with government partners to meet security, SLA, and performance requirements
  • Participating in technical depth interviews with new candidates
  • Responsible for hiring, performance management, timecard reviews, PTO management and team development


Basic Qualifications:
  • Bachelor's and 9+ years of experience; relevant experience may be substituted for education
  • Demonstrated experience leading monitoring/observability or operations for production systems
  • Hands-on experience with observability tooling (metrics, logging, alerting) and incident management
  • Familiarity with cloud infrastructure (AWS) and Kubernetes (Amazon EKS)
  • Must be able to obtain and maintain a U.S. Public Trust / suitability determination


Preferred Qualifications:
  • Prior experience with the Department of Veterans Affairs
  • Experience with SLO-based reliability and on-call program leadership
  • Relevant certifications (e.g., AWS, observability platform certifications)


To learn more about working at Ad Hoc, please visit:https://adhocteam.us/join

Benefits:
  • Company-subsidized health, dental, and vision insurance
  • Flexible PTO
  • 401K with employer match
  • Paid parental leave after one year of service
  • Employee Assistance Program

In support of various state and city equal pay transparency laws, Ad Hoc job descriptions feature the starting range we reasonably expect to pay to candidates who would join our team with little to no need for training on the responsibilities we've outlined above. Actual compensation is influenced by a wide range of factors including but not limited to skill set, level of experience, and responsibility. The range of starting pay for this role is $140,000-$155,000. Our recruiters will be happy to answer any questions you may have, and we look forward to learning more about your salary requirements.

job reference:

https://adhoc.team/

About Ad Hoc

Ad Hoc is a digital services company that helps government agencies improve the user experience of their digital services. They work with clients across a range of industries, including healthcare, finance, and transportation. Ad Hoc provides a range of services, including user research, design, and development. They are known for their user-centered approach and their ability to deliver high-quality digital services that meet the needs of their clients and their users. Ad Hoc was founded in 2014 and is headquartered in Washington, DC.
Learn more about Ad Hoc
Size
200 employees
Industry
Founded
2014

Similar Jobs

More Jobs at Ad Hoc

More Enterprise Technology Jobs

Find similar Monitoring & Observability Lead jobs: