Senior Site Reliability Engineer

Ad Hoc • $135K — $150K *

Mclean, VA 22101In-Person

Information Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Bachelor's degree with 7+ years of related experience; relevant experience can substitute for education.
Proven ownership of reliability for production systems including SLOs and incident response.
Expertise in at least one infrastructure-as-code tool, preferably Terraform.
Extensive knowledge of cloud infrastructure, containerization, and networking concepts.
Must be eligible for and maintain a U.S. Public Trust security clearance.

Responsibilities

Define and maintain service level objectives (SLOs) and drive the platform toward them.
Design and implement observability across all relevant metrics and logging tools.
Lead incident response initiatives and enhance time-to-recovery practices.
Facilitate blameless postmortems to foster a culture of continuous reliability improvement.
Automate processes to eliminate manual work and boost operational efficiency.
Design and maintain cloud infrastructure (AWS) and Kubernetes deployments effectively.
Mentor junior engineers and create reusable modules for reliability operations.

Benefits

Company-subsidized health, dental, and vision insurance.
Flexible paid time off (PTO) policy.
401K plan with employer matching contributions.
Paid parental leave available after one year of service.
Access to an Employee Assistance Program (EAP).

Full Job Description

Senior Site Reliability Engineer
Job number: 884

This is a remote position.

The Veterans Affairs business unit helps transform the VA into a modern digital services organization where Veteran outcomes are at the center of every effort. We partner with the VA to design and deliver seamless user experiences for Veterans, their families and caregivers, and VA employees. By applying better practices in service design, product management, and technology, we enable the VA to increase the use, quality, and reliability of services and decrease the time Veterans spend waiting for outcomes.

Primary Responsibilities:

As a Senior Site Reliability Engineer, you will serve as an experienced individual contributor responsible for the availability, performance, and reliability of a large federal enterprise cloud platform that operates around the clock. With minimal oversight, you will help meet scope, schedule, and delivery requirements while shaping the platform's reliability strategy. Primary expectations of a Senior Site Reliability Engineer include:

Defining and maintaining service level objectives (SLOs), service level indicators, and error budgets, and driving the platform toward them
Designing and operating observability across metrics, logging, tracing, and alerting
Leading incident response and on-call practices, including escalation, mitigation, and time-to-recovery improvements
Driving blameless postmortems and systemic reliability improvements
Engineering automation to eliminate toil and improve operational efficiency
Self-directed design of reliable cloud infrastructure (AWS) and Kubernetes (Amazon EKS), including tradeoffs between cost, reliability, and efficiency
Building reusable modules and mentoring engineers on reliability practices
Presenting design documents and system diagrams to stakeholders
Participating in technical depth interviews with new candidates

Basic Qualifications:

Bachelor's and 7+ years of experience; relevant experience may be substituted for education
Demonstrated experience owning reliability (SLOs, observability, incident response) for production systems
Expert-level knowledge of at least one infrastructure-as-code tool (Terraform preferred)
Deep command of cloud infrastructure, containerization, and networking
Must be able to obtain and maintain a U.S. Public Trust / suitability determination

Preferred Qualifications:

Prior experience with the Department of Veterans Affairs
Kubernetes (Amazon EKS) and AWS at scale
Familiarity with FedRAMP, NIST 800-53, and zero-trust architecture
Relevant certifications (e.g., AWS, CKA/CKS)

To learn more about working at Ad Hoc, please visit:https://adhocteam.us/join

Benefits:

Company-subsidized health, dental, and vision insurance
Flexible PTO
401K with employer match
Paid parental leave after one year of service
Employee Assistance Program

In support of various state and city equal pay transparency laws, Ad Hoc job descriptions feature the starting range we reasonably expect to pay to candidates who would join our team with little to no need for training on the responsibilities we've outlined above. Actual compensation is influenced by a wide range of factors including but not limited to skill set, level of experience, and responsibility. The range of starting pay for this role is $135,000-$150,000. Our recruiters will be happy to answer any questions you may have, and we look forward to learning more about your salary requirements.

job reference:

https://adhoc.team/

About Ad Hoc

Ad Hoc is a digital services company that helps government agencies improve the user experience of their digital services. They work with clients across a range of industries, including healthcare, finance, and transportation. Ad Hoc provides a range of services, including user research, design, and development. They are known for their user-centered approach and their ability to deliver high-quality digital services that meet the needs of their clients and their users. Ad Hoc was founded in 2014 and is headquartered in Washington, DC.

Learn more about Ad Hoc

Size

200 employees

Industry

Enterprise Technology

Founded

2014

* Ladders Estimates

Similar Jobs

SME Systems Engineer
$120K — $150K *
VTG
Chantilly, VA 20152 (Loudoun County)
Today
Technical/Functional Expert (Server)
$49K — $290K *
Gem.com
Annapolis Junction, MD 20701 (Howard County)
Reposted Today
Eng Sr Prin II - Sys
$120K — $150K *
BAE Systems
Herndon, VA 20171 (Fairfax County)
Reposted Today
Eng Sr Prin - Sys
$120K — $150K *
BAE Systems
Totowa, NJ 07512 (Passaic County)
Reposted Today
Senior Software Systems Engineer
$120K — $150K *
International SOS Ltd
Falls Church, VA 22042 (Fairfax County)
Today
Scientist, Systems Engineer
$133K — $247K *
Level 3 Communications, Inc
Rochester, NY 14609 (Monroe County)
Today

Get Ready For Your
Next Interview

More Jobs at Ad Hoc

Release Train Engineer
$120K — $135K *
Mclean, VA 22101 (Fairfax County)
Today
Education, Government & Non-Profit
In-Person
PKI / IAM Security Engineer
$130K — $135K *
Mclean, VA 22101 (Fairfax County)
Today
Information Technology
In-Person
Platform Engineer
$125K — $135K *
Mclean, VA 22101 (Fairfax County)
Today
Enterprise Technology
In-Person
Senior Site Reliability Engineer
$135K — $150K *
Mclean, VA 22101 (Fairfax County)
Today
Information Technology
In-Person
Senior Systems Engineer
$130K — $140K *
Mclean, VA 22101 (Fairfax County)
Today
Information Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
2 weeks ago
Cyber Threat Hunt Manager
$120K — $150K *
DTCC
Tampa, FL 33647 (Hillsborough County)
Today
Sr. Applied Science , AWS Agentic AI
$192K — $260K *
Amazon
Santa Clara, CA 95051 (Santa Clara County)
Reposted Today
Sr Manager, Regulatory Compliance
$102K — $209K *
Oracle Corporation
Houston, TX 77084 (Harris County)
Today
Enterprise Data Engineer
$100K — $130K *
VTG
Chantilly, VA 20152 (Loudoun County)
Today

Find similar Senior Site Reliability Engineer jobs:

Nationwide Mclean, VA

Senior Site Reliability Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Site Reliability Engineer jobs:

Get Ready For Your
Next Interview