Site Reliability Engineer (SRE)

Quindar

$100K — $130K *
Education, Government & Non-Profit
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science or related field
  • 3+ years of professional experience in SRE, DevOps, or related roles
  • Active U.S. Security Clearance (TS/SCI) required
  • Experience working towards ATO in federal, DoD, or IC environments preferred
  • Experience supporting GovCloud, C2S/C2E environments highly desirable

Responsibilities

  • Design, automate, deploy, and operate reliable cloud systems for U.S. Government customers
  • Continuously improve reliability and operability of Quindar's platform in production
  • Build and evolve automated deployment pipelines and hardened runtime environments
  • Support deployments to air-gapped networks for consistency and reliability
  • Define and implement best practices for availability, latency, and incident responses
  • Collaborate with cross-functional teams to meet performance and reliability requirements
  • Participate in incident response and a 24/7 on-call rotation

Benefits

  • Opportunities for professional growth and development
  • Participation in innovative projects supporting mission-critical workloads
  • Collaboration with talented teams in cutting-edge technology environments
  • Work on highly impactful projects within the U.S. Government sector
  • Engagement in a culture that promotes automation and efficiency
Full Job Description
What You'll Be Doing

Design, automate, deploy, and operate highly reliable cloud systems supporting mission-critical workloads for U.S. Government customers. This role is centered on DevSecOps and site reliability engineering, with a strong emphasis on deployment automation, operational stability, and system resilience across AWS GovCloud and AWS C2E environments.

You will be responsible for continuously improving the reliability and operability of Quindar's platform in production, ensuring systems are observable, fault-tolerant, and require minimal manual intervention. Your work will directly impact mission success by improving system uptime, deployment velocity, and operational confidence in constrained and classified environments.

A key focus of this role is building and evolving automated deployment pipelines, hardened runtime environments, and repeatable infrastructure patterns that support secure and scalable operations in regulated environments.

You will also support and improve Quindar deployments to air-gapped networks, driving consistency, reliability, and performance across all environments. As the organization grows, you will help define and implement best practices for availability, latency, incident response, and service-level objectives (SLOs).

This role includes participation in incident response and a 24/7 on-call rotation, with a strong mandate to eliminate toil through automation and continuously improve system reliability.

You will collaborate closely with frontend, backend, and platform engineers to ensure systems meet performance, reliability, and mission assurance requirements.

Technical Skills
  • Strong experience with Kubernetes and containerized workloads in production environments
  • Hands-on experience operating clusters in AWS EKS, Rancher, or similar platforms
  • Experience supporting GovCloud, IL-enclave, or C2E environments
  • Deep experience with CI/CD systems and deployment automation (GitLab preferred)
  • Proficiency in Python and Infrastructure-as-Code tools (Terraform or similar)
  • Experience with observability platforms (Grafana LGTM stack, Datadog, or equivalent)
  • Strong understanding of distributed systems, APIs, databases, caching, and event-driven architectures
  • Solid networking fundamentals (VPCs, VPNs, load balancers, TLS, service connectivity)
  • Experience with Linux/Unix systems
  • Familiarity with cloud security best practices, enclave boundaries, and secure system design
  • Experience with identity and access management (AWS IAM, Auth0, Keycloak, ICAM patterns)
  • Strong Git fundamentals and experience supporting deployments across multiple classification levels
Qualifications
  • Bachelor's degree in Computer Science or related field
  • 3+ years of professional experience as an SRE, DevOps, reliability, infrastructure, or platform engineer
  • Active U.S. Security Clearance (TS/SCI); U.S. Citizenship required
  • Experience working toward ATO/authorization in federal, DoD, or IC environments preferred
  • Experience supporting deployments in GovCloud, C2S/C2E, or IL-enclave environments highly desirable
ITAR REQUIREMENTS
  • To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. a7 1157, or (iv) Asylee under 8 U.S.C. a7 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here.

Similar Jobs

More Jobs at Quindar

More Education, Government & Non-Profit Jobs

Find similar Site Reliability Engineer (SRE) jobs: