Description & Requirements
Maximus is seeking a Lead Site Reliability Engineer (SRE). The Lead Site Reliability Engineer (SRE) ensures operational stability, performance, and reliability of the A1 MCE platform through automation, monitoring, and continuous improvement of platform services.
This role is Hybrid at our San Antonio office and requires a Secret security clearance.
Maximus TCS (Technology and Consulting Services) Internal Job Profile Code: TCS166, T4, Band 7
Job-Specific Essential Duties and Responsibilities:
- Maintain uptime, scalability, and resilience of platform services.
- Develop and implement automation (IaC, pipelines).
- Manage monitoring, logging, and alerting frameworks.
- Support VDI, CI/CD pipelines, and platform infrastructure.
- Drive incident management, problem management, and root cause analysis.
Job-Specific Minimum Requirements:
- Active Secret clearance
- Ability to commute as needed to the work location in San Antonio, Texas.
- 7+ years in systems admin/DevOps/SRE.
- Cloud/DevOps certification.
- Experience with logging tools (Splunk, Elastic).
Preferred Skills and Qualifications:
- Experience in high-availability IL5/IL6 cloud environments.
- Advanced scripting (Python, PowerShell, Bash).
- Expertise in observability (metrics, tracing, AIOps).
- Experience implementing SLO/SLI frameworks.
- Familiarity with platform engineering and internal developer platforms (IDP).
#techjobs #clearance #veteransPage
Minimum Requirements
TCS166, T4, Band 7
Minimum Salary
$
135,000.00
Maximum Salary
$
175,000.00