The Site Reliability Engineer will architect, develop, and maintain Optum Serve's cloud environment in both the commercial and government AWS cloud. The role will work closely with software engineers, architects, and DevOps engineers to architect and maintain a secure, resilient and high performance cloud infrastructure.
To support this mission, OSIT has initiated a multi year modernization program aimed at updating and enhancing enterprise technology systems in accordance with modern design standards.
You'll enjoy the flexibility to work remotely * from anywhere within the U.S. as you take on some tough challenges. For all hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week.
Primary Responsibilities: - Build, maintain, and operate IaaS and PaaS infrastructure in AWS commercial and government clouds
- Work closely with dev teams to identify and measure SLOs, SLAs and SLIs
- Act a solid contributor to development of platform services including architecture, provisioning, configuration, deployment, and support
- Perform integrations with central logging, metrics dashboards, instrumentation, incident monitoring and management
- Build/integrate/administer systems and tools that enable engineering teams to observe their applications in production with autonomy (Dashboards, APMs)
- Support software and/or cloud-infrastructure in an on-call rotation basis
- Assist with identification and remediation of technical problems at the root cause by continuously implementing automation, self-healing, and real-time monitoring to production systems
- Maintain and improve operational tooling, frameworks
- Build frameworks that test the performance and resiliency of our platform services/tools
- Automate alerts for metrics on performance, cost, vulnerabilities, risk, compliance violations
- Improve processes and champion automation of any manual items around support.
You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.
Required Qualifications: - 3+ years of experience working within a cloud engineer/SRE role
- Proven solid knowledge of AWS services (ex. VPC, EC2, S3, ECS, Cloudformation, Lambda, EKS, RDS, ELB, Route53, RedShift)
- Proven expert knowledge and hands on production experience in Kubernetes (EKS or AKS) cluster setup and management required.
- Experience with infrastructure as code (IaC) tools like Terraform
- Experience with Kubernetes deployment tools like Helm, ArgoCD, Flux
- Demonstrated solid awareness of networking and internet protocols
- Proven understanding of identity and access management (IAM)
- Experience supporting infrastructure in production cloud environments
- Proven knowledge of Encryption (KMS), Public Key Infrastructure (PKI), understanding of OWASP
- Experience working with RESTful services
- Experience supporting environments adhering compliance standards like FedRAMP and NIST (800-171|53)
- Experience with monitoring tools (CloudWatch, VPC Flow Logs, Splunk, Dynatrace, Graphana, Prometheus)
- Demonstrated familiarity with IDEs and Source Control tools like Azure DevOps, Github or Gitlab.
- Ability to participate in 24/7 on-call rotation
- United States Citizenship
- If you are offered this position, you will be required to provide extensive personal information to obtain and maintain a suitability or determination of eligibility for a Confidential/Secret or Top Secret security clearance as a condition of your employment
Preferred Qualifications: - Bachelor's Degree in Computer Science, Information Technology, Software Engineering, Math, Physics
- Master's Degree with coursework focused on advanced algorithms, mathematics in computing, data structures or related field
- Expert knowledge of deploying Production grade applications in AWS
- Demonstrated passion about infrastructure automation
*All employees working remotely will be required to adhere to UnitedHealth Group's Telecommuter Policy
Pay is based on several factors including but not limited to local labor markets, education, work experience, certifications, etc. In addition to your salary, we offer benefits such as, a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with us, you'll find a far-reaching choice of benefits and incentives. The salary for this role will range from $72,800 to $130,000 annually based on full-time employment. We comply with all minimum wage laws as applicable.