Illumio

Sr. Site Reliability Engineer

Illumio$130K — $160K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in computer science, Engineering, or related field, or equivalent experience
  • 5+ years of experience as a Site Reliability Engineer (SRE) or similar role focusing on AWS and/or Azure
  • Hands-on experience in designing and managing AWS and/or Azure infrastructure
  • Proficiency in scripting languages like PowerShell, Python, or Go
  • Strong understanding of CI/CD principles and experience with Azure DevOps, Jenkins, or GitLab CI/CD
  • Experience with containerization technologies (e.g., Docker, Kubernetes) is a plus
  • AWS or Azure certifications preferred

Responsibilities

  • Monitor system performance and application health, implementing proactive optimizations
  • Handle on-call duties for production uptime and support escalations
  • Manage upgrades and maintenance including hotfixes
  • Lead incident response efforts and conduct root cause analysis
  • Implement security best practices in cloud environments
  • Drive continuous improvement initiatives leveraging automation and new technologies

Benefits

  • Collaborative work environment
  • Opportunity to drive innovation in cloud infrastructure
  • Focus on continuous improvement and learning
  • Exposure to cutting-edge technologies and tools
  • Support for professional development and certification opportunities
Full Job Description
Location: 5 on-site days a week in Sunnyvale, CA Headquarters.

Your Impact:

We are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in AWS & Azure cloud platforms to play a key role in ensuring the reliability, scalability, and performance of our cloud-based systems and applications.

The ideal candidate will have hands-on experience in supporting, and managing AWS and Azure infrastructure, along with a passion for automation, continuous improvement, and collaboration with cross-functional teams.

If you are passionate about AWS and/or Azure cloud platform and have a track record of driving reliability, scalability, and performance in cloud-based environments, we'd love to hear from you. Apply now to be a part of our talented team!
  • Monitor system performance, application health, and infrastructure metrics using monitoring and logging services, and implement proactive measures to optimize performance and availability
  • Oncall duty for production uptime and support for customer escalations
  • Release upgrades and maintenance activities including hotfixes and infrastructure updates
  • Lead incident response and resolution efforts, conducting root cause analysis, implementing corrective actions, and documenting post-incident reviews
  • Implement security best practices and controls in the cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
  • Drive continuous improvement initiatives to enhance reliability, scalability, and efficiency of infrastructure and services, leveraging automation and emerging technologies
Your Toolkit:
  • Bachelor's degree in computer science, Engineering, or related field; or equivalent work experience
  • 5+ years of experience working as a Site Reliability Engineer (SRE) or similar role, with a focus on AWS and/or Azure cloud platform
  • Hands-on experience in designing, deploying, and managing AWS and/or Azure infrastructure, including compute, storage, networking, and security services
  • Proficiency in scripting and programming languages such as PowerShell, Python, or Go for automation and infrastructure management tasks
  • Strong understanding of CI/CD principles and experience with tools such as Azure DevOps, Jenkins, or GitLab CI/CD
  • Experience with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture in AWS and Azure environments is a plus
  • Excellent analytical, problem-solving, and communication skills, with the ability to collaborate effectively with cross-functional teams
  • AWS or Azure certifications such as AWS/Azure Solutions Architect, Azure DevOps Engineer, or Azure Security Engineer are preferred

#LI-KD1 #LI-ONSITE

About Illumio

Illumio is a cybersecurity company that provides software for securing enterprise computing environments. The company was founded in 2013 by Andrew Rubin and PJ Kirner and is headquartered in Sunnyvale, California. Illumio's software uses micro-segmentation to protect against cyber threats by creating security policies that restrict access to sensitive data and applications. The company's software is used by a wide range of clients, including financial institutions, healthcare providers, and government agencies. Illumio has received numerous awards for its innovative approach to cybersecurity, including being named a Gartner Cool Vendor in 2016.
Learn more about Illumio
Size
500 employees
Industry
Founded
2013

Similar Jobs

More Jobs at Illumio

More Information Technology Jobs

Find similar Sr. Site Reliability Engineer jobs: