Illumio

Sr. Manager, Site Reliability Engineering

Illumio$150K — $180K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Master's in Computer Engineering or related field, or equivalent experience.
  • 5+ years in Unix/Linux system administration with tools like Chef/Ansible, Ruby, and/or Python.
  • 7+ years developing CI/CD solutions with Concourse, Gitlab, or similar tools.
  • Proven track record of achieving uptime of at least 99.9% and meeting SLAs.
  • Experience in managing and running multi-tenant cloud infrastructure.

Responsibilities

  • Lead the SRE team in delivering SaaS security products to major clients, including Fortune 100 companies.
  • Collaborate with Development, QA, and Customer Support to maintain product health and customer satisfaction.
  • Manage global infrastructure to ensure operational efficiency through automation on cloud platforms.
  • Own the application lifecycle for SaaS delivery, focusing on availability and performance metrics.
  • Enhance CI/CD automation using Infrastructure as Code to minimize downtime effectively.
  • Develop and execute a long-term reliability and technology road map aligned with business goals.
  • Create a high-performing Cloud Operations and SRE team through mentorship and training initiatives.

Benefits

  • Access to cutting-edge technology and tools within the SaaS security space.
  • Opportunities for career advancement and professional development.
  • Participation in a supportive and collaborative team environment.
  • The potential for impactful work, ensuring security of major corporate infrastructures.
Full Job Description
Location: 5 on-site days a week in Sunnyvale, CA Headquarters.

Your Impact:

In this role, you will lead a team of talented engineers to help build a world-class SaaS security platform so we can continue to provide quality security solutions for our customers.

Every day you will lead a small team to ensure our SaaS security platform is available and performing, finding problems before our customers do, building tools to improve speed, confidence, and visibility, while embedding security into every step of the software and infrastructure life cycle.

To thrive in this role you must have at least 5 years of people leadership experience; be fluent in AWS/Azure cloud platforms and have programming language experience while hands-on building infrastructure tooling and automation at least 50% of the time.
  • Manage Illumio's SRE team to deliver SaaS security products to companies including the Fortune 100.
  • Work closely with Development, QA, Customer success, and Technical Support to ensure the health of our products and that all SLAs are being met for our customers.
  • Manage infrastructure to scale globally, utilizing automation tools to maximize operational efficiency on public clouds.
  • Lead the team responsible for supporting the infrastructure that powers the Illumio SaaS products.
  • Own and improve the SaaS delivery efficiency with end-to-end responsibility for application lifecycle, availability, performance, and SLAs.
  • Work with the team to improve CI/CD automation for deploying applications using Infrastructure as Code to minimize downtime and ensuring adherence to any contractual commitments.
  • Work with senior management in developing a long-term product reliability and technology road map, using strategies to align with business objectives and large scale.
  • Develop and maintain automation which can be consumed by multiple teams to deploy SaaS clusters.
  • Create a high-performing Cloud Operations and SRE team through career development, mentorship, and training
  • Ensure adherence of infrastructure and processes to FedRAMP, SOC2 and other requirements, and work with PM to adjust the infrastructure to meet any federal changes.
  • Continue to evolve product architecture and DevOps processes to ensure reliable CI/CD pipelines and continuous delivery.
Your Toolkit:
  • Bachelor's or Master's degree in Computer Engineering, Computer Science, or related field, or equivalent relevant experience
  • 5+ years of Unix or Linux system administration experience with Chef/Ansible, Ruby and/or Python
  • 7+ years of hands-on technical experience managing/developing CI/CD solution using Concourse, Gitlab, or equivalent
  • Proven track record of improving uptime (at least 99.9%) and SLAs.
  • Experience establishing support SLOs.
  • Working experience with Cloud Technology such as application gateway, HAProxy, Nginx, microservices, databases(Postgresql), Redis
  • Experience building observability for 24/7 monitoring with Grafana, Prometheus, Splunk, Datadog, or equivalent and ability to improve uptime and meet SLAs.
  • Experience building/running revenue generating enterprise applications in at least one of the three big public cloud providers: AWS(Prefer), Azure, or GCP
  • Experience managing and running multi-tenant cloud infrastructure.
  • Extensive experience developing and using Terraform in production
  • Strong troubleshooting experience and skillset to resolve incidents working across functional teams.
  • Ability to nurture and support a strong operations culture: customer focus, excellent technology, high quality implementations, self-motivated innovation, and problem-solving.
  • A positive can-do attitude and passion to succeed.
  • Experience with managing certifications and audits such as SOC2, FedRAMP is a big plus.
  • Experience with Vault, CloudFormation, and EKS a big plus.

This position involves access to software/technology that is subject to U.S. export controls. Any job offer made will be contingent upon the applicant's capacity to serve in compliance with U.S. export controls

#LI-KD1 #LI-ONSITE

About Illumio

Illumio is a cybersecurity company that provides software for securing enterprise computing environments. The company was founded in 2013 by Andrew Rubin and PJ Kirner and is headquartered in Sunnyvale, California. Illumio's software uses micro-segmentation to protect against cyber threats by creating security policies that restrict access to sensitive data and applications. The company's software is used by a wide range of clients, including financial institutions, healthcare providers, and government agencies. Illumio has received numerous awards for its innovative approach to cybersecurity, including being named a Gartner Cool Vendor in 2016.
Learn more about Illumio
Size
500 employees
Industry
Founded
2013

Similar Jobs

More Jobs at Illumio

More Information Technology Jobs

Find similar Sr. Manager, Site Reliability Engineering jobs: