Site Reliability Engineer Lead/Manager

Okta   •  

San Francisco, CA

Not Specified years

Posted 242 days ago

This job is no longer available.

Okta is seeking a Site Reliability Manager or Team Lead to build and lead a Site Reliability Engineering team in London. The team will share responsibility with our North American based teams for both operating our existing service and developing and automating new components. 
The ideal candidate:

• Has experience in off-premise cloud-based infrastructure such as Amazon Web Services
• Has operated complex custom applications on UNIX/Linux and/or Enterprise Java platforms
• Is passionate about automation and leveraging agile software development methodologies to deliver automation
• Has experience leading Technical Operation teams whilst still being hands-on and able to take on IC level tasks and deliverables as needed
Job Duties and Responsibilities:

• Operations Lead:
o Supervise day-to-day monitoring, maintenance (incl. application releases), and troubleshooting of our application
o Continuously refine monitoring processes, thresholds, and configuration
o Create and maintain documentation and runbooks
• Manage delivery of new infrastructure components:
o Resource planning
o Design and code reviews
o Participate in Scrum processes and ceremonies
• Respond to issues and escalations
• Participate in on-call rotation
• Work closely with product developers to ensure new features have the proper operational support and maintainability
• Partner with Recruiting and Executive Leadership to build a top-notch Engineering hub in London

Minimum REQUIRED Knowledge, Skills, and Abilities:
• Demonstrate a track record of leading or Managing at team
• Experience as a Linux Systems Administration 
• Proficient in at least one scripting language (bash, Perl, Ruby, Python)
• Experience operating and troubleshooting a complex, multi-tier service running across multiple data centers
• Prior experience in software development, DevOps role, or SRE role

Nice to have:
• Working knowledge of distributed version control systems such as Git
• Familiarity with continuous integration and deployment tools such as Jenkins, Maven, Artifactory, and Ansible
• Experience using Agile practices
• Experience with modern open source infrastructure services and concepts such as Redis, ElasticSearch, and Docker

Okta is an Equal Opportunity Employer