DevOps / SRE Manager

Cynet Systems   •  

El Segundo, CA

Industry: Technology

  •  

5 - 7 years

Posted 26 days ago

  by    Cynet Sytems

We are looking for DevOps / SRE Manager for our client in El Segundo, CA

Job Title: DevOps / SRE Manager

Job Location: El Segundo, CA

Job Type: Contract 12 Months

Job Description:

  • SRE work: Monitoring, Logging, Incident management, Communication, RCA, etc.
  • DevOps: CICD, Deployments
  • Tech: Windows, Some Unix, AWS, TFS, Git, Jenkins
  • Manage/Lead team of 4-5 Engineers and potentially may grow, Interact with Sr. leadership from Product and Engineering

Responsibilities:

  • Manage a team of SREs and lead by example - contributor more than a delegator
  • Employ deep troubleshooting skills to improve the availability, performance, and security of Services.
  • Collaborate with Product and Support teams to plan and deploy product releases readiness
  • Work with Cloud Platform and Operations leaders to develop narratives, backlog grooming, epic planning and overall sprint planning processes
  • Work with Engineering leadership to build shared services that meet the requirements and need of the platform and application teams
  • Ensure services are designed with 24/7 availability and operational readiness and rigor
  • Implementation of proactive monitoring, alerting, trend analysis and self-healing systems
  • Define non-functional requirements as part of the product life cycle to influence the new designs, standards, and methods for scalable, highly available distributed systems
  • Contribute to product development / engineering as needed to ensure Quality of Service of Highly Available services
  • Identifies, evaluates and executes preventive measures to minimize/avoid impact to the Customers experience. Proactive v/s Customer escalated Resolution of product/service defects or design changes, infrastructure changes, or operational changes

Requirements:

  • 5+ years of Systems/Applications automation in 24x7 Production Services environments
  • BS in Computer Science, Computer Engineering, Math, or equivalent professional experience
  • Fluency with one or more current generation scriptinglanguage used by DevOps professionals (Python, Perl, PHP, Ruby) + Java Development and/or .NET
  • Excellent troubleshooter, utilizing a systematic problem-solving approach
  • Demonstrated experience in designing, analyzing, and diagnosing large-scale distributed systems + Windows Server systems internals (system libraries, file systems, client-server protocols)
  • Experience operating on AWS (both PaaS and IaaS offerings)
  • Experience in both Windows (2k8R2+) and Security triage & forensic analysis
  • Experience with Continuous Integration and Continuous Delivery concepts, including Infrastructure as code utilizing tools like Terraform, Cloudformation and Chef/SaltStack
  • Expert in Containerization concepts like Docker, and PaaS services on AWS.
  • Experience with elastically scalable, fault tolerance and other cloud architecture patterns
  • Demonstrated strength in SaaS services, experience in massive scale web operations