Location: 5 on-site days a week at our Sunnyvale, CA headquarters.
Your Impact:
We are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in the AWS and Azure cloud platforms to play a key role in ensuring the reliability, scalability, and performance of our cloud-based systems and applications.
The ideal candidate will have hands-on experience supporting and managing AWS and Azure infrastructure, along with a passion for automation, continuous improvement, and collaboration with cross-functional teams.
If you are passionate about the AWS and/or Azure cloud platforms and have a track record of driving reliability, scalability, and performance in cloud-based environments, we'd love to hear from you. Apply now to be a part of our talented team!
- Monitor system performance, application health, and infrastructure metrics using monitoring and logging services, and implement proactive measures to optimize performance and availability
- Take on-call duty for production uptime and support customer escalations
- Carry out release upgrades and maintenance activities, including hotfixes and infrastructure updates
- Lead incident response and resolution efforts, conducting root cause analysis, implementing corrective actions, and documenting post-incident reviews
- Implement security best practices and controls in the cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
- Drive continuous improvement initiatives to enhance reliability, scalability, and efficiency of infrastructure and services, leveraging automation and emerging technologies
Your Toolkit:
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent work experience
- 5+ years of experience working as a Site Reliability Engineer (SRE) or in a similar role, with a focus on the AWS and/or Azure cloud platforms
- Hands-on experience in designing, deploying, and managing AWS and/or Azure infrastructure, including compute, storage, networking, and security services
- Proficiency in scripting and programming languages such as PowerShell, Python, or Go for automation and infrastructure management tasks
- Strong understanding of CI/CD principles and experience with tools such as Azure DevOps, Jenkins, or GitLab CI/CD
- Experience with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture in AWS and Azure environments is a plus
- Excellent analytical, problem-solving, and communication skills, with the ability to collaborate effectively with cross-functional teams
- AWS or Azure certifications such as AWS/Azure Solutions Architect, Azure DevOps Engineer, or Azure Security Engineer are preferred