In this role, the selected candidate will provide product reliability leadership for New Product Development and Technology Development, while supporting Product Engineering, Process Engineering and Supply Chain Management for existing products, processes and parts.
In this role, the selected candidate will communicate effectively with fellow SREs and other engineering teams, and describe problems succinctly yet with enough detail to hand off an ongoing problem to another team or a peer for completion.
In this role, you will enable and lead the critical incident and problem management process, engaging appropriate colleagues, vendors, and leadership teams to restore service, manage root cause analysis, and recommend long-term fixes.
The selected candidate will be responsible for creating internal monitoring and alerting platforms for our product engineering teams, and for developing tools and strategies for incident management and response.
In this role, you'll work collaboratively with software engineering to deploy and operate our systems. You will help automate all the things and streamline our operations and processes; build and maintain tools for deployment, monitoring and operations; and troubleshoot and resolve issues in our dev, test and production environments.
In this role, the selected candidate will support the Regional Reliability and Maintenance team by establishing a culture of safety in daily work and in the execution of all aspects of the team's role requirements.