As part of our Operations team, the Site Reliability Engineer oversees our exciting E-Commerce branded platforms, where customers can purchase our products online and receive them physically or digitally. The SEIII is the leading subject matter expert on all the platforms owned by our team, whether hosted onsite or in the Cloud. The focus of this job is to leverage tools that automate much of our operations management and ensure the highest reliability and performance for our sites. The SEIII will use various analytics tools to measure our performance. This position is central to our platforms, and the person in it will act as a coach for the team to ensure the systems are operating optimally and protecting our brands. This is an exciting and rapidly expanding part of our business, and the right candidate will help us continue to grow!
Responsibilities
- Implement and improve CI/CD tooling to drive fully automated releases to Production
- Craft tools to automate our operations and maintain high availability
- Lead and guide the team as an expert in Azure Cloud computing
- Drive efforts for monitoring, performance, capacity planning and disaster recovery
- Liaise closely with the development team and system analysts to ensure two-way communication
- Understand and participate in an Agile or Lean development life cycle
- Work with Dev and release management to ensure an efficient delivery pipeline
- Improve the reliability, quality and performance of our e-commerce sites and measure our effectiveness using tools like Glassbox and Google Analytics. Make recommendations for improvement where appropriate.
- Serve as the SME (subject matter expert) for our supported applications
- Assist the system analyst team members to research and solve complex incidents escalated to the team from our tier one and two partners, using Splunk, Opsview & Dynatrace
- Work under tight deadlines while delivering high-quality work
- Assist in analyzing system metrics and make suggestions for monitoring & alerts
Qualifications
- Experience deploying and managing infrastructure in the Cloud, with Azure experience preferred
- Configuration Management experience with tools like Ansible, Chef, Puppet or similar
- Strong Linux system administration background
- Experience implementing and maintaining CI/CD tools such as Jenkins or Bamboo
- Strong understanding of and experience with SQL and MySQL databases
- Experience deploying and maintaining Redis or Varnish caching solutions
- Experience participating in the full lifecycle of projects, including effective use of version control, build management, unit testing, and issue tracking software (SVN, Maven, JIRA, Jenkins)
- Experience with Git, GitHub and GitHub Admin
- Understanding of and experience with software development best practices
You're a Super Star If You Have The Following:
- Experience with Nginx, PHP
- Experience with Magento
- Experience with Java: Spring Boot, Spring Batch, JPA/Hibernate