Site Reliability Engineer

Amerco   •  

Phoenix, AZ

Industry: Transportation


Less than 5 years

Posted 156 days ago

This job is no longer available.


U-HAUL is seeking for a candidate with a strong focus on delivering high levels of infrastructure services to internal development teams. Directly aligning with dev/devops teams, the candidate’s strong background in infrastructure operations will help to design, implement, and maintain software build engineering processes, product testing and staging environments, on a global Linux/Windows based production infrastructure. Position is located in the Phoenix, Arizona.

Experience with the end to end lifecycle of monitoring, configuration, and automation tools such as Satellite, Ansible, Puppet, Microsoft System Center Operations Manager (SCOM) and the Systems Center Configuration Manager (SCCM) in a medium to large computing environment is a requirement. It is more important to be a master of the methodologies, protocols, and architecture (with the ability to implement) of deployment, monitoring, and alerting than the specific toolset. This position requires senior level System Administration skills to ensure infrastructure is deployed, maintained and configured quickly and efficiently while still adhering to our core infrastructure standards. As a dedicated agile team member, the incumbent will provide input and guidance to the development process to ensure monitoring and operational considerations are built-in to every feature to achieve a secure and stable platform.

The ideal candidate will have the opportunity to help build a highly valued team within U-HAUL IT and contribute to our ability to streamline our release management process, support multiple pre-prod and R&D environments and production systems across our organization. The Site Reliability Engineer (SRE) will be part of a team of SRE’s, driving operational excellence and focusing on delivering and managing our agile environments. He/She will leverage knowledge, experience and expertise across multiple technology stacks to automate, deploy and validate dynamic release management environment with the goal of continuous improvement and optimization. He/She will possess the ability to influence change within their department and others and identify, propose, and implement best practices. The candidate will oversee agile deployments, configurations and ongoing maintenance of our pre-production and production environments, software, and equipment in our datacenters. To improve efficiencies, scripting, automation, and real world experience expertise to solve complex problems is equally important.

Primary Responsibilities:

* Dedicated member of an agile software/devops team

* Deployment, support and maintenance of development software stacks, overseeing build frameworks

* Manage and maintain enterprise infrastructure tools as the primary subject matter expert

* Deploy and manage virtualization infrastructure

* Respond to system issues related to the infrastructure and fulfill service requests

* Lead infrastructure deployments in the scrum

* Assist in facilitating datacenter activities such as system upgrades and hardware provisioning.

* Provide support, and implementation of security policies, compliance, and best practices

* Prioritize workload and resolve any technical issues/roadblocks

* Solid skills in logical troubleshooting, communication, documentation and problem resolution


  • Experience with VMWare virtualization technologies including vSphere, vCenter management suite.
  • Experience in Cloud Technologies – Private, Public, Hybrid, IaaS+, PaaS, SaaS
  • Experience with Ansible and Ansible Tower
  • Familiar with SRM, vROps, VCAC and vCloud technologies.
  • 2+ years in an agile Operations/DevOps environment
  • Bachelor’s degree in Information Technology (will consider technical training/job experience equivalents)
  • 2+ years’ RedHat Linux Systems Administration
  • Scripting languages for configuration and automation such as Python and Shell
  • 2+ years’ experience with VMWare ESX
  • Knowledge of protocols: HTTP, SSL, SSH, JMS, JDBC, REST API, etc.
  • Knowledge of SAN best practices for VMware, Windows and Linux operating system.
  • Proficiency in operating system, software, and hardware installation / configuration
  • Basic understanding of Networks (VLAN, sub netting, routing and switching)
  • Experience in automation of key functions, including back-up, continuous integration, provisioning is a huge plus
  • Continuous integration tools – experience with Perforce-Jenkins-Nolio and VMWare tools is a plus
  • Willing to work under different technologies and take up new technology responsibilities outside the core skills
  • Fluent English and high oral and written communication skills
  • Ability to interact with various levels of professionals
  • Ability to work under pressure in a fast paced environment and meet tight deadlines
  • Ability to act independently to drive IT goals and changes
  • Advanced troubleshooting methodology
  • Be able to judge priorities and adjust their work accordingly
  • Identify and escalate situations requiring urgent attention
  • Good understanding of networking and storage technologies related to databases
  • General server and network hardware components including rack mounted servers, blade systems, storage, and networking

Experience with these technologies a plus:

  • Being able to work cross platform, with Windows and Linux. This helps understand hybrid platform environment and thus help design considerations. Certification is preferable (RHCE or likewise)
  • Familiarity with monitoring and analysis solutions such as Extrahop, Solarwinds, NetMon,
  • Familiarity with .Net application development
  • Cisco UCS experience
  • Cisco networking
  • VMWare
  • Nutanix
  • vSAN
  • Work experience in eCommerce a plus
  • Configuration management and automation using tools such as Puppet, Chef, Salt, Ansible