Site Reliability Engineer

5 - 7 years experience  • 

Salary depends on experience
Posted on 03/21/18
Bellevue, WA
5 - 7 years experience
Salary depends on experience
Posted on 03/21/18

At Smartsheet, we are building the next generation workspace collaboration platform. Our InfrastructureEngineering team is committed to operational excellence and delivering a world class customer experience. We're in an exciting high growth stage and now is the best time to join our team. Learn more about us with this short video overview of Smartsheet: Smartsheet Overview Video.

We are currently looking for a Site Reliability Engineer to join our Site ReliabilityEngineering team. In this position, you will directly impact the reliability and performance of our critical production application systems; supporting 24/7 delivery to over 70,000 customers worldwide. We’re looking for a motivated individual to manage deployment of configuration management solutions, resolve production escalations, and iterate on improving both production and pre-production environments. This position will report directly to the Site Reliability Manager and is located at our Bellevue, WA headquarters.

Responsibilities:

  • Participate in a follow-the-sun rotation providing 24x7 production support
  • Troubleshoot, investigate, and fix production issues in cloud and hosted environments, including both hardware and internal software issues
  • Respond to automated system alerts, effectively troubleshoot system errors and work incidents to return systems to normal operating conditions
  • Manage customer support and development escalations; working directly with Sustaining Engineering
  • Track issues through the ticketing systems and follow through to resolution
  • Ensure production changes are documented, fully tested in non-production environments, and adhere to change control and audit requirements
  • Participate and support multiple teams in incident management, PIR, deployment and change processes
  • Investigate security and compliance concerns, in accordance with company policies

Qualifications:

  • 4+ year of work experience with production Linux systems administration
  • 2+ year of experience with at least one scriptinglanguage (e.g., Bash, Python, Ruby, Go )
  • Highly motivated, critical thinker with proven ability to troubleshoot and solve problems in a production support environment
  • Ability to successfully manage competing priorities in critical incident situations
  • Proficient with basic internet protocols (e.g., HTTP, DNS, TCP/IP)
  • Proficient with config management, source control and containerization tools
  • Working knowledge of agile, scrum and ITIL service management methodologies
  • Strong desire to learn and understand new technologies
  • Excellent verbal and written communication skills
  • Ability to work in the U.S. on an ongoing basis
  • Bachelor’s degree in Computer Science or related discipline required
Not the right job?
Join Ladders to find it.
With a free Ladders account, you can find the best jobs for you and be found by over 20,0000 recruiters.