Site Reliability Engineer II in Seattle, WA

$150K - $200K(Ladders Estimates)

Conversica   •  

Seattle, WA 98160

Industry: Enterprise Technology

  •  

8 - 10 years

Posted 48 days ago

SREs utilize software and systems engineering to implement better production systems with best practices and tools like white and black box monitoring, system resiliency, load balancing and high availability, failure mode and effects analysis (FMEA), incident management, risk and dependency mapping, and predictive service provisioning and capacity planning. We are looking for an experienced engineering professional who thrives on driving innovation & best practice within a fast-paced, highly-dynamic culture. You should be a strong multi-tasker and a collaborative thought-leader. The ideal candidate has a strong passion for technology, stability, and transforming platforms.

Responsibilities:

  • Engage in and improve the whole life cycle of services—from inception and design, through deployment, operation and refinement. Influence design & architecture to proactively prevent system failures
  • Support services prior to launch through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and deployment review
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
  • Practice sustainable incident response and blameless postmortems
  • Develop and maintain effective instrumentation tools & dashboards
  • Help improve and maintain high service up-time using AWS while developing and evangelizing company-wide standards for services
  • Build and scale highly-available, distributed micro-services with high quality of service for customers
  • Assist in troubleshooting failures and performance issues across all services, while suggesting and applying preventive measures
  • Participate in scrum teams as service level owner and provide consultation on using the services maintained by the SRE team

Qualifications:

  • 5+ years of managing distributed SaaS systems in public and private cloud environments. AWS experience preferred
  • 1+ years of experience in Kubernetes / Docker environment
  • 7+ years of experience with Unix / Linux system administration
  • 2+ years of practical experience building continuous delivery pipelines
  • Experience with at least one of the following: Python, Go, PHP, Ruby, Java, C, C++
  • Experience with algorithms, data structures, complexity analysis, and software design
  • Interest in designing, analyzing, and troubleshooting large-scale distributed systems
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
  • Highly analytical, detail oriented, with the ability to work with complex logic to debug & optimize code and automate routine tasks while working under pressure to meet tight deadlines
  • Practical experience with MySQL, AWS Aurora, or other RDBs
  • BS degree in Computer Science / Engineering or related technical field involving coding or equivalent practical experience
  • Experience with configuration management tools like Terraform, Ansible, or Puppet is required
  • Experience with full stack development is preferred


Valid Through: 2019-10-23