Sr. Manager, Systems Reliability Engineering

Walt Disney   •  

Seattle, WA

Industry: Media


5 - 7 years

Posted 171 days ago

This job is no longer available.

Job ID 563067BR

Job Summary:

Do you want to apart of a team that creates magic for millions of guests across all of the Disney brands? Behind the scenes, the Enterprise Technology team helps deliver magical digital & physical experiences leveraging the latest technology; and our teams provide expert engineering services in cloud, automation, and systems reliability engineering to support the innovation and operation of The Walt Disney Company. We are passionate about ensuring our systems provide the best guest experience!

The Sr. Manager of Systems Reliability Engineering will be responsible for managing a team of high performing systems reliability engineers, supporting a number of complex, large scale consumer platform systems, infrastructure and applications for The Walt Disney Company. This is a fast-paced, dynamic environment and we're looking for a leader to help set the operational direction for a team that supports all client facing platforms across various business segments. Our teams protect, operate, and continuously improve the automation and systems that run Disney’s experiences, products, & services with a focus on availability, latency, automation, & cross-company collaboration while embracing a DevOps culture. This position reports into the Director of Systems Engineering who also reports into the VP, Cloud Systems Engineering & Automation Services. You will join an organization comprised of engineering teams scattered across Disney offices based in Seattle, Burbank, Orlando, Bristol and New York.


  • Leading and inspiring a team through an SRE (Systems Reliability Engineering) transformational journey
  • Guiding team to architect, design, and code cloud systems, technologies, and leverage best practices
  • Setting the strategy for developing reusable frameworks, and defining the telemetry and operational analytics
  • Owning, communicating, and resolving issues that impact design, product success, or address future concepts, products, or technologies
  • Partnering with product and platform teams to engineer and design for resiliency
  • Working with cloud providers and vendors on future roadmap and feature requests
  • Identifying, experimenting, & evangelizing new technologies, ideas, and best practices across the larger engineering & architecture community
  • Collaborating and providing leadership within and across teams and organizations
  • Proficient, collaborative, & experienced in leading complex teams that build reliable, scalable, micro-service-oriented systems
  • Finding ways to leverage technology while constantly learning
  • Building relationships with engineering colleagues and thoughtfully present to senior executives

Basic Qualifications:

  • Understanding of configuration management and orchestration (e.g. Chef, Terraform, Cloud Formation); container platforms (e.g. Docker, Kubernetes, Mesos, Elastic Container Service); Cloud/PaaS Environments (e.g. AWS, Google Cloud Compute, & Azure)
  • Ability to identify root cause sources of instability in a high-traffic, large-scale distributed system
  • Seasoned experience in the systems or software engineering space
  • You spent at least 3 years architecting solutions in the cloud
  • 5 years of experience managing engineering teams responsible for supporting and deploying internet based products or services
  • Working knowledge of core internet protocols including, but not limited to TCP/IP, DNS and HTTP
  • Excellent written and verbal communications, proven ability to develop and deliver presentations geared for Senior and Executive Management
  • Strong interpersonal communication skills and proven experience managing multiple teams in highly matrixed organizations
  • Understanding of advanced scripting and programing languages such as Python, Ruby, or JAVA
  • Understanding of configuration management frameworks such as Chef, Puppet, Ansible
  • Experience managing multiple hosting environments including public and private cloud solutions
  • Experience working with product managers, architects, and other stakeholders, driving the development of product roadmaps.

Preferred Qualifications:

  • 7 years of experience leading multiple or cross functional engineering teams responsible for supporting and deploying internet based products or services and 3-5 years of hands on previous experience developing or supporting enterprise online products and services
  • Proven experience managing software development lifecycle platforms and tools
  • Working knowledge of advanced scripting and programing languages such as Python, Ruby, or JAVA
  • Proven ability to manage to a budget

Required Education

  • Equivalent experience in technical operations or software engineering

Preferred Education

  • Bachelor’s degree in computer science or related field