Dolby Laboratories

Staff Operational Support Engineer

Dolby Laboratories$136K — $187K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in operational or customer-facing roles
  • Strong troubleshooting skills in production video streaming and OTT services
  • Familiarity with streaming protocols like HLS, DASH, and WebRTC
  • Experience with monitoring tools like Grafana and ELK
  • Ability to remain calm and decisive during high-pressure incidents

Responsibilities

  • Take ownership of escalated customer issues and drive them to resolution
  • Troubleshoot and resolve complex production incidents affecting live streams
  • Operate on production environments, making configuration changes as needed
  • Lead incident resolution efforts during customer-impacting events
  • Collaborate with teams to validate system readiness for critical events

Benefits

  • Participation in a 24/7 on-call rotation for global support
  • Collaboration with a dedicated team focused on operational excellence
  • Opportunities to work with cutting-edge AI-driven operational tools
  • Engagement in continuous improvement and feedback loops with Engineering
  • Supportive environment for professional growth and skill enhancement
Full Job Description
The Dolby Cloud Solutions organization builds technologies and innovations that easily integrate into service providers' infrastructure to make content experiences more effective, meaningful, and engaging for consumers.

Dolby OptiView is building a dedicated Operational Support (L2) team responsible for the stability,
availability, and operational excellence of our 24/7 live video streaming, ads, player, and real-time
delivery platforms.

As an Operational Support Engineer (L2), you take end-to-end ownership of customer-impacting
production incidents once they are triaged by Level 1 support. You operate directly on production
systems, lead live incident resolution, and act as the operational bridge between Support, Engineering,
DevOps, and customers, particularly during high-impact live events.

This is a hands-on, customer-facing role focused on incident ownership, production operations,
automation, and operational scalability, not just reactive troubleshooting.

Key Responsibilities

Incident & Operational Support
• Take ownership of escalated customer issues from Level 1 Support and drive them to resolution.
• Troubleshoot and resolve complex, high-impact production incidents affecting live streams,
VOD playback, ad insertion, DRM, and real-time WebRTC services.
• Operate directly on production environments, including configuration changes, CDN
adjustments, and corrective actions, following established operational procedures, including
executing mitigations and emergency changes during live incidents when customer impact
requires immediate action.
• Lead or actively contribute to live incident bridges involving customers, internal teams, and
partners.
• Provide clear, timely communication during incidents, including status updates and customer-
facing explanations.

Infrastructure as Code & Production Operations
• Work fluently with Infrastructure as Code (IaC) to understand, troubleshoot, and safely modify
production environments
• Leverage tools and frameworks such as:
  • Terraform
  • Helm
  • Kubernetes manifests
  • GitOps workflows
  • CI/CD and deployment pipelines

• Use IaC as the primary mechanism for safe, auditable, and repeatable operational changes
• Collaborate with Engineering and DevOps to improve deployment reliability and operational
safety
• Validate and execute infrastructure or configuration changes through codified workflows
AI-Driven Operations & Automation.
• Leverage AI tools and automation to enhance operational efficiency and incident response.

Contribute to and use:
  • AI-assisted incident triage and classification
  • Automated runbook execution
  • AI-based pattern detection across incidents
  • Intelligent alert correlation and noise reduction

Use AI to:
  • Generate or improve incident communications
  • Accelerate troubleshooting workflows
  • Identify recurring patterns and systemic issues

• Drive adoption of automation-first and AI-augmented operational practices
Pre-Event Planning & Operational Readiness
• Participate in pre-event readiness planning for critical customer events

Validate system readiness through:
  • Runbook checks
  • Monitoring coverage validation
  • Risk identification and mitigation planning

• Define and rehearse incident response strategies for high-risk scenarios
• Collaborate with customers and internal teams to ensure smooth event execution

On-Call & 24/7 Operations
• Participate in a 24/7 on-call rotation, including nights, weekends, and holidays, as part of a
global support model
• Ensure smooth handovers between shifts and regions
• Respond to critical alerts within defined SLAs for stream health, player errors, and delivery
infrastructure.

Root Cause & Continuous Improvement
• Perform or contribute to root cause analysis (RCA) for production incidents
• Document findings, corrective actions, and preventive measures
• Identify recurring issues and work with Engineering and Product teams to eliminate them
permanently
• Contribute to and improve runbooks, operational playbooks, and knowledge bases for all
OptiView products (Player, ads, live and real time streaming)
Collaboration & Engineering Feedback Loop
• Work closely with Engineering teams to escalate defects, validate fixes, and support production
deployments
• Provide feedback on system observability, tooling gaps, and operational risks
• Act as the operational voice during post-incident reviews

Required Skills & Experience

Technical Skills
• 5+ years of relevant experience in operational, support, or similar customer-facing roles
• Proven ability to own complex problems end-to-end and operate with a high degree of
autonomy
• Strong experience supporting production video streaming platforms, OTT services, live systems
• Solid troubleshooting skills across distributed systems (APIs, microservices, cloud infrastructure)
• Familiarity with HLS, DASH, CMAF, WebRTC, DRM and CDN architectures
• Experience working with monitoring, alerting, and logs to diagnose live incidents (Grafana,
Kibana/ELK, Prometheus, Loki)
• Correlate backend streaming metrics, player telemetry, and CDN signals to diagnose live
customer issues end-to-end.
• Comfort performing controlled changes in production environments
• Working knowledge of incident management and on-call operations
Operational Mindset
• Proven ability to remain calm, structured, and decisive during high-pressure incidents
• Strong sense of ownership and accountability for customer outcomes
• Excellent written and verbal communication skills, including customer-facing communication
during incidents

#LI-GW!

The Atlanta Area base salary range for this full-time position is $136,500-$187,400, which can vary if outside this location,plus bonus, benefits, and some roles may also include equity. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, competencies, experience, market demands, internal parity, and relevant education or training. Your recruiter can share more about the specific salary range and perks and benefits for your location during the hiring process.

Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12

About Dolby Laboratories

Dolby Laboratories, Inc. creates audio and imaging technologies that transform entertainment and communications at the cinema, at home, at work, and on mobile devices. The company develops and licenses its audio technologies, such as AAC & HE-AAC, a digital audio codec solution used for TVs, set-top boxes (STBs), personal computers (PCs), gaming consoles, mobile devices, and digital radio; AVC, a digital video codec with high bandwidth efficiency used in media devices; Dolby AC-4, an audio coding technology that delivers new audio experiences to a range of playback devices; and Dolby Atmos technology for home theaters, cinemas, device speakers, mobile devices, and headphones. Its audio technologies also comprise Dolby Digital, a digital audio coding technology that provides multichannel sound in the home; Dolby Digital Plus, a digital audio coding technology that delivers audio quality for streaming, downloaded, and broadcast content; Dolby TrueHD, a digital audio coding technology for content providers; Dolby Vision, an imaging technology for cinema, digital TV, and other consumer devices; and HEVC, a digital video codec with high bandwidth efficiency to support delivery of Ultra HD and other video content. In addition, the company designs and manufactures audio and imaging products, such as digital cinema servers, Dolby Cinema audio products, and other products for the film production, cinema, television broadcast, and entertainment industries. Further, it offers services to support theatrical and television production for cinema exhibition, broadcast, and home entertainment. The company serves film studios, content creators, post-production facilities, cinema operators, broadcasters, and video game designers. Dolby Laboratories, Inc. was founded in 1965 and is headquartered in San Francisco, California.
Learn more about Dolby Laboratories
Size
2,368 employees
Market Cap
$6.5 billion
Industry
Net Income
$317.8 million
Founded
1965
5 Year Trend
+3%
Revenue
$1.2 billion
NASDAQ

Similar Jobs

More Jobs at Dolby Laboratories

More Information Technology Jobs

Find similar Staff Operational Support Engineer jobs: