Roku

Senior AI DevOps Engineer

Roku$120K — $150K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience operating CI/CD infrastructure at scale, preferably with GitLab CI.
  • Deep understanding of mobile build systems like Xcode/xcodebuild and Gradle.
  • Strong scripting skills in Python and Bash.
  • Advanced proficiency with AI-assisted development tools.
  • Experience designing autonomous, self-healing systems.

Responsibilities

  • Design and maintain CI/CD pipelines for mobile apps on GitLab CI.
  • Architect pipeline stages for efficient execution and deployment.
  • Build smart test routing to optimize testing workflows.
  • Implement flaky test detection and management systems.
  • Automate release processes to minimize manual intervention.
  • Manage device orchestration to enhance CI job efficiency.
  • Develop self-healing automation for network and device reliability.

Benefits

  • Access to global mental health and financial wellness support.
  • Comprehensive healthcare options including medical, dental, and vision.
  • Life, accident, and disability insurance coverage.
  • Retirement options like 401(k) and pension plans.
  • Support for personal leave and time-off needs.
Full Job Description
About the team

Join a high-performing, innovative team that plays a pivotal role in Roku's mission to be the best TV streaming platform in the world. Our team is responsible for delivering intuitive, high-quality mobile applications that enhance the way millions of users interact with Roku devices globally. We pride ourselves on creating products that "just work" - seamlessly and effortlessly. This commitment to excellence is driven by a collaborative, inclusive, and results-oriented culture where your contributions will directly impact the user experience of millions. If you're passionate about building products that feel magical and intuitive, this is the team for you.

About the role

The mobile team ships the Roku Remote, Smart Home, and Howdy apps on iOS and Android. You own the CI/CD pipeline and QA automation infrastructure for the mobile engineering team. Your role will use AI to design and build a completely autonomous, self-healing CI/CD and QA automation pipeline for multiple products with millions of users. You treat AI as your primary design tool not an add-on. Every system you build should minimize human intervention, from code push to app store submission. You'll own CI Pipeline Architecture: the path from git push to a green or red signal. Your job is to make that path fast, reliable, and cheap. That includes QA Automation & Device Orchestration: the software systems that schedule, monitor, and recover the test infrastructure.

What you'll be doing
  • Design and maintain CI/CD pipelines for iOS and Android on GitLab CI
  • Architect pipeline stages for fail-fast execution: cheapest checks first (lint, compile, static analysis), expensive checks last (device farm tests)
  • Build smart test routing: analyze MR diffs to determine which tests need physical devices and which can run on emulators, so 80% of MRs never touch the device farm
  • Build flaky test detection and quarantine systems. Classify failures as infrastructure-caused vs. code-caused so engineers trust the signal
  • Automate release mechanics: code signing, versioning, TestFlight/Play Console uploads, dSYM and mapping file management. The goal is zero manual steps between merge and app store submission
  • As agent-authored MR volume grows, ensure pipelines absorb the increase without degrading speed or starving human-authored MRs of resources
  • Build the device reservation and orchestration system that assigns devices to CI jobs, prevents contention, and maximizes utilization without manual scheduling
  • Design self-healing automation: health checks detect unresponsive devices, trigger remote recovery via API, and re-register them no human intervention required
  • Define the device compatibility matrix which firmware/model combinations require real hardware, and which can run on emulators
  • Implement priority-based test routing: device-touching MRs get farm time, UI-only MRs never queue for a device
  • Use AI to identify failure patterns, predict infrastructure issues, and continuously optimize pipeline performance


Were excited if you have
  • 5+ year's operating CI/CD infrastructure at scale, preferably GitLab CI
  • Ability to travel up to 20%
  • Deep understanding of mobile build systems (Xcode/xcodebuild, Gradle) and mobile-specific CI challenges (code signing, provisioning, multi-platform builds)
  • Strong scripting (Python, Bash) and ability to build internal tooling reservation systems, health monitors, pipeline analytics dashboards
  • Advanced proficiency with AI-assisted development (Copilot, Claude Code, Cursor, or equivalent) you use AI as your default approach to writing code, building systems, and solving infrastructure problems
  • Experience designing autonomous, self-healing systems that detect, diagnose, and recover from failures without human intervention
  • AI-first problem solving where your instinct is to automate with AI before adding manual process or headcount
  • Obsession with developer experience.You measure your success by how fast and reliably engineers get feedback, not by how complex your infrastructure is
  • Data-driven decision making. You measure failure rates, waste rates, device utilization, and pipeline duration and you use those numbers to prioritize your work

Preferred Qualifications
  • Experience with infrastructure-as-code (Terraform, Ansible, or equivalent) for managing cloud and on-premises infrastructure
  • Working knowledge of WiFi and BLE protocols enough to understand why tests that exercise radio communication behaves differently from pure software tests
  • Experience with mobile test automation frameworks (XCUITest, Espresso, Appium) not to write tests, but to understand what they need from infrastructure
  • Experience scaling CI for high-volume, automated code generation (agentic engineering, bot-authored MRs)


#LI-DH2

Our Hybrid Work Approach

Roku fosters an inclusive and collaborative environment where teams work in the office Monday through Thursday. Fridays are flexible for remote work except for employees whose roles are required to be in the office five days a week or employees who are in offices with a five day in office policy.

Benefits

Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Employees are supported in taking time off, in accordance with local leave policies and other personal needs to support their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.

About Roku

Roku is an American consumer electronics company founded in 2002. The company is best known for its streaming devices that allow users to access internet-based video content on their televisions. Roku's devices are available in several models and are sold in the United States and other countries. The company also offers a streaming service called The Roku Channel that features a selection of movies and TV shows. Roku went public in 2017 and is traded on the NASDAQ stock exchange.
Learn more about Roku
Size
3,000 employees
Market Cap
$5.5 billion
Industry
Net Income
-$17.5 million
Founded
2002
5 Year Trend
+47.3%
Revenue
$1.7 billion
NASDAQ

Similar Jobs

More Jobs at Roku

More Information Technology Jobs

Find similar Senior AI DevOps Engineer jobs: