DraftKings

Principal Site Reliability Engineer

DraftKings$200K — $250K *
US-AnywhereRemote in United States
Enterprise Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's Degree in Computer Science or related field.
  • 8+ years experience with distributed cloud and on-premise infrastructure.
  • 3+ years in a Staff, Principal, or equivalent role.
  • Deep expertise in Kubernetes and large-scale production operations.
  • Experience with AWS and Google Cloud Platform utilizing Infrastructure as Code tools.
  • Proficient in Go, Python, and software development best practices.
  • Strong leadership skills and experience mentoring engineering teams.

Responsibilities

  • Define and execute long-term strategy for Kubernetes platform.
  • Drive architectural decisions for critical infrastructure components.
  • Lead cross-team platform initiatives with measurable outcomes.
  • Establish reliability practices aligned with business priorities.
  • Build automation-first infrastructure with GitOps and self-healing systems.
  • Champion AI-powered tools for operational efficiency and incident response.
  • Mentor senior engineers and influence technical strategy across the organization.

Benefits

  • Guidance through the gaming license acquisition process if required.
  • Opportunities for growth in a regulated technology company.
  • Exposure to a diverse tech environment with potential for innovation.
  • Participation in a company driven by operational excellence and reliability.
Full Job Description
The Crown Is Yours

As a Principal Site Reliability Engineer, you'll shape the long-term strategy for the infrastructure behind one of the most demanding platforms in sports betting and gaming. You'll drive the architectural direction of our cloud and on-premise platforms, helping engineering teams build, deploy, and operate highly reliable systems at scale. Working across Platform Engineering and Site Reliability Engineering, you'll influence how we modernize our infrastructure, strengthen operational excellence, and prepare our platform for the next generation of growth.

What you'll do
  • Define and execute the long-term strategy for our Kubernetes platform across Google Kubernetes Engine, Amazon Elastic Kubernetes Service, RKE2, and on-premise environments, ensuring reliability, scalability, and operational consistency.
  • Drive architectural decisions across critical infrastructure, including cluster lifecycle management, networking, identity and access management, observability, autoscaling, capacity planning, and cost optimization.
  • Lead large-scale platform initiatives across multiple engineering teams, establishing technical direction, engineering standards, and measurable outcomes that improve platform reliability and developer experience.
  • Establish and evolve reliability practices by defining service level objectives, service level indicators, and error budget frameworks that align platform performance with business priorities.
  • Build automation-first infrastructure through Infrastructure as Code, GitOps workflows, self-healing systems, and internal platform tooling that improve engineering velocity and reduce operational overhead.
  • Champion the responsible adoption of AI-powered engineering capabilities that improve operational efficiency, accelerate incident response, and enhance developer productivity.
  • Lead critical platform incidents, drive post-incident improvements, and strengthen platform resilience through automation, capacity planning, and operational excellence.
  • Mentor senior engineers, influence technical strategy across the organization, and elevate engineering excellence through architecture reviews, coaching, and technical leadership.


What you'll bring
  • A Bachelor's Degree in Computer Science or a related technical field.
  • At least 8 years of experience designing, operating, and scaling distributed cloud and on-premise infrastructure, including at least 3 years operating at the Staff, Principal, or equivalent technical leadership level.
  • Proven experience leading large-scale infrastructure or platform initiatives that require cross-functional alignment and long-term technical ownership.
  • Deep expertise with Kubernetes, including cluster architecture, networking, storage, security, operators, lifecycle management, and large-scale production operations.
  • Extensive experience building and operating production infrastructure in AWS and Google Cloud Platform using Infrastructure as Code technologies such as Terraform, Pulumi, or similar tools.
  • Strong software development experience in Go, Python, or both, with expertise in GitOps, continuous integration and continuous delivery, observability, distributed systems, Linux, and reliability engineering principles.
  • Experience incorporating AI-powered tools into engineering workflows while applying sound judgment around reliability, security, and operational risk.
  • Exceptional communication and leadership skills with a proven ability to mentor engineers, influence technical strategy, and drive engineering excellence. Experience working in regulated industries, hybrid cloud environments, contributing to open-source projects, or holding cloud certifications is preferred.


#LI-MF1

Join Our Team

We're a publicly traded (NASDAQ: DKNG) technology company headquartered in Boston. As a regulated gaming company, you may be required to obtain a gaming license issued by the appropriate state agency as a condition of employment. Don't worry, we'll guide you through the process if this is relevant to your role.

The US base salary range for this full-time position is 200,000.00 USD - 250,000.00 USD, plus bonus, equity, and benefits as applicable. Our ranges are determined by role, level, and location. The compensation information displayed on each job posting reflects the range for new hire pay rates for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific pay range and how that was determined during the hiring process.

About DraftKings

DraftKings is an American daily fantasy sports contest and sports betting operator. The company allows users to enter daily and weekly fantasy sports–related contests and win money based on individual player performances in five major American sports, Premier League and UEFA Champions League soccer, NASCAR auto racing, Canadian Football League, the XFL, mixed martial arts and Tennis. In August 2018, DraftKings launched DraftKings Sportsbook in New Jersey becoming the first legal mobile sports betting operator in the state. Since launching in New Jersey, DraftKings has opened mobile sports betting operations in Indiana, Pennsylvania, West Virginia and opened in New Hampshire December 30, 2019 after reaching contract with the New Hampshire Lottery. Retail sports betting is available in Iowa, Mississippi and New York. DraftKings Sportsbook mobile and retail sports betting products allow bettors in each state engage in betting for most major U.S. and international sports. As of April 2016, the majority of U.S. states consider fantasy sports a game of skill and not gambling. In November 2016, FanDuel and DraftKings, the two largest companies in the daily fantasy sports industry, reached an agreement to merge. However the merger was terminated in July 2017 due to it being blocked by the Federal Trade Commission as the combined company would have controlled a 90 percent of the market for daily fantasy sports. As of July 2017, DraftKings had eight million users.
Learn more about DraftKings
Size
3,400 employees
Market Cap
$4.9 billion
Industry
Net Income
-$577.9 million
Revenue
$292.3 million
NASDAQ

Similar Jobs

More Jobs at DraftKings

More Enterprise Technology Jobs

Find similar Principal Site Reliability Engineer jobs: