Principal SRE

Gradial

• $180K — $240K *

Seattle, WA 98115In-Person

Information Technology

5 - 7 years of experience

Reposted Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years in SRE, DevOps, platform engineering, or infrastructure roles with direct production system oversight.
Proven success in high-growth environments with production-grade infrastructure design and operation.
Expertise in Kubernetes, cloud-native architecture, and container orchestration.
Experience with infrastructure as code, GitOps, CI/CD workflows, and modern deployment practices.
Strong knowledge of observability and reliability fundamentals, including monitoring and incident response.
Leadership experience in influencing engineering teams and making critical technical decisions.

Responsibilities

Own reliability, scalability, and operational health of Gradial's production platform.
Lead evolution of Kubernetes, CI/CD, observability, and infrastructure as code.
Establish standards for designing and operating reliable systems.
Build tooling and automation for faster engineering processes.
Drive monitoring, alerting, incident response, and service readiness improvements.
Collaborate with engineering to identify and mitigate scaling risks early.
Influence platform direction focusing on reliability, security, performance, and cost.

Benefits

Comprehensive health, dental, and vision coverage.
401K retirement plan and paid time off.
Dynamic work environment emphasizing autonomy and ownership.
Opportunity to work on groundbreaking AI infrastructure.
Real impact in a high-growth setting with minimal bureaucracy.

Full Job Description

The Role

As a Principal Site Reliability Engineer at Gradial, you will shape the foundation our platform runs on as we scale. You will work closely with the CTO and engineering team to make our systems faster, more resilient, and easier to operate in a high-growth environment. This is a hands-on IC leadership role for someone who wants real ownership, high leverage, and the chance to define how reliability looks at an AI-native company.
What You'll Own

Own the reliability, scalability, and operational health of Gradial's production platform.
Lead the evolution of Kubernetes, CI/CD, observability, and infrastructure as code across the stack.
Set the standard for how we design, ship, and operate reliable systems.
Build the tooling and automation that help engineers move faster with more confidence.
Drive improvements in monitoring, alerting, incident response, and service readiness.
Partner with engineering to spot scaling risks early and solve them before they slow us down.
Influence the long-term direction of our platform across reliability, security, performance, and cost.

What We're Looking For

5+ years of experience in SRE, DevOps, platform engineering, or infrastructure roles with direct ownership of production systems.
Proven success designing and operating production-grade infrastructure in fast-moving, high-growth environments.
Deep expertise in Kubernetes, cloud-native architecture, and container orchestration.
Strong experience with infrastructure as code, GitOps, CI/CD workflows, and modern deployment practices.
Strong command of observability and reliability fundamentals across metrics, logging, tracing, alerting, and incident response.
A track record of leading through influence, making sound technical decisions, and raising the bar across engineering teams.

Nice to Have

Familiarity with AI or ML infrastructure, including GPU provisioning, model deployment, or compute-intensive workloads.
Experience supporting cloud or multi-cloud environments with a focus on resilience and scale.
Comfort with TypeScript or Python for internal tooling and operational automation.

The salary range for this position is $180,000 - $240,000 annually. Final compensation will be determined based on factors such as experience, skills, and qualifications. In addition to base salary, this role may be eligible for performance-based bonuses and equity awards. Gradial offers a comprehensive benefits package, including medical, dental & vision insurance, 401K retirement plan, paid time off, paid sick leave and other employee wellness programs.

You'll thrive here if you...

Embrace AI as a core tool for problem-solving, creativity and scale.
Show a strong work ethic, high ownership and bias toward action.
Communicate with clarity and curiosity.
Thrive in fast-paced, hyper-growth environments; where building is always better than maintaining the status quo.

What we offer

Meaningful equity and competitive salary
Comprehensive health, dental and vision coverage
Fast-paced environment with autonomy and ownership
Real impact, zero bureaucracy
A front-row seat to building category-defining AI infrastructure

AI Literacy & Interviewing Tools

As an AI-first company, we prioritize AI literacy as a core competency in our hiring decisions. We're excited by candidates who thoughtfully apply AI tools in their work, but during interviews we're focused on you. This is your opportunity to show how you think, communicate, and solve problems. Over-reliance on AI-generated responses during the interview process (especially when it obscures your own voice) will result in disqualification. We want to understand your unique perspective and how you approach challenges, both with and without AI.

* Ladders Estimates

Similar Jobs

Sr Operations & Integration Engineer
$145K — $203K *
Blue Origin
Seattle, WA 98115 (King County)
Today
Senior System Engineer
$107K — $195K *
Leidos Holding
Remote
Yesterday
Staff Systems Engineer - Airborne Cybersecurity Requirements (2026-100)
$132K — $199K *
Astronics
Kirkland, WA 98034 (King County)
Yesterday
Principal Platform Engineer
$96K — $207K *
Fifth Third Bancorp
Remote
Reposted Yesterday
Staff Engineer - Capacity Planning and Management
$110K — $230K *
Geico
Seattle, WA 98115 (King County)
Reposted Yesterday
Systems Development Engineer, Amazon Leo
$151K — $204K *
Amazon
Bellevue, WA 98006 (King County)
Yesterday

Get Ready For Your
Next Interview

More Jobs at Gradial

Principal SRE
$180K — $240K *
Seattle, WA 98115 (King County)
Reposted Today
Information Technology
In-Person
Senior Enterprise Account Executive
$280K — $380K *
Remote
Reposted Yesterday
Enterprise Technology
Remote in United States
Senior Integrations Engineer
$150K — $185K *
Baltimore, MD 21215 (Baltimore City County)
2 days ago
Enterprise Technology
In-Person
Senior Integrations Engineer
$150K — $185K *
Seattle, WA 98115 (King County)
2 days ago
Enterprise Technology
In-Person
Senior Forward Deployed Engineer
$130K — $210K *
Seattle, WA 98115 (King County)
1 week ago
Enterprise Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
1 week ago
Senior Manager, IT Risk & Compliance
$123K — $164K *
DOCS Health
Saint Paul, MN 55106 (Ramsey County)
Today
Senior QA Test Analyst
$120K — $165K *
ERCOT
Taylor, TX 76574 (Williamson County)
Today
Project Manager
$70K — $95K *
Red River
Remote
Today
Sr Network Engineer
$108K — $159K *
Entrust Datacard
Colorado Springs, CO 80918 (El Paso County)
Today

Find similar Principal SRE jobs:

Nationwide Seattle, WA

Principal SRE

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Principal SRE jobs:

Get Ready For Your
Next Interview