Senior Site Reliability Engineer

Cox Enterprises • $111K — $186K *

Austin, TX 78745In-Person

Information Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

7+ years in SRE, DevOps, or platform engineering roles
Strong programming skills in Python, Go, or Java
Hands-on experience with a major cloud provider (AWS, GCP, Azure)
Expertise in Kubernetes and microservices architecture
Experience defining SLOs/SLIs in production environments
Deep understanding of distributed systems concepts
Proficient with infrastructure-as-code and CI/CD tools

Responsibilities

Design and maintain production infrastructure ensuring 99.99% availability
Define SLOs and drive reliability decisions across teams
Lead incident response efforts and conduct post-mortems
Develop infrastructure-as-code and CI/CD pipelines
Build observability platforms using monitoring tools
Automate processes to reduce toil and enhance efficiency
Architect container orchestration systems focused on cost and performance

Benefits

Competitive base salary with annual performance bonuses
Comprehensive health, dental, and vision insurance
Flexible hybrid/remote work model
Annual budget for learning and development
Generous PTO, paid parental leave, wellness programs
401(k) with employer match
Collaborative and blameless engineering culture

Full Job Description

Job Family Group

Engineering / Product Development

Job Profile

Sr Software Engineer

Management Level

Individual Contributor

Flexible Work Option

Hybrid - Ability to work remotely part of the week

Travel %

No

Work Shift

Day

Compensation
Compensation includes a base salary in the range of $111,600.00 - $186,000.00. The base salary may vary within the anticipated base pay range based on factors such as the ultimate location of the position and the selected candidate's knowledge, skills, and abilities. Position may be eligible for additional compensation that may include an incentive program.

Job Description

Senior Site Reliability Engineer

Department:

Engineering / Platform Reliability

Location: Austin (candidates must be based in Austin or willing to relocate for this role)

About the Role :

We are looking for a Senior Site Reliability Engineer who is passionate about building and maintaining highly available, scalable, and resilient systems. In this role you will serve as a senior engineer on the SRE team, driving reliability improvements across our production infrastructure while mentoring engineers and shaping our incident response culture.

You will partner closely with software engineering, security, and product teams to embed reliability into every stage of the development lifecycle. This is a high-impact position for someone who thrives at the intersection of software engineering and operations.

Key Responsibilities

Design, build, and maintain production infrastructure across cloud platforms (AWS, GCP, or Azure) ensuring 99.99%+ availability targets

Define and champion SLOs, SLIs, and error budgets; drive data-informed reliability decisions across engineering teams

Lead incident response efforts as Incident Commander; conduct blameless post-mortems and drive remediation to completion

Develop and maintain infrastructure-as-code (Terraform, or CloudFormation) and CI/CD pipelines for automated, repeatable deployments

Build and improve observability platforms using tools such as Prometheus, Grafana, NewRelic,Splunk, or the ELK stack

Automate toil reduction through custom tooling, self-healing systems, and proactive capacity planning

Architect and operate container orchestration systems (Kubernetes, ECS) at scale with emphasis on cost efficiency and performance

Collaborate with security teams to embed security best practices into infrastructure and deployment pipelines

Mentor junior and mid-level SREs through code reviews, knowledge-sharing sessions, and pair-programming

Contribute to the on-call rotation and continuously improve runbooks, alerting, and escalation procedures

Required Qualifications

7+ years of experience in SRE, DevOps, or platform engineering roles with progressive responsibility

Strong proficiency in at least one programming language (Python, Go, Java, or similar) for systems-level automation and tooling

Deep hands-on experience with at least one major cloud provider (AWS, GCP, or Azure) including networking, IAM, and managed services

Expert-level knowledge of container orchestration (Kubernetes) and microservices architectures

Demonstrated experience defining SLOs/SLIs and managing error budgets in production environments

Solid understanding of distributed systems concepts: consensus algorithms, CAP theorem, eventual consistency, and fault-tolerant design

Proficiency with infrastructure-as-code tools (Terraform, CloudFormation) and configuration management

Experience with CI/CD platforms (Jenkins, GitHub Actions) and GitOps workflows

Strong Linux systems administration skills and networking fundamentals (TCP/IP, DNS, load balancing, CDN)

Proven track record of leading incident response, writing effective post-mortems, and implementing systemic fixes

Preferred Qualifications

Familiarity with chaos engineering practices and tools

Background in database reliability engineering (Oracle, PostgreSQL, MySQL, Redis, or Cassandra at scale)

Hands-on experience with FinOps practices and cloud cost optimization

Contributions to open-source SRE or infrastructure projects

Relevant certifications (CKA, AWS Solutions Architect Professional, GCP Professional Cloud Architect)

Technical Environment

Our stack includes Kubernetes, Terraform, AWS, NewRelic,Prometheus and Grafana for monitoring, PagerDuty for on-call, GitHub Actions for CI/CD, and a mix of Java, Go and Python microservices. We are a team that values automation over manual intervention and continuously invest in reducing toil.

What We Offer

Competitive base salary with annual performance bonuses

Comprehensive health, dental, and vision insurance with generous employer contribution

Flexible hybrid/remote work model

Annual learning and development budget for conferences, certifications, and courses

Generous PTO policy, paid parental leave, and wellness programs

401(k) with employer match

Collaborative, blameless engineering culture that values continuous improvement

About Cox Enterprises

Cox Enterprises is a privately held global conglomerate headquartered in Atlanta, Georgia, United States, with approximately 55,000 employees and $21 billion in total revenue. Its major operating subsidiaries are Cox Communications, Cox Automotive, and Cox Media Group. The company's major national brands include AutoTrader, Kelley Blue Book, and Cox Homelife. Cox Enterprises is currently led by Alex Taylor, the great-grandson of founder James M. Cox.

Learn more about Cox Enterprises

Size

55,000 employees

Industry

Media

Founded

1898

* Ladders Estimates

Similar Jobs

Senior Software Engineer - Tech Lead
$120K — $276K *
Hewlett Packard Enterprise Development LP
Houston, TX 77084 (Harris County)
Today
Sr Software Engineer - Knowledge Representation (Ontology and ML Engineer)
$114K — $152K *
Global Healthcare Exchange
Remote
Today
Senior Software Engineer - Python
$130K — $193K *
PayPal
Austin, TX 78745 (Travis County)
Today
Senior Software Engineer
$130K — $205K *
HP Development Company, L.P.
Spring, TX 77379 (Harris County)
Today
Senior Software Engineer - Data Platform
$130K — $220K *
Samsara
Remote
Today
Senior Software Developer
$100K — $130K *
Optio Incentives
Remote
Reposted Today

Get Ready For Your
Next Interview

More Jobs at Cox Enterprises

Senior Site Reliability Engineer
$111K — $186K *
Austin, TX 78745 (Travis County)
Today
Information Technology
In-Person
Vehicle Information Strategy, Sr Manager - Cox Automotive
$122K — $204K *
Atlanta, GA 30349 (Fulton County)
Today
Manufacturing & Automotive
In-Person
Entry Level Software Engineer - Austin, TX
$81K — $122K *
Austin, TX 78745 (Travis County)
Today
Information Technology
In-Person
Field Sales Representative (Manheim)
$70K — $106K *
Portland, OR 97229 (Washington County)
Yesterday
Manufacturing & Automotive
In-Person
Field Sales Representative (Manheim)
$70K — $106K *
Remote
Yesterday
Manufacturing & Automotive
Remote in Portland, OR

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
2 weeks ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
2 weeks ago
Manager Site Reliability Operations
$118K — $230K *
Mercury Insurance
Remote
Today
Federation/Integration Engineer/UI
$80K — $128K *
Joint Activities
Remote
Today

Find similar Senior Site Reliability Engineer jobs:

Nationwide Austin, TX

Senior Site Reliability Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Site Reliability Engineer jobs:

Get Ready For Your
Next Interview