Site Reliability Engineer

Future Secure AI

• $120K — $150K *

Austin, TX 78745In-Person

Information Technology

5 - 7 years of experience

Yesterday

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5-7 years of relevant professional experience in Site Reliability Engineering or similar roles
Hands-on experience with Kubernetes, preferably on EKS, AKS, or GKE
Proficiency with Terraform for infrastructure automation
Familiarity with Helm for streamlined Kubernetes applications deployment
Experience in scripting or programming with languages like Python, Go, or Java

Responsibilities

Design and implement reliable production infrastructure for AI applications
Manage and optimize Kubernetes platforms for AI workloads
Automate infrastructure provisioning through code using Terraform
Ensure effective deployment workflows with Helm
Monitor and enhance system reliability by defining SLIs and SLOs
Oversee incident responses and facilitate post-mortem analyses
Drive automation to minimize operational workload

Benefits

A high-performance culture that promotes excellence
Access to cutting-edge technology and tools
Opportunity to learn from exceptional leadership
Potential for significant impact on projects and initiatives
Flexible work arrangements to support work-life balance
Encouragement of diversity and creativity in the workplace

Full Job Description

We are looking for a Sr. Site Reliability Engineer to help design, build, and operate the platforms that power AI Co-Workers. This is a hands-on role for an engineer who enjoys owning reliability end-to-end and working closely with product, AI, and engineering teams. The role • Design, build, and operate reliable production infrastructure supporting AI Co-Workers • Own Kubernetes-based platforms used to deploy and run AI workloads • Build and maintain infrastructure as code using Terraform • Implement and maintain Helm-based deployment workflows • Define, measure, and improve system reliability using SLIs, SLOs, and SLAs • Participate in on-call rotation, incident response, root cause analysis, and post-mortems • Reduce operational toil through automation and engineering improvements • Build and improve observability across monitoring, logging, and alerting • Partner closely with engineers to ensure systems are resilient, scalable, and secure • Operate across build, deploy, and operate phases of the software lifecycle Must have criteria • Hands-on Kubernetes experience designing, building, or operating workloads on EKS, AKS, GKE, or self-managed Kubernetes • Hands-on Terraform experience for infrastructure provisioning and automation • Hands-on Helm experience for Kubernetes application deployment • Professional experience using at least two programming or scripting languages such as Python, Go, Java, Bash, PowerShell, or Ruby • Direct Site Reliability Engineer experience or equivalent, including reliability engineering, on-call, incident response, post-mortems, and toil reduction Should have criteria • Experience working within a defined SDLC, including CI/CD, release processes, and end-to-end delivery from design to operations • Hands-on experience with at least one major cloud provider such as AWS, Azure, or Google Cloud • Experience with ArgoCD or GitOps-style deployment approaches • Five or more years of relevant professional experience • DevOps or DevSecOps experience, including CI/CD ownership, infrastructure automation, and security considerations Preferable criteria • Relevant certifications such as CKA, CKAD, cloud certifications, DevOps, DevSecOps, or programming credentials Why Join Us? • A high-performance culture • State-of-the-art technology • Experience world-class leadership • Scale of impact and purpose • A competitive salary and a huge growth trajectory • Work with the best in the industry • Flexible work environment • Diversity and creativity Disclaimer: We do not wish to be contacted by recruitment agencies. Our hiring process is managed in-house and the best way for candidates to express interest is by applying with your resume through our company website.

* Ladders Estimates

Similar Jobs

Application Engineer, Senior
$100K — $130K *
University of Oregon
Remote
Today
Sr. Systems Engineer
$90K — $120K *
Smile Doctors
Dallas, TX 75217 (Dallas County)
Reposted Today
Hosting Systems Principal
$100K — $125K *
Thales Group
Remote
Reposted Today
Senior Systems Engineer
$100K — $130K *
Saronic Technologies
Austin, TX 78745 (Travis County)
Today
System Engineer III, Endpoint & Security Delivery/Information Solutions (Remote)
$90K — $120K *
MUSC Health & Medical University of SC
Remote
Today
Senior Systems Design Engineer
$95K — $120K *
Sentrillion
Remote
Today

Get Ready For Your
Next Interview

More Jobs at Future Secure AI

Site Reliability Engineer
$120K — $150K *
Austin, TX 78745 (Travis County)
Yesterday
Information Technology
In-Person
Senior Manager, Software Engineering
$130K — $180K *
Austin, TX 78745 (Travis County)
2 days ago
Enterprise Technology
In-Person
Head of IT
$130K — $180K *
Austin, TX 78745 (Travis County)
3 weeks ago
Information Technology
In-Person
Sr. DevOps Engineer
$120K — $150K *
Austin, TX 78745 (Travis County)
1 month ago
Information Technology
In-Person
Site Reliability Engineer
$90K — $130K *
Toronto, ON M3C 0E3
1 month ago
Technical Services
In-Person

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Senior AI Security Software Engineer
$120K — $150K *
Carnegie Mellon University
Pittsburgh, PA 15237 (Allegheny County)
Reposted Today
AI Security Researcher
$90K — $130K *
Carnegie Mellon University
Pittsburgh, PA 15237 (Allegheny County)
Reposted Today
Principal Software Engineer
$99K — $223K *
Oracle Corporation
Nashville, TN 37211 (Davidson County)
Reposted Today
Cyber Defense Forensics Analyst
$62K — $141K *
Booz Allen Hamilton, Inc.
Alexandria, VA 22304 (Alexandria City County)
Today

Find similar Site Reliability Engineer jobs:

Nationwide Austin, TX

Site Reliability Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Site Reliability Engineer jobs:

Get Ready For Your
Next Interview