AutoZone

Systems Engineer - Cloud Ops

AutoZone, $90K to $120K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years of hands-on experience with Kubernetes in production environments
  • 3+ years of experience with Google Cloud Platform (GCP) services including GKE
  • Strong experience with Terraform for infrastructure as code (IaC)
  • Proficiency with GitLab CI/CD pipelines and experience with ArgoCD
  • Familiarity with observability tools: Dynatrace, Prometheus, Grafana
  • Basic understanding of LLM concepts and AI model deployment
  • Excellent problem-solving and analytical skills

Responsibilities

  • Design, build, and maintain cloud infrastructure using Terraform on GCP
  • Develop CI/CD pipelines and implement GitOps practices using ArgoCD
  • Monitor system performance and troubleshoot production issues
  • Participate in on-call rotation for critical infrastructure support
  • Deploy and manage containerized applications on Google Kubernetes Engine (GKE)
  • Support infrastructure for AI/ML workloads including LLM-based applications
  • Configure and optimize Kubernetes networking and resource management

Benefits

  • Competitive pay
  • Unrivaled company culture
  • Medical, dental, and vision plans
  • Exclusive in-store discounts and perks
  • 401(k) with company match and Stock Purchase Plan
  • Mental health support through the Living Well Program
  • Opportunities for career growth
  • Paid time off and additional insurance options for full-time employees
Full Job Description

As a Systems Engineer on the Cloud Operations team, you will be responsible for deploying, managing, and optimizing our cloud-based infrastructure on Google Cloud Platform (GCP). You will work with technologies such as Terraform, Kubernetes (GKE), GitOps/ArgoCD, CI/CD pipelines, and observability tools to ensure reliable, secure, and scalable platform operations.

You will also contribute to our AI/ML platform initiatives, supporting infrastructure for LLM-based applications and AI-powered automation tools that enhance developer productivity and operational efficiency.

You will collaborate with development teams, SREs, and platform architects to ensure seamless deployment and delivery of applications while maintaining the highest standards of reliability, security, and performance.

Responsibilities

Cloud Infrastructure, Automation & Operations:
  • Design, build, and maintain cloud infrastructure using Terraform to automate provisioning, scaling, and lifecycle management of resources on GCP
  • Develop and maintain CI/CD pipelines using GitLab CI to automate build, test, and deployment workflows
  • Implement and maintain GitOps practices using ArgoCD for declarative, version-controlled application deployment
  • Monitor system performance using observability tools (Dynatrace, Cloud Monitoring, Prometheus/Grafana) and troubleshoot production issues
  • Participate in on-call rotation to provide 24/7 support for critical infrastructure incidents
  • Perform root cause analysis on incidents and implement preventive measures
  • Document runbooks, architecture decisions, and operational procedures


Kubernetes Platform Management:
  • Deploy, configure, and manage containerized applications on Google Kubernetes Engine (GKE), including GKE Autopilot and Standard clusters
  • Manage cluster lifecycle, including upgrades, node pool configurations, and capacity planning
  • Troubleshoot pod failures, CrashLoopBackOff, OOMKilled events, and container resource issues
  • Configure and optimize resource requests/limits, Horizontal Pod Autoscaler (HPA), and Vertical Pod Autoscaler (VPA)
  • Manage Kubernetes networking including Services, Ingress controllers, Network Policies, and DNS configurations
  • Implement and manage service mesh (Istio) for traffic management, observability, and security
  • Manage secrets and configurations using Kubernetes Secrets, ConfigMaps, and external secret management tools
  • Implement pod security standards, RBAC policies, and workload identity configurations

AI/ML Platform & Automation:
  • Support infrastructure for AI/ML workloads including LLM-based applications and model serving platforms
  • Deploy and manage AI-powered developer tools such as coding assistants (Claude Code, GitHub Copilot) and agentic AI systems
  • Explore and implement AI-assisted incident response and automated remediation workflows
  • Build and maintain infrastructure for Retrieval-Augmented Generation (RAG) pipelines and vector databases
  • Configure GPU-enabled node pools and optimize resource allocation for AI/ML workloads
  • Implement MCP (Model Context Protocol) servers and AI agent integrations for operational automation
  • Stay current with emerging AI technologies and evaluate their applicability for infrastructure automation


Qualifications

Kubernetes Expertise (Essential):
  • 3+ years of hands-on experience with Kubernetes in production environments
  • Deep understanding of Kubernetes architecture: API server, etcd, scheduler, controller manager, kubelet
  • Experience with GKE (Standard and Autopilot modes), including cluster creation, upgrades, and maintenance
  • Proficiency in troubleshooting workloads: analyzing pod logs, events, describe outputs, and container states
  • Strong understanding of resource management: requests, limits, QoS classes, and resource quotas
  • Experience with Kubernetes networking: Services (ClusterIP, NodePort, LoadBalancer), Ingress, Network Policies
  • Knowledge of Kubernetes storage: PersistentVolumes, PersistentVolumeClaims, StorageClasses, dynamic provisioning
  • Experience with Helm charts for application packaging and deployment
  • Familiarity with Kubernetes security: RBAC, Pod Security Standards, Secrets management, Workload Identity
  • Understanding of Kubernetes observability: metrics-server, kubectl top, container resource monitoring
  • Experience debugging common issues: ImagePullBackOff, CrashLoopBackOff, OOMKilled, Evicted pods, pending pods

Cloud & Infrastructure:
  • 3+ years of experience with Google Cloud Platform (GCP) services including GKE, Cloud Run, Cloud SQL, Memorystore, Pub/Sub, and Cloud Logging
  • Strong experience with Terraform for infrastructure as code (IaC)
  • Understanding of cloud networking: VPCs, subnets, firewall rules, Cloud NAT, Private Service Connect

CI/CD & GitOps:
  • Proficiency with GitLab CI/CD pipelines
  • Experience with ArgoCD or similar GitOps tools
  • Understanding of Helm charts and Kustomize for Kubernetes manifest management

Observability & Troubleshooting:
  • Experience with monitoring and APM tools (Dynatrace, Datadog, Prometheus, Grafana)
  • Ability to analyze logs, metrics, and traces to diagnose production issues
  • Familiarity with JVM troubleshooting (heap dumps, thread analysis, GC tuning, connection pool issues)

AI/ML Knowledge:
  • Basic understanding of LLM concepts, prompt engineering, and AI model deployment
  • Familiarity with AI coding assistants and their integration into development workflows
  • Interest in agentic AI systems and autonomous automation tools
  • Exposure to vector databases (Pinecone, Weaviate, pgvector) and RAG architectures is a plus

Systems & Networking:
  • Strong Linux administration skills
  • Understanding of networking concepts (DNS, load balancing, firewalls, TCP/IP)
  • Experience with service mesh (Istio) is a plus

General:
  • Excellent problem-solving and analytical skills
  • Strong written and verbal communication
  • Ability to work effectively in a collaborative, cross-functional environment
  • Experience working in an Agile/DevOps culture
  • Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience)


Benefits at AutoZone

AutoZone offers thoughtful benefits programs with one-on-one benefits guidance designed to improve AutoZoners' physical, mental and financial well-being.

All AutoZoners (Full-Time and Part-Time):

  • Competitive pay
  • Unrivaled company culture
  • Medical, dental and vision plans
  • Exclusive discounts and perks, including an AutoZone in-store discount
  • 401(k) with company match and Stock Purchase Plan
  • AutoZoners Living Well Program for free mental health support
  • Opportunities for career growth


Additional Benefits for Full-Time AutoZoners:

  • Paid time off
  • Life insurance and short- and long-term disability insurance options
  • Health Savings and Flexible Spending Accounts with wellness rewards
  • Tuition reimbursement


Minimum age requirements may apply. Eligibility and waiting period requirements may apply; benefits for AutoZoners in Puerto Rico, Hawaii, or the U.S. Virgin Islands may differ. Learn more about all that AutoZone has to offer at Careers.AutoZone.com.

We proudly support Veterans, Active-duty Service Members, Reservists, National Guard and Military Families. Your experience is highly valued, and we encourage you to apply to join our team.

Online Application:

An online application is required. Click the Apply button to complete your application. For step-by-step instructions on how to apply, visit careers.autozone.com/candidateresources.

About AutoZone

AutoZone is the nation's leading retailer and a leading distributor of automotive replacement parts and accessories with more than 6,000 stores in the US, Mexico, Brazil and Puerto Rico. Each store carries an extensive line for cars, sport utility vehicles, vans and light trucks, including new and remanufactured hard parts, maintenance items and accessories. AutoZone, headquartered in Memphis, TN, is a growing Fortune 300 company with a deep commitment to serving our customers, communities and fellow AutoZoners. We have vast opportunities in our stores, distribution centers, field offices, specialty business units and Store Support Center and embrace diverse experiences, backgrounds, knowledge and ideas to strengthen our teams and business. Our team is connected by a deep commitment to our Pledge and Values, principles established more than thirty years ago that reinforce our priorities and team culture. In addition, we constantly innovate and aspire to best serve our customers, creating new and better tools, training and outreach to serve both DIY and professional installer customers. From in-store tools to E-Commerce, training and development to recognition, our team has the tools to help you grow your career at AutoZone. See where your drive will take you!

AutoZone Careers

Join the dynamic team at AutoZone, the leading retailer and distributor of automotive replacement parts and accessories in the U.S. At AutoZone, we are committed to providing not just auto parts, but superior customer service and solutions. There has never been a better time to explore the job opportunities available across our expansive network.

Work You’ll Do

At AutoZone, you’ll be part of a team that values innovation, leadership, and a commitment to excellence. We offer a variety of job opportunities that allow you to help drive growth within the company and the entire automotive industry. From sales to supply chain management, technology to marketing, there’s a place for your passion and skills. Join our team and contribute to a culture that values diversity, leadership, and professional development. AutoZone is not just about car parts; it’s about elevating your career to new heights in an environment that fosters growth and innovation.

Professional Growth and Development

AutoZone is dedicated to your professional growth. We provide comprehensive benefits and diversity training that ensure our team members are equipped to lead in the automotive and retail industries. Our leadership programs, internships, and continuous learning paths are designed to enhance your skills and advance your career.

Innovate with Us

Be part of a company that thrives on innovation and operational excellence. At AutoZone, you’ll work with a team of dedicated professionals who are always pushing the boundaries of what’s possible in the automotive world. Our commitment to innovation is at the core of our operations, ensuring that we stay ahead of industry trends and continue to deliver exceptional value to our customers.

Networking and Career Advancement

AutoZone provides a platform for networking with industry leaders and peers that can lead to incredible career opportunities. Our internal networking events, mentorship programs, and leadership training are geared towards enhancing your professional skills and helping you build meaningful connections within the industry.

Join Our Team

Search open positions that match your skills and interests. We are looking for passionate, curious, creative, and solution-driven team players. Whether you’re seeking an entry-level position or a more senior role, AutoZone offers a path that aligns with your career ambitions.

Stay Connected

Keep up to date with the latest in career opportunities and company news at AutoZone. Subscribe to our job alert emails and stay ahead of the game with insider tips, industry news, and exclusive looks into life at AutoZone.

Explore AutoZone Careers

Discover the opportunities waiting for you at AutoZone. From part-time jobs to full-time careers, from internships to executive positions, AutoZone is hiring across various departments. Ready to accelerate your career? Apply today and drive your future forward with AutoZone. Join us at AutoZone, where your career journey is just beginning.
Learn more about AutoZone

  • Size: 62,000 employees
  • Market Cap: $45.5 billion
  • Net Income: $1.8 billion
  • Founded: 1979
  • 5 Year Trend: +8.3%
  • Revenue: $13.3 billion
