HTC Global Services

Lead Site Reliability Engineer - Cloud Platform (GCP/Kubernetes)

HTC Global Services · $120K - $150K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years in Site Reliability Engineering, Platform Engineering, Cloud Engineering, or DevOps
  • Expert-level experience in Kubernetes
  • Strong proficiency with Google Cloud Platform (GCP)
  • Proven expertise in Terraform
  • Experience with Helm for Kubernetes deployments
  • Familiarity with multi-cloud environments (AWS and Azure)
  • Proficient in Python or Bash scripting
  • Experience with monitoring tools like Prometheus, Grafana, Splunk, or OpenTelemetry

Responsibilities

  • Design and support highly available cloud infrastructure using GCP
  • Architect and manage scalable Kubernetes environments
  • Build and maintain Infrastructure-as-Code with Terraform
  • Develop and manage Helm charts for Kubernetes deployments
  • Create strategies for failover, disaster recovery, and multi-region setups
  • Enhance platform scalability, reliability, and performance
  • Implement monitoring, alerting, and observability best practices
  • Collaborate with engineering teams on platform architecture and cloud adoption
  • Mentor engineers and provide technical guidance

Benefits

  • Opportunities for professional development and mentoring
  • Work in a rapidly growing technology ecosystem
  • Lead initiatives on infrastructure strategy and resiliency
  • Collaborate with diverse engineering teams
  • Play a key role in improving operational excellence

Full Job Description

Job Title: Lead Site Reliability Engineer (GCP & Kubernetes)

Overview / Summary

We are seeking a Lead Site Reliability Engineer to drive reliability, scalability, and operational excellence across a rapidly growing technology ecosystem. This role serves as a technical leader focused on cloud architecture, Kubernetes platforms, infrastructure automation, and highly available distributed systems. The position plays a key role in defining infrastructure strategy, improving platform resiliency, and mentoring engineering teams.

Key Responsibilities
• Design and support highly available cloud infrastructure in GCP
• Architect and manage Kubernetes environments at scale
• Build and maintain Infrastructure-as-Code using Terraform
• Develop and manage Helm charts and Kubernetes deployments
• Design failover, disaster recovery, and multi-region strategies
• Improve platform scalability, reliability, and performance
• Implement monitoring, alerting, and observability best practices
• Partner with engineering teams on platform architecture and cloud adoption
• Mentor engineers and provide technical leadership

Required Qualifications
• 7+ years of experience in Site Reliability Engineering, Platform Engineering, Cloud Engineering, or DevOps
• Expert-level Kubernetes experience
• Strong Google Cloud Platform (GCP) experience
• Expertise with Terraform
• Experience with Helm
• Multi-cloud exposure, including AWS and Azure
• Experience with distributed systems
• Python or Bash scripting experience
• Experience with Prometheus, Grafana, Splunk, or OpenTelemetry


About HTC Global Services

HTC Global Services is a global provider of IT and Business Process Services and Solutions. Founded in 1990, HTC is headquartered in Troy, Michigan, with delivery centers in North America, Europe, India, and Malaysia. HTC is an Inc. 500 Hall of Fame company and has been recognized by numerous industry and trade publications as a top provider of services. Its client base consists of Global 2000 customers, with a focus on the healthcare, retail, financial services, and automotive verticals. HTC is also committed to corporate social responsibility and has been recognized for its contributions to the community.
Size: 17,575 employees
Founded: 1990
