Team Lead, Site Reliability Engineer

TeamViewer Germany GmbH

$120K — $150K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years in SRE, DevOps, software development, or related roles
  • Strong experience with Microsoft Azure
  • Familiarity with containers and orchestration tools like Docker and Kubernetes
  • Experience with Infrastructure-as-Code using Terraform or Argo CD
  • Proficiency in monitoring tools such as Grafana or Prometheus
  • Solid understanding of cloud networking and security principles
  • Ability to balance technical tasks with leadership responsibilities

Responsibilities

  • Operate and maintain a scalable cloud infrastructure for a global SaaS platform
  • Lead deployment and ensure reliable production releases
  • Enhance monitoring and observability for 24/7 service reliability
  • Automate processes to minimize operational toil
  • Act as an escalation point during incidents for timely resolution
  • Conduct post-incident reviews with focus on root cause analysis
  • Proactively identify and address reliability risks and performance bottlenecks
  • Mentor and develop a team of Site Reliability Engineers

Benefits

  • Flexible PTO and paid holidays
  • 401(k) with employer matching
  • Comprehensive health insurance with 100% employer-paid medical coverage
  • Up to 12 weeks of parental leave
  • Employer-paid basic life insurance and disability coverage
  • Quarterly team-building events and leadership luncheons
  • Open door policy and casual dress code
  • Commitment to diversity and inclusion in the workplace
Full Job Description
As an SRE Team Lead at TeamViewer, you will be responsible for both technical leadership and people management within the Site Reliability Engineering team. You will help ensure the reliability, availability, and performance of our Azure-based global SaaS platform, while also leading, coaching, and developing a team of SREs.

This role sits at the intersection of operations, software engineering, and leadership. You will remain hands-on with production systems while setting direction, driving operational excellence, and fostering a strong team culture focused on ownership, reliability, and continuous improvement.

Technical & Operational Responsibilities

  • Operate and maintain highly available, scalable cloud infrastructure supporting a global SaaS platform


  • Lead the deployment and operation of feature and maintenance releases, ensuring safe and reliable delivery to production


  • Drive improvements in monitoring, alerting, and observability to support 24/7 service reliability and reduce incidents


  • Identify and reduce operational toil through automation and standardization


  • Act as an escalation point during incidents, supporting effective and timely resolution


  • Lead post-incident reviews, ensuring clear root cause analysis, actionable follow-ups, and measurable improvements


  • Proactively identify reliability risks, performance bottlenecks, and systemic weaknesses


Leadership & People Management Responsibilities

  • Line-manage and mentor a team of Site Reliability Engineers


  • Provide regular feedback, coaching, and performance development through 1:1s and goal setting


  • Support team members' technical growth and career progression


  • Promote best practices in incident management, on-call operations, documentation, and reliability engineering

  • Collaborate closely with Engineering, Product, Security, and Platform teams to improve operational maturit
  • Contribute to hiring, onboarding, and team capacity planning as the SRE function evolves
Preferred Requirements

  • Degree in Computer Science, Software Engineering, IT, or equivalent practical experience, with 3+ years in SRE, DevOps, software development, or related roles.
  • Strong hands-on experience with Microsoft Azure, including operating and troubleshooting Azure App Services and Application Gateways.
  • Practical experience with containers and orchestration (e.g., Docker, Kubernetes) and CI/CD pipelines (e.g., GitLab CI/CD, Azure DevOps).
  • Proven experience with Infrastructure-as-Code and automation (e.g., Terraform, Argo CD) and scripting (PowerShell, Bash, Python).
  • Experience with monitoring and observability tools (e.g., Grafana, Prometheus, Datadog).
  • Solid understanding of cloud networking, security principles, Identity & Access Management (e.g., Keycloak, Entra), and databases (e.g., PostgreSQL, MS SQL).
  • Comfortable balancing hands-on technical work with leadership and people responsibilities.

What we offer
  • Competitive compensation including stock-based options
  • Flexible PTO and paid holidays
  • 401(k) with employer matching
  • Comprehensive Health insurance package including 100% employer-paid medical coverage
  • Up to 12 weeks of Parental Leave
  • Basic Life Insurance, Short-Term & Long-Term Disability, 100% employer-paid
  • Quarterly teambuilding events, leadership luncheons, and companywide "All Hands" meetings
  • Open door policy and casual dress code
  • We celebrate diversity as one of our core values. Join c-a-r-e and lead change initiatives together with us!


Department Research & Development Locations Clearwater, Austin Remote status Hybrid

Similar Jobs

More Jobs at TeamViewer Germany GmbH

More Information Technology Jobs

Find similar Team Lead, Site Reliability Engineer jobs: