Google

Site Reliability Manager, GCE Node, Site Reliability Engineering

Google$207K — $300K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science or related field, or equivalent experience.
  • 8 years of software development experience in multiple programming languages.
  • 3 years of people or team management experience.
  • 3 years of project leadership experience.
  • 3 years in designing, analyzing, and troubleshooting distributed systems.

Responsibilities

  • Lead a team of Software and Systems Engineers on user-facing projects.
  • Own end-to-end availability and performance of key services.
  • Build automation to prevent problems and automate service responses.
  • Mentor the team and establish credibility through quality technical execution.
  • Manage on-call rotations across continents using a follow-the-sun model.
  • Design and deliver software that enhances the availability and efficiency of services.

Benefits

  • Health, dental, vision, life, and disability insurance.
  • 401(k) retirement plan with company match.
  • 20 days of vacation per year.
  • 40 hours of sick time per year, increasing for Seattle employees.
  • 28-30 weeks of maternity leave through short-term disability.
  • 18 weeks of baby bonding leave.
  • 13 paid holidays per year.
Full Job Description
info_outline
X In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include:
  • Health, dental, vision, life, disability insurance
  • Retirement Benefits: 401(k) with company match
  • Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
  • Sick Time: 40 hours/year (increased to 69 hours/year for Seattle) including 5 discretionary sick days per instance
  • Maternity Leave (Short-Term Disability Baby Bonding): 28-30 weeks
  • Baby Bonding Leave: 18 weeks
  • Holidays: 13 paid days per year


Minimum qualifications:
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
  • 8 years of experience with software development in one or more programming languages.
  • 3 years of experience managing people or teams.
  • 3 years of experience leading projects.
  • 3 years of experience designing, analyzing, and troubleshooting distributed systems.

Preferred qualifications:
  • Master's degree in Computer Science or Engineering.


About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.

SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

To learn more: check out our books on Site Reliability Engineering or read a career profile about why a Software Engineer chose to join SRE.

In this role, you will manage the massive fleet of virtual machines powering Google Compute Engine. Your mission is to ensure that both internal and external Google Cloud users have access to a reliable, scalable, and feature-rich compute environment. By maintaining the integrity of this global infrastructure, you will directly enable the success of countless services and applications worldwide.

The US base salary range for this full-time position is $207,000-$300,000 bonus equity benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google .

Responsibilities
  • Lead a team of Software and Systems Engineers on user-facing projects, taking direct responsibility for uptime.
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence and automate responses to all non-exceptional service conditions.
  • Mentor the team, lead by example, and establish credibility through quality technical execution.
  • Manage on-call rotations across continents, using a follow-the-sun model.
  • Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services.


About Google

Google is a multinational technology company that specializes in Internet-related services and products. These include online advertising technologies, search engine, cloud computing, software, and hardware. Google was founded in 1998 by Larry Page and Sergey Brin while they were Ph.D. students at Stanford University. The company has grown tremendously since then and has become one of the most valuable companies in the world. Google's mission is to organize the world's information and make it universally accessible and useful.
Learn more about Google
Size
156,500 employees
Market Cap
$1,115.4 billion
Industry
Net Income
$40.2 billion
Founded
1998
5 Year Trend
+23.3%
Revenue
$182.5 billion
NASDAQ

Similar Jobs

More Jobs at Google

More Information Technology Jobs

Find similar Site Reliability Manager, GCE Node, Site Reliability Engineering jobs: