Google

Senior Software Engineer, Site Reliability Engineering

Google, $174K - $252K
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or equivalent experience.
  • 5+ years in software development with proficiency in one or more programming languages.
  • 3+ years designing and troubleshooting large-scale distributed systems.
  • 2+ years of project leadership and technical management experience.
  • Master's degree in Computer Science or Engineering is a plus.

Responsibilities

  • Engage in and enhance the entire service lifecycle from design to deployment.
  • Support pre-launch services with design consulting and platform development.
  • Monitor live services to ensure availability, latency, and overall health.
  • Scale systems sustainably using automation and drive improvements in reliability.
  • Conduct sustainable incident response and perform blameless postmortems.

Benefits

  • Comprehensive health insurance options.
  • Retirement savings plans with company matching.
  • Generous paid time off and holiday policies.
  • Access to wellness programs and resources.
  • Continuous professional development opportunities.

Full Job Description

Minimum qualifications:
  • Bachelor's degree in Computer Science, Engineering, a related field, or equivalent practical experience.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems.
  • 2 years of experience leading projects and providing technical leadership.

Preferred qualifications:
  • Master's degree in Computer Science or Engineering.


About the job

Site Reliability Engineering (SRE) is what you get when you treat operations as if it's a software problem. Our mission is to progress, protect, and provide for the software and systems behind all of Google's public services - Search, Ads, Gmail, Android, YouTube, and AppEngine, to name just a few - with an ever-watchful eye on their availability, latency, performance, and capacity.

This is an unusual job, unlike others in the industry. Like traditional operations groups, we keep important, revenue-critical systems up and running despite hurricanes, bandwidth outages, and configuration problems. Unlike traditional operations groups, we also have full access to and authority to fix, extend, and scale the code to keep it working and harden it against all the vagaries of the Internet. We hire people from both systems and software backgrounds. Strong candidates will have experience with both.

Just as what we do is unique, where we do it is unique too. At Google, we have the good fortune to have developed many interesting systems ranging from planet-spanning databases to near real-time scalable data warehousing to fault-tolerant datastream joining. In SRE, we flip from the fine-grained detail of disk driver I/O scheduling to the big picture of continental-level service capacity, across a range of systems and a user population measured in billions. We own those products in production. We drive reliability and performance across massive scale by mastering the full depth of the stack. We literally do learn something new every day - usually surprising things - that have the potential to transform the lives of billions of our users around the world.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

The US base salary range for this full-time position is $174,000-$252,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.

Responsibilities
  • Engage in and improve the whole lifecycle of services - from inception and design through to deployment, operation, and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.


About Google

Google is a multinational technology company that specializes in Internet-related services and products. These include online advertising technologies, a search engine, cloud computing, software, and hardware. Google was founded in 1998 by Larry Page and Sergey Brin while they were Ph.D. students at Stanford University. The company has grown tremendously since then and has become one of the most valuable companies in the world. Google's mission is to organize the world's information and make it universally accessible and useful.

  • Size: 156,500 employees
  • Market Cap: $1,115.4 billion
  • Net Income: $40.2 billion
  • Revenue: $182.5 billion
  • Founded: 1998
  • 5 Year Trend: +23.3%
  • Stock exchange: NASDAQ
