Google

Staff Software Engineer, Fault Management

Google$207K — $301K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience programming in C.
  • 5 years of experience in testing and launching software products.
  • 5 years in building and developing large-scale infrastructure, distributed systems, or networks.
  • 3 years of experience in software design and architecture.
  • Experience with C, SQL, and SQL Pipelines.

Responsibilities

  • Craft software/firmware solutions to enhance server reliability involving CPU, memory, and PCIe/CXL components.
  • Collaborate across teams to influence design and implementation of Google's compute and storage systems.
  • Drive development processes from requirements through integration ensuring high-quality outputs.
  • Plan and manage resources and tools to meet reliability goals outlined in the roadmap.
  • Engage with vendors and represent the fault management software team in project planning with executives.

Benefits

  • Opportunity to work on cutting-edge technology in a major tech company.
  • Collaboration with diverse teams in a dynamic work environment.
  • Engagement in high-impact projects that influence core infrastructure.
  • Exposure to complex systems design and implementation challenges.
Full Job Description
Minimum qualifications:
  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience programming in C .
  • 5 years of experience testing, and launching software products.
  • 5 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage, or hardware architecture.
  • 3 years of experience with software design and architecture.
  • Experience with C , SQL, and SQL Pipelines.

Preferred qualifications:
  • Master's degree or PhD in Engineering, Computer Science, or a related technical field.
  • 8 years of experience with data structures and algorithms.
  • 3 years of experience in a technical leadership role leading project teams and setting technical direction.
  • 3 years of experience working in a matrixed organization involving cross-functional, or cross-business projects.
  • Experience with Reliability, Availability, and Serviceability (RAS) related data pipelines, and dashboards.


Responsibilities
  • Focus on crafting software/firmware solutions that fortify the reliability of servers and their components, spanning x86 CPU, memory subsystems, peripheral component interconnect express (PCIe)/compute express link (CXL) input/output link covering host and endpoints, and software components.
  • Partner across multiple teams and job ladders to influence the design and implementation of most compute and storage systems powering Google's data centers.
  • Drive every facet of development, from requirements definition to design, implementation, unit testing, and integration. Oversee meticulous reviews to guarantee the delivery of high-quality solutions.
  • Plan and manage resources, and tools to execute against a comprehensive roadmap that advances our reliability goals.
  • Promote collaborations with vendors and represent the fault management software team in project planning discussions with executive management.

About Google

Google is a multinational technology company that specializes in Internet-related services and products. These include online advertising technologies, search engine, cloud computing, software, and hardware. Google was founded in 1998 by Larry Page and Sergey Brin while they were Ph.D. students at Stanford University. The company has grown tremendously since then and has become one of the most valuable companies in the world. Google's mission is to organize the world's information and make it universally accessible and useful.
Learn more about Google
Size
156,500 employees
Market Cap
$1,115.4 billion
Industry
Net Income
$40.2 billion
Founded
1998
5 Year Trend
+23.3%
Revenue
$182.5 billion
NASDAQ

Similar Jobs

More Jobs at Google

More Information Technology Jobs

Find similar Staff Software Engineer, Fault Management jobs: