Google

Staff Software Engineer, ML Data Infrastructure

Google$207K — $300K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree or equivalent practical experience.
  • 8 years programming in C .
  • 5 years testing and launching software products.
  • 5 years building large-scale infrastructure or experience with compute technologies, storage, hardware architecture.
  • 3 years in software design and architecture.

Responsibilities

  • Enable next-generation model architectures and training procedures.
  • Write and maintain large-scale data processing pipelines in C .
  • Propose and secure buy-in for new infrastructure to support evolving training data.
  • Reduce complexity and fragmentation in ML training infrastructure with standardized solutions.
  • Collaborate with infrastructure teams on recommendations quality and debug data and infrastructure issues.

Benefits

  • Access to diverse training and development opportunities.
  • Flexible work arrangements and a focus on work-life balance.
  • Health and wellness programs.
  • Employee resource groups and community support.
  • Opportunities to work with cutting-edge technology in an innovative environment.
Full Job Description
Minimum qualifications:
  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience programming in C .
  • 5 years of experience testing, and launching software products.
  • 5 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage, or hardware architecture.
  • 3 years of experience with software design and architecture.

Preferred qualifications:
  • Experience building large-scale data infrastructure, frameworks or libraries.
  • Understanding of ML concepts, including model architecture and training.
  • Ability to collaborate effectively across teams and functions.
  • Solid communication (broadly and deeply) skills about recommendation technology, system design and implementation.


About the job

The YouTube Discovery Data team is responsible for the data that powers personalized discovery at YouTube -- the YouTube homepage, watch page, and dozens of other surfaces that allow users to discover content on YouTube. Hundreds of engineers across YouTube use these data sources to train and serve more than a thousand ML models, including the use of LLMs for personalized discovery at YouTube scale. Some of our current products include YouTube watch history, discovery training data, discovery sessions, the YouTube user data dump.

At YouTube, we believe that everyone deserves to have a voice, and that the world is a better place when we listen, share, and build community through our stories. We work together to give everyone the power to share their story, explore what they love, and connect with one another in the process. Working at the intersection of cutting-edge technology and boundless creativity, we move at the speed of culture with a shared goal to show people the world. We explore new ideas, solve real problems, and have fun - and we do it all together.

The US base salary range for this full-time position is $207,000-$300,000 bonus equity benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google .

Responsibilities
  • Enable next-generation model architectures and training procedures.
  • Write and maintain large-scale data processing pipelines in C .
  • Propose and secure buy-in from our clients to build new infrastructure for the evolving training data use-cases.
  • Reduce complexity and fragmentation in the ML training infrastructure by providing standardized, composable, and self-service infrastructure solutions.
  • Collaborate closely with other infrastructure teams working on recommendations quality, storage, logging and privacy. Debug data quality and infrastructure issues across the stack.


About Google

Google is a multinational technology company that specializes in Internet-related services and products. These include online advertising technologies, search engine, cloud computing, software, and hardware. Google was founded in 1998 by Larry Page and Sergey Brin while they were Ph.D. students at Stanford University. The company has grown tremendously since then and has become one of the most valuable companies in the world. Google's mission is to organize the world's information and make it universally accessible and useful.
Learn more about Google
Size
156,500 employees
Market Cap
$1,115.4 billion
Industry
Net Income
$40.2 billion
Founded
1998
5 Year Trend
+23.3%
Revenue
$182.5 billion
NASDAQ

Similar Jobs

More Jobs at Google

More Information Technology Jobs

Find similar Staff Software Engineer, ML Data Infrastructure jobs: