Minimum qualifications:- Bachelor's degree or equivalent practical experience.
- 2 years of experience with software development or 1 year of experience with an advanced degree in an industry setting.
- 2 years of experience with developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, storage or hardware architecture.
- 2 years of experience with networking protocols and troubleshooting.
- 2 years of experience working in C .
Preferred qualifications:- 2 years of experience with data structures and algorithms.
- 2 years of experience with performance optimization.
- 2 years of experience with networking protocols.
- Experience developing accessible technologies.
- Experience in network infrastructure.
- Experience with machine learning infrastructure.
About the jobTensor Processing Units (TPU) are Google's custom-built Application-Specific Integrated Circuits used to accelerate machine learning (ML) workloads. TPU are designed from the ground up leveraging Google's deep experience and leadership in ML Learning.
As a team member in Tensor Processing Unit Host Networking, you will play a leading role in the design, development, testing, deployment, and debugging of the TPU networking stack, from hardware (Tensor Processing Unit, Network Interface Controller) all the way up to ML frameworks (JAX, PyTorch) to enable both large-scale training and low-latency inference applications.
The US base salary range for this full-time position is $147,000-$211,000 bonus equity benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google .
Responsibilities - Write product or system development code.
- Design, develop, test and deploy TPU networking stack.
- Perform full-stack cross-layer optimization of TPU networking performance for a variety of ML workloads.
- Analyze and debug TPU networking performance issues in production.
- Develop and enhance telemetry to provide deep visibility into network behavior and accelerate troubleshooting.