0-5 years of mechanical or thermal reliability engineering experience in data center hardware or high-performance computing platforms
Bachelor's or Master's degree in Mechanical Engineering, Thermal Engineering, or related field
Strong understanding of mechanical and thermal reliability principles
Experience with failure analysis and root cause methodologies
Familiar with Design of Experiments (DOE) and product validation processes
Ability to work effectively in cross-functional engineering teams
Responsibilities
Develop and execute reliability strategies for new product development programs
Design and implement Design of Experiments (DOE) for product validation
Perform reliability testing and evaluation for mechanical and thermal systems
Lead qualification readiness assessments for materials and tests
Conduct failure analysis and drive corrective actions during development
Evaluate and improve reliability of liquid cooling systems
Collaborate with partners on system testing and reliability validation
Develop reliability reports and qualification documentation
Benefits
Work in a dynamic, innovative environment with cross-functional collaboration
Opportunity to contribute to cutting-edge technology in cooling systems
Potential for professional growth in a leadership role within a high-performance team
Access to resources for continuous learning and skill enhancement
Engagement in a critical field that directly impacts product quality and reliability
Full Job Description
What You'll Do Here
Develop and execute reliability strategies and qualification plans for New Product Development (NPD) programs
Design and implement Design of Experiments (DOE) for material, process, and product validation
Perform reliability testing, evaluation, and qualification for mechanical and thermal systems
Lead qualification readiness assessments, including material readiness and test program validation
Conduct failure analysis and root cause investigations, and drive corrective and preventive actions (CAPA) during development.
Work on system-level reliability for:
Mechanical assemblies
High-speed connectors
Compute blade architectures
Evaluate and improve the reliability of liquid cooling systems, including:
Direct-to-chip (D2C) cooling
Rack and POD-level liquid cooling infrastructure
Collaborate with ODM, JDM, and CM partners on L6~L11 system testing and reliability validation
Develop reliability reports, risk assessments, and qualification documentation
Contribute to product validation processes and drive improvements in product quality and reliability
Who You Are
0-5 years of experience in mechanical and/or thermal reliability engineering in data center hardware, server systems, or high-performance computing platforms
Bachelor's or Master's degree in:
Mechanical Engineering
Thermal Engineering
Or a related field
Strong understanding of:
Mechanical and thermal reliability principles
Failure analysis and root cause methodologies
Materials and process characterization
Experience with:
Design of Experiments (DOE)
Product validation and qualification processes
Ability to work effectively in cross-functional engineering teams
Bonus Points If You Have
Knowledge of:
Liquid cooling systems and architectures
Advanced packaging and interconnect technologies
Familiarity with industry standards such as:
ISO
OCP
Experience working with ODM/JDM/CM partners in manufacturing environments
Strong analytical, problem-solving, and data-driven decision-making skills
All candidates must be authorized to work in the United States and work from our offices in Mountain View Tuesdays-Thursdays.