Hardware Systems Engineer

Meta

$130K — $180K *
Telecommunications & Hardware
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, Computer Engineering, or related field
  • 8+ years of hands-on engineering experience in software, firmware, or hardware
  • Experience in ASIC development, board level debug, and system validation
  • Proficient in leading Silicon or System troubleshooting and debugging
  • Experience in developing test specifications and procedures

Responsibilities

  • Drive and execute end-to-end system validation strategy for AI/HPC systems
  • Lead hands-on bring-up, validation, and deployment of large-scale hardware systems
  • Explore new use cases with customers and identify relevant test methodologies
  • Investigate and troubleshoot complex hardware failures with cross-functional teams
  • Triage failures and advance project development work
  • Identify improvements for test processes in the NPI phase
  • Communicate project progress to internal and external teams

Benefits

  • Engage in cutting-edge technology and innovative projects
  • Work collaboratively with diverse teams across various disciplines
  • Opportunity for hands-on participation in hardware deployment
  • Contribute to the development of systems for AI and HPC applications
  • Access to professional growth with exposure to advanced hardware systems

Full Job Description

Hardware Systems Engineers work closely with Hardware/Software co-design teams, hardware designers, networking teams, system manufacturers, component vendors, capacity engineering, production engineering, production services, and data center operations teams to enable new systems that will be deployed in our production data centers. Ramping to production and solving data center scaling and deployment challenges requires us to take a systems-based approach to the new product introduction (NPI) phase.

Responsibilities

• Drive and execute end-to-end system validation strategy (hardware and software), with a focus on various AI/HPC hardware systems in data center applications
• Lead the bring-up, validation, and deployment of cutting-edge hardware systems in large-scale deployments, with active hands-on participation
• Explore new use cases with customer teams and identify related test methodologies/test cases accordingly
• Investigate and troubleshoot complex failures potentially related to hardware systems with cross-functional teams, which may involve different stacks such as silicon, firmware, and software
• Triage failures and continue root-causing them while driving project development work forward
• Identify gaps and opportunities to improve test processes and test methodologies across the NPI space
• Guide automation efforts and data analysis for NPI projects through engagement with related cross-functional teams
• Communicate project progress and assessments to related internal and external teams

Minimum Qualifications
• Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
• 8+ years of hands-on SW, FW, or HW engineering experience building any of the following products: AI silicon, GPUs, TPUs, autonomous cars, or AI servers
• Experience in one or more domains such as: ASIC development (silicon design, bring-up, characterization, validation), board-level debug, firmware validation, system validation
• Experience with leading Silicon or System troubleshooting and debugging
• Experience in developing test specifications, procedures, and debug guides for test solutions

Preferred Qualifications
• 8+ years of experience integrating lab tools for automated workflows and managing large-scale deployments
• 8+ years of experience with one or more of the following modules/domains: PCIe, NVLink, Networking, Flash, Memory, CPU, GPU, TPU, DRAM (DDR4/5 or HBM), AI silicon/AI accelerators
• 5+ years of experience using continuous integration and version control tools for system development and testing
• 5+ years of experience in software, firmware, and hardware engineering to develop systems/products for data center applications such as video processing, AI/ML, and networking
• 5+ years of experience defining HW/SW interface requirements for telemetry, diagnostics, and debugging
• Experience with debugging tools for SoCs (e.g., JTAG, GDB, Trace32) and knowledge of common bus protocols such as I2C, SPI, USB, and PCIe
• Proficiency in High-Performance Computing (HPC) or AI system architecture at rack level and at scale
• Proficiency in Linux environments and server system management
