KLA Tencor

HPC Systems Engineer

KLA Tencor: $159K - $271K *
Technical Services
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Doctorate with 3+ years, Master's Degree with 6+ years, or Bachelor's Degree with 8+ years of related work experience
  • Deep expertise in Linux operating systems (SUSE, Red Hat, Rocky Linux, Ubuntu)
  • Strong experience in architecting and maintaining robust storage systems
  • Solid understanding of HPC hardware ecosystems, including servers and networking
  • Proficiency in scripting and development with Shell and Python
  • Experience with configuration management tools (Ansible, Salt, Chef, Puppet)
  • Familiarity with HPC schedulers (SGE, SLURM)

Responsibilities

  • Design and architect scalable, high-performance HPC cluster solutions
  • Lead deployment, configuration, and lifecycle management of HPC infrastructure
  • Collaborate with developers to translate requirements into technical solutions
  • Drive solutions from design through to production support
  • Ensure system reliability and performance across compute, storage, and networking
  • Support ongoing operations and troubleshoot HPC systems
  • Contribute to automation and DevOps best practices across the platform

Benefits

  • Participation in performance incentive programs
  • Medical, dental, and vision insurance
  • 401(K) with company matching
  • Employee stock purchase program (ESPP)
  • Tuition reimbursement program
  • Wellness benefits, including EAP
  • Paid time off and company holidays
Full Job Description

Group/Division

Enabling the movement toward advanced chip design, KLA's Measurement, Analytics and Control group (MACH) is looking for the best and brightest research scientists, software engineers, application development engineers and senior product technology process engineers to join our team. The MACH team's mission is to collaborate with our customers to innovate technologies and solutions that detect and control highly complex process variations at their source, rather than compensate for them at later stages of the manufacturing process. Chipmakers around the globe rely on KLA, with its more than 40 years of semiconductor process control experience, to ensure that their fabs ramp next-generation devices to volume production quickly and cost-effectively. Our MACH team develops leading-edge solutions for patterning process analytics and control technologies, thereby providing customers with critical insight through feature-level, field-level and cross-wafer analysis. Our teams also develop advanced modeling simulation, data analytics and process control modeling technologies. As a member of the MACH team, you’ll be joining the most sophisticated and successful process-control company in the semiconductor industry, working across functions to solve the most complex technical problems in the digital age.

Job Description/Preferred Qualifications

Role Overview

In this role, you will lead the architecture, deployment, and operational support of a high-performance computing (HPC) cluster platform used across IC fabrication facilities and mask shops globally.

You will partner with engineering stakeholders to gather requirements, design scalable solutions, and drive implementation from concept through production. This role requires a strong balance of systems architecture, hands-on engineering, and operational excellence in complex HPC environments.

Key Responsibilities

  • Design and architect scalable, high-performance HPC cluster solutions for global manufacturing environments
  • Lead deployment, configuration, and lifecycle management of cluster infrastructure
  • Collaborate with developers and cross-functional teams to understand requirements and translate them into technical solutions
  • Drive solutions from design through production, including implementation, validation, and support
  • Ensure system reliability, performance, and availability across compute, storage, and networking layers
  • Support ongoing operations, troubleshooting, and continuous improvement of HPC systems
  • Contribute to automation, standardization, and DevOps best practices across the platform

Qualifications & Experience

Systems & Infrastructure

  • Deep expertise in Linux operating systems (SUSE, Red Hat, Rocky Linux, Ubuntu)
  • Strong experience architecting and maintaining robust storage systems
  • Solid understanding of HPC hardware ecosystems, including servers, GPUs, networking, storage, schedulers, BIOS, and BMC
  • Experience with virtualization technologies such as VMware, Proxmox, or XCP-ng

Networking & Core Services

  • Strong understanding of TCP/IP fundamentals and network protocols (DNS, DHCP, HTTP, LDAP, SMTP)
  • Experience with file sharing technologies (NFS, CIFS)
  • Familiarity with network boot (PXE) and high-availability Linux configurations

Automation & DevOps

  • Proficiency in scripting and development using Shell and Python
  • Experience with configuration management tools (Ansible, Salt, Chef, Puppet)
  • Strong DevOps mindset, including CI/CD pipelines and Git-based repositories

Platforms & Tools

  • Experience with HPC schedulers (SGE, SLURM)
  • Familiarity with web servers and traffic management (Apache, Nginx, reverse proxy, load balancing via HAProxy)
  • Monitoring and observability tools (Prometheus, Grafana, Nagios)
  • Database experience with MySQL

Minimum Qualifications

Doctorate (Academic) Degree and related work experience of 3+ years; Master's Level Degree and related work experience of 6+ years; Bachelor's Level Degree and related work experience of 8+ years

Base Pay Range: $159,500.00 - $271,200.00 Annually

Primary Location: USA-CA-Milpitas-KLA

KLA’s total rewards package for employees may also include participation in performance incentive programs and eligibility for additional benefits including but not limited to: medical, dental, vision, life, and other voluntary benefits, 401(K) including company matching, employee stock purchase program (ESPP), student debt assistance, tuition reimbursement program, development and career growth opportunities and programs, financial planning benefits, wellness benefits including an employee assistance program (EAP), paid time off and paid company holidays, and family care and bonding leave.

Interns are eligible for some of the benefits listed. Our pay ranges are determined by role, level, and location. The range displayed reflects the pay for this position in the primary location identified in this posting. Actual pay depends on several factors, including state minimum wage rates, location, job-related skills, experience, and relevant education level or training. We are committed to complying with all applicable federal and state minimum wage requirements. If applicable, your recruiter can share more about the specific pay range for your preferred location during the hiring process.

About KLA Tencor

KLA Corporation is a global capital equipment company that provides process control solutions for semiconductor and related industries. The Company's products are also used in a number of other high technology industries, including the packaging, light emitting diode (LED), power device and compound semiconductor markets. Its products and services are used by bare wafer, integrated circuit (IC), lithography reticle (reticle or mask) and disk manufacturers around the world. The Company's inspection and metrology products and related offerings are categorized in various groups, including Chip Manufacturing, Wafer Manufacturing, Reticle Manufacturing, LED, Power Device and Compound Semiconductor Manufacturing, Data Storage Media/Head Manufacturing, Microelectromechanical Systems (MEMS) Manufacturing, and General Purpose/Lab Applications.
Size: 11,300 employees
Market Cap: $52 billion
Industry
Net Income: $1.3 billion
Founded: 1997
5 Year Trend: +21.5%
Revenue: $6 billion
NASDAQ
