HPC AI Systems Administrator

Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, MIS, or related field required.
  • 8-10 years of Linux system administration experience required, ideally in HPC or AI environments.
  • Strong background in Linux or network administration is preferred.
  • Experience with advanced lab system administration is a plus.
  • Demonstrated ability to mentor junior staff in system administration.

Responsibilities

  • Image, configure, and upgrade servers with Linux OS, including firmware and switch configurations.
  • Manage multiple root slots for HPC cluster provisioning and testing workflows.
  • Support virtualized lab infrastructure and design highly available environments.
  • Oversee installation and performance management of high-performance storage systems.
  • Coordinate hardware and software troubleshooting with infrastructure support teams.
  • Design lab layouts and operational policies that comply with cybersecurity standards.
  • Prioritize lab requests and ensure effective resource utilization.

Benefits

  • Comprehensive suite of health and wellbeing benefits for team members and their families.
  • Investment in personal and professional development to achieve career goals.
  • Commitment to unconditional inclusion and valuing individual uniqueness.
  • Flexibility to balance work and personal life needs.

Full Job Description
HPC AI Systems Administrator

This role has been designed as ‘Onsite’ with an expectation that you will primarily work from an HPE office.

Job Description:

   

This position will support government accounts.  Therefore, due to federal export-control regulations, the selected candidate must hold U.S. citizenship, U.S. lawful permanent resident/Green Card status or otherwise have a category of refugee/asylee status enabling them to perform the role without requiring a license under the International Traffic in Arms Regulations (ITAR) or Export Administration Regulations (EAR).

The Data Center Administration team is seeking a Senior System Administrator to provide advanced system administration and lab operations support for hardware, network, and software environments used by HPE HPC & AI Performance Engineering teams. These environments support internal product development, performance engineering, ISV validation, and customer-facing sales and benchmarking activities.
This role serves as a senior technical contributor and lab expert, providing design guidance, operational leadership, and escalation-level troubleshooting across complex HPC and AI lab environments. The position partners closely with engineering teams, infrastructure support groups, and external partners to ensure lab stability, availability, and effective use of resources.
The Senior System Administrator contributes to continuous improvement of lab processes, policies, and standards, prioritizes lab requests, mentors junior staff, and supports future lab expansion and facility transitions.

Essential Job Functions and Duties

  • Image, configure, and upgrade servers with Linux operating systems, including firmware updates and switch configuration to support lab environments.
  • Configure and manage multiple root slots hosting varied operating system images in support of HPC cluster provisioning, validation, and testing workflows.
  • Provide design guidance and operational support for virtualized lab infrastructure, including virtual server administration and the design of highly available, fault-tolerant environments.
  • Provide design guidance for lab storage solutions, including installation, configuration, and performance management of high-performance storage systems (e.g., Lustre) to support sales, benchmarking, and partner activities.
  • Provide guidance for hardware and software installation and configuration, including advanced hardware diagnostics and coordination with infrastructure support teams to resolve power, CPU, and GPU issues.
  • Collaborate with AI benchmarking, R&D, and performance engineering teams to design and operate lab environments that meet internal, partner, and customer requirements.
  • Design lab layouts, networks, and operational policies that meet functional needs while adhering to cybersecurity and asset protection standards.
  • Prioritize and coordinate lab work activities to ensure timely delivery of high-impact requests and effective utilization of lab resources.
  • Make recommendations on lab resource usage, capacity planning, and future expansion to support evolving business and engineering needs.
  • Oversee and support lab transitions, including facility moves and infrastructure refresh activities.
  • Install, configure, and support job scheduling and resource management tools to maximize lab utilization.
  • Serve as a technical mentor to junior system administrators and lab staff, providing guidance on best practices, troubleshooting, and operational standards.
  • Communicate lab successes, risks, failures, and issues to management in a timely and professional manner.
  • Work effectively with remote administrators, vendors, and partners when specialized expertise or additional support is required.

Job-Specific Competencies

  • Communication – Communicates clearly and effectively in both written and verbal forms; collaborates well with diverse technical teams.
  • Creativity / Innovation – Applies creative problem-solving approaches and contributes to continuous improvement of lab processes and capabilities.
  • Customer Service – Demonstrates a service-oriented mindset when supporting internal teams, partners, and stakeholders.
  • Job Knowledge – Maintains deep technical knowledge of Linux systems, lab operations, and HPC/AI infrastructure.
  • Problem Solving / Analysis – Breaks down complex technical issues, identifies root causes, and develops effective solutions.
  • Quality – Demonstrates attention to detail, accuracy, and reliability.
  • Technical Skills – Strong expertise in Linux system administration with working knowledge of networking, storage, virtualization, and hardware platforms.

Education and Experience

  • Bachelor’s degree in Computer Science, MIS, or a related technical field required, with a primary focus on system administration.
  • Minimum of 8–10 years of Linux system administration experience required, preferably in HPC, AI, or lab-based environments.
  • Candidates with strong Linux or network administration backgrounds and demonstrated interest in advanced lab system administration will also be considered.
  • This role works as part of a team of system administrators and lab staff and reports to the Data Center Administration Manager.

Additional Skills:

Accountability, Active Learning, Active Listening, Administrative Procedures, Agile Methodology, Agile Scrum Development, Bias, Business, Coaching, Company Policies, Creativity, Critical Thinking, Data Analysis Management, Data Collection Management (Inactive), Deliverables Management, Design, Design Thinking, Document Controls, Empathy, External Customers, File Maintenance, Follow-Through, Group Problem Solving, Growth Mindset {+ 5 more}

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Job:

Engineering Services

Job Level:

Expert

    

"The expected salary/wage range for this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level.
– United States of America: Annual Salary USD 105,500 - 243,000 in Minnesota & Wisconsin
The listed salary range reflects base salary. Variable incentives may also be offered."

Information about employee benefits offered in the US can be found at https://myhperewards.com/main/new-hire-enrollment.html

About Hewlett Packard Enterprise Development LP

Hewlett Packard Enterprise Development LP Careers

Joining Hewlett Packard Enterprise Development LP presents an unparalleled opportunity to advance a career in technology alongside some of the industry's most innovative minds. Hewlett Packard Enterprise Development LP stands as a beacon of innovation, leadership, and professional growth, offering a plethora of job opportunities that cater to a diverse range of skills and experiences.

Explore Career Opportunities

Hewlett Packard Enterprise Development LP is actively hiring, seeking passionate, creative, and solution-driven team players. Explore open positions that align with your skills and interests in areas ranging from engineering to marketing, and sales to IT. Each position at Hewlett Packard Enterprise Development LP not only boosts professional growth but also contributes to the company's culture of innovation and leadership.

Internship Programs

Kickstart your career with Hewlett Packard Enterprise Development LP’s internship programs. These opportunities allow interns to work on real projects, gaining hands-on experience and insights into the company's operations. Internships are a gateway to full-time employment, offering invaluable networking opportunities and a chance to build a professional resume.

Employee Benefits and Culture

Hewlett Packard Enterprise Development LP is committed to fostering a workplace where diversity and inclusion are integral to the company culture. Employees enjoy a range of benefits designed to support their physical, financial, and emotional well-being. From health insurance to retirement plans and flexible working conditions, the company ensures that team members have what they need to succeed.

Professional Development and Growth

The commitment to employee growth is evident through comprehensive training and development programs that encourage continuous learning and career advancement. Leadership development and diversity training are pillars of the company's strategy, ensuring that all team members have the opportunity to lead and innovate.

Networking and Innovation

At Hewlett Packard Enterprise Development LP, networking goes hand in hand with innovation. Employees are encouraged to connect with colleagues and leaders through various professional networks and company-sponsored events. This culture of collaboration drives the development of groundbreaking solutions and services, reinforcing the company's position at the forefront of the technology sector.

Join the Hewlett Packard Enterprise Development LP Team

Search for job opportunities that match your skills and interests. Hewlett Packard Enterprise Development LP looks for individuals eager to drive innovation and lead in the digital era. Prepare your resume, hone your interview skills, and get ready to join a team that values growth and leadership.

Stay Connected with Hewlett Packard Enterprise Development LP Careers

Keep up to date with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the professionals who work at Hewlett Packard Enterprise Development LP.
