IT042 High Performance Computing (HPC) and Storage System Administrator

ADNET Systems, Inc.

• $100K — $130K *

Greenbelt, MD 20770 (In-Person)

Information Technology

5 - 7 years of experience

Today

Job Overview by Ladders

Qualifications

Expert experience in Linux System Administration for enterprise distributions (RHEL, Ubuntu Server, etc.)
Hands-on knowledge of high-performance file systems, especially IBM Spectrum Scale or Lustre
Familiarity with HPC management software like Slurm for scheduling
Strong foundation in systems security and compliance protocols
Experience with Agile methodologies and collaborative project tools (Jira, GitLab)
Proficiency in GPU systems administration and performance tuning
Master's degree in a relevant field and over 5 years of experience
Must be a US citizen with eligibility for government clearance

Responsibilities

Manage daily operations of large-scale supercomputing clusters to ensure optimal performance
Administer high-performance storage systems, including configuration and maintenance
Oversee workload management and job scheduling using cluster management software
Implement security patches and compliance measures proactively
Conduct preventative and corrective maintenance, troubleshoot system issues
Provide expert-level user support for complex software and research challenges
Administer and optimize GPU-accelerated computing environments

Benefits

Annual Leave/Sick Leave
Military and Family Emergency Leave
Paid Holidays
Performance Bonuses
Medical, Dental and Vision Plans
401K Plan with Company Matching
Tuition Reimbursement
Swag bags

Full Job Description

This job description is for a High Performance Computing and Storage System Administrator to support the operations of the Integrated Modeling Computing Center (IMCC), formerly known as the NASA Center for Climate Simulation (NCCS). The IMCC will directly support the Integrated Modeling Virtual Institute (IMVI) to meet NASA's Earth science modeling needs. The core duties, responsibilities, and required technical skills are described below. Ideal candidates have excellent communication and problem-solving skills and the ability to work efficiently within a high-performing team environment.

Core Duties & Responsibilities:

Full Operational Management: Perform day-to-day operations and management of large-scale supercomputing clusters to meet required availability and performance, including, but not limited to, integration, provisioning, software stack deployment, updates, hardware and software maintenance, and decommissioning.
High-Performance Storage Administration: Deploy, tune, configure, maintain, and operate massive parallel file systems.
Workload and Schedule Management: Manage, configure, optimize, and troubleshoot cluster management and job scheduling software.
Security, Patches, and Compliance: Proactively implement security updates, coordinate systematic operating system kernel patching, and mitigate vulnerabilities across computing and storage environments without compromising system stability.
Preventative and Corrective Maintenance: Coordinate vendor-supported maintenance schedules, conduct hardware and software diagnostics, and participate in rapid-response resolution during service degradations or system blackouts.
User Support: Provide specialized, tiered technical assistance ranging from software provisioning and workflow optimization to advanced, expert-level troubleshooting for complex research challenges.
GPU System Administration: Provision, configure, and maintain GPU-accelerated computing systems, including driver management, library configuration, and performance optimization for workload acceleration.

Required Technical Skills and Qualifications:

Expert Linux System Administration: Advanced, production-level expertise in enterprise Linux distributions (RHEL, Rocky Linux, AlmaLinux, or Ubuntu Server), including expert-level command-line proficiency, kernel tuning, and automation scripting (Bash, Python).
Parallel File Systems Architecture: Hands-on experience in the design, deployment, scaling, and/or optimization of high-performance file systems. Experience in deploying, configuring, and operating IBM Spectrum Scale and/or Lustre.
Scheduling Proficiency: Working familiarity with HPC resource management, including experience with Slurm.
Systems Security Alignment: Robust foundation in core security practices, including firewalls, identity management (LDAP/Active Directory), access control lists (ACLs), SSH hardening, and continuous patch management cycles.
Agile Methodologies: Experience operating within modern Agile frameworks (Scrum, Kanban), leveraging iterative workflows, participating in sprint reviews, and using collaborative project boards (Jira, GitLab) to track milestones.
GPU Accelerator Management: Proficiency in configuring and maintaining GPU-accelerated computing environments, including driver installation/management, CUDA or similar library configuration, and performance tuning for accelerated workloads.
An MS degree and 5+ years of experience in relevant work areas.
US Citizenship required.
Ability to obtain and maintain a Tier 1 or Tier 2 Investigation through NASA.

Some features of our compensation plans and environment perks include:

Annual Leave/Sick Leave
Military and Family Emergency Leave
Paid Holidays
Performance Bonuses
Medical, Dental and Vision Plans
Direct Deposit Payroll
401K Plan with Company Matching
Tuition Reimbursement
Swag bags

* Ladders Estimates
