Senior Hardware Data Center Technician

Crusoe

$110K — $135K *
Telecommunications & Hardware
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in data center or hardware support roles
  • Expertise in diagnosing and repairing GPU-based servers
  • Deep technical understanding of server hardware, BMC manageability, and BIOS settings
  • At least 4 years in a data center environment
  • Familiarity with InfiniBand switches and networking
  • Basic Linux system administration skills
  • Strong problem-solving and communication skills
  • Associate's Degree or equivalent IT experience

Responsibilities

  • Diagnose and resolve hardware failures in GPU-based servers
  • Support burn-in/stress testing of new hardware
  • Manage vendor support tickets and liaise with vendor personnel
  • Maintain a precise spares inventory for repairs
  • Assist with racking and cabling servers for cloud deployments
  • Document hardware issues and communicate with teams and vendors
  • Participate in a rotating on-call schedule to handle after-hours issues

Benefits

  • Industry competitive pay
  • Restricted Stock Units in a growing tech company
  • Comprehensive health insurance including dental and vision
  • Employer HSA contributions
  • Paid parental leave and life insurance options
  • 401(k) matching up to 4%
  • Generous PTO and holiday schedule
  • Cell phone reimbursement and tuition reimbursement
  • Access to mental health support via Calm subscription
  • Legal services through MetLife Legal

Full Job Description
Crusoe is building the World's Favorite AI-first Cloud infrastructure company. We're pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.

Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.

About the Role:

This role is critical to ensuring industry-leading reliability and uptime for our cloud platform, directly impacting our ability to deliver innovative solutions to our customers. You'll be involved in exciting projects, from supporting the burn-in/stress testing of new hardware to troubleshooting complex server issues and collaborating with vendors. The ideal candidate is a highly skilled and experienced technician with a deep understanding of server hardware, a passion for problem-solving, and a commitment to maintaining peak performance in a fast-paced environment. This is a full-time position.

What You'll Be Working On:
  • Troubleshooting & Repair: Diagnose and resolve hardware failures in complex GPU-based servers (both air and liquid-cooled), ensuring minimal downtime.
  • Hardware Testing & Qualification: Collaborate with the Infrastructure Systems team to support burn-in/stress testing of new hardware and resolve any issues that arise. Support the qualification of new hardware.
  • Vendor Management: Open and manage support tickets with hardware vendors, serve as the datacenter liaison for vendor support personnel, and maintain a hardware issue tracker.
  • Inventory Management: Maintain an accurate spares inventory and replenish stock as needed to ensure quick repairs.
  • Deployment Support: Assist the Cloud Deployments team with racking and cabling servers, contributing to the efficient expansion of our infrastructure.
  • Documentation & Communication: Maintain detailed records of hardware issues and resolutions, and communicate effectively with internal teams and vendors.
  • Physical Demands: Work in a physically challenging environment (sound/vibration/thermal) and be able to lift 50 lbs.
  • On-Call Support: Participate in a rotating on-call schedule to address critical after-hours hardware issues.

What You'll Bring to the Team:
  • Experience: 5-7 years of hands-on experience in a data center, hardware support, or equivalent technical environment.
  • Server Hardware Expertise: Possess significant experience diagnosing and repairing complex GPU-based servers (both air and liquid-cooled).
  • Technical Proficiency: Demonstrate a deep understanding of server hardware, BMC-based manageability, BIOS settings, and firmware deployment.
  • Datacenter Experience: Have four or more years of hands-on experience working in a datacenter environment.
  • Networking Knowledge: Familiarity with InfiniBand switches and network topology.
  • Linux Skills: Basic Linux system administration expertise.
  • Problem-Solving Abilities: Excellent analytical and problem-solving skills to effectively troubleshoot hardware issues.
  • Communication Skills: Strong organizational, time management, and communication skills.
  • Education: Associate's Degree or equivalent experience in an IT-related field.
  • Background Check: Must be able to pass a background check.
  • Safety and Compliance: This position is designated a safety-sensitive position and/or is located in a safety-sensitive facility. Drug and alcohol program participation is required.


Bonus Points:
  • Experience with other high-performance computing (HPC) technologies.
  • Relevant certifications (e.g., CompTIA Server+, CCNA).
  • Experience with scripting languages (e.g., Python, Bash).
  • Knowledge of datacenter infrastructure management (DCIM) tools.
  • Experience working in a fast-growing startup environment.
  • Familiarity with various cooling systems used in data centers.
  • Experience with liquid cooling systems.


Benefits:
  • Industry competitive pay
  • Restricted Stock Units in a fast-growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal


Compensation Range

Compensation will be paid in the range of $110,000 - $135,000 + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
