Hardware Sustaining Engineer

DigitalOcean

$83K — $104K *
Telecommunications & Hardware
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science/Engineering or equivalent experience
  • Hands-on experience with mid-tier cloud infrastructure
  • In-depth understanding of server hardware and firmware
  • Proficient in troubleshooting techniques, Python, and BASH
  • Experience with JTAG debugging, firmware troubleshooting, or wire sniffing is a plus
  • Strong communication skills for collaboration with stakeholders
  • Passion for continual improvement in processes and technologies

Responsibilities

  • Act as part of the Sustaining Engineering team
  • Support server hardware and networking through its operational lifecycle
  • Monitor issues via the #machines channel and MACHINES JIRA project
  • Participate in 24/7 on-call rotation with the engineering team
  • Serve as Tier 2 escalation for hardware/firmware issues reported by DCOPS and CloudOps
  • Develop and maintain operational standards for DigitalOcean hardware
  • Collaborate with various teams to address tooling and firmware concerns

Benefits

  • Remote work flexibility
  • Opportunities for professional development
  • Access to the latest technologies and innovative projects
  • Collaborative and inclusive company culture
  • Supportive team structure with continuous learning opportunities
Full Job Description
Reporting to the Manager of Infra::Machines::Design, you will support sustaining engineering efforts for the hardware infrastructure of the DigitalOcean server fleet. The ideal candidate will be eager to face new challenges as DigitalOcean continues to scale its data center footprint and infrastructure cloud capacity and explore new technologies and capabilities to bring to our customers, providing top-tier support for our hardware and firmware in the DO fleet.
What You'll Be Doing:
  • Act a member of the Sustaining Engineering team in the Infra::Machines::Design Organization
  • Support server hardware, cabling, and networking hardware throughout its operational lifecycle
  • Monitor the #machines channel and MACHINES JIRA project for issues and drive them to resolution
  • Participate in 24/7 on-call rotation with other members of the team
  • Act as Tier 2 escalation for Datacenter Operations (DCOPS) and Cloud Operations (CloudOps) regarding hardware and firmware components
  • Develop and maintain standards and practices for DigitalOcean hardware operations
  • Work closely with the Qualification team, Firmware team, Fleet Lifecycle Engineering team (FLE), Foresight team, and Infrastructure Services team to resolve issues in tooling, firmware packages, hardware components, and other operational concerns
  • Help with development of tooling and associated runbooks to address gaps in operational capabilities around hardware and firmware operations
  • Coordinate with Ops teams on monitoring thresholds, failure modes and alerting
  • Assist in troubleshooting cause of failures and work to prevent them in the future
  • Raise the quality bar in the delivery of our cloud infrastructure by identifying industry best practices and working to adopt them
What We'll Expect From You:
  • Technical Degree (BS Computer Science/Engineering) or equivalent practical experience
  • Hands-on experience operating a cloud infrastructure at mid-tier scale or better
  • An in-depth understanding of server hardware, firmware, and infrastructure
  • Strong knowledge in troubleshooting techniques, Python and BASH
  • Extra points for JTAG debugging / Firmware troubleshooting / Wire sniffing experience
  • Clear communication and collaboration across key stakeholders
  • An insatiable passion for constant improvement
Compensation Range:
  • $83,000 - $104,000

*This is a remote role



#LI-Remote

Similar Jobs

More Jobs at DigitalOcean

More Telecommunications & Hardware Jobs

Find similar Hardware Sustaining Engineer jobs: