Senior Manager, Infrastructure Platform Engineering

Crusoe

$245K - $295K
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 10+ years in infrastructure or systems software development, with 3+ years in an engineering leadership role
  • Deep expertise in large-scale infrastructure platforms, focusing on resource pooling and allocation
  • Strong background with Kubernetes, and cloud platforms like GCP, AWS, or Azure
  • Experience with distributed state management and control systems
  • Proven track record in efficiency, capacity, or performance engineering
  • Hands-on management style with a focus on team growth
  • Experience hiring and mentoring infrastructure engineers

Responsibilities

  • Lead the team responsible for building core platform services
  • Set technical direction for capacity, utilization, and platform security
  • Drive the design of secure and instrumented platform systems
  • Hire, mentor, and grow a high-performing team of engineers
  • Partner with infrastructure and security teams on operational reliability
  • Improve platform efficiency and availability by identifying bottlenecks
  • Establish engineering standards for infrastructure software development

Benefits

  • Competitive compensation and equity packages
  • Comprehensive health, dental & vision insurance
  • Paid time off and holidays
  • Employer contributions to HSA
  • Paid parental leave
  • Professional development and tuition reimbursement
  • Mental health and wellness support
  • 401(k) retirement plan with company match
  • Volunteer time off and global travel insurance
  • Daily meals allowance
Full Job Description
About the Role

We are seeking a Senior Manager, Infrastructure Platform Engineering to lead a team building core systems that turn large-scale compute infrastructure into reliable, secure, and efficiently allocatable capacity. The team owns foundational services spanning resource pooling and allocation, capacity and utilization intelligence, fleet and system lifecycle management, and platform security and trust.

This is a hands-on management role for a leader who has come up through infrastructure and systems software engineering, understands the realities of operating compute at scale across cloud and on-premise environments, and is energized by building the control and platform systems that other engineering teams depend on. You'll lead a growing team of infrastructure software engineers, set technical direction across the platform, and partner closely with adjacent infrastructure, production engineering, and security teams to keep the substrate reliable, well-utilized, and easy to build on.

While this is an infrastructure-focused role rather than a traditional product role, the systems this team builds are essential to the experience our customers have on the platform. Reliable capacity, healthy systems, and a trustworthy substrate are what make a seamless, dependable customer experience possible - so the team's work directly underpins the business, even as its immediate users are the internal engineering teams building and operating workloads on top of it.

What You'll Be Working On
  • Leading the team responsible for the platform services that abstract underlying infrastructure into reliable, allocatable capacity, and for the systems that track and reconcile state across a large fleet
  • Setting the technical roadmap across capacity and utilization intelligence, resource lifecycle and state management, and platform security and trust frameworks
  • Driving the design of secure, well-instrumented platform systems - from Kubernetes-based orchestration and automation to lower-level system and hardware integration
  • Hiring, mentoring, and growing a team of infrastructure software engineers; building a high-performing organization from a strong foundation
  • Partnering with infrastructure, production engineering, and security teams to align platform capabilities with operational reliability, capacity, and trust requirements
  • Improving platform efficiency and availability - characterizing bottlenecks, reducing stranded resources, and shortening operational and recovery cycles
  • Establishing engineering standards for infrastructure software development: code quality, testing, deployment safety, and on-call practices for systems that span the platform
  • Translating a vertically integrated infrastructure stack into reliable platform primitives that engineering teams can build on
  • Staying technically hands-on - reviewing designs, contributing to architecture decisions, and being credible to the engineers you lead


What You'll Bring to the Team
  • 10+ years of experience in infrastructure or systems software development, with at least 3 years in an engineering leadership role
  • Deep expertise in large-scale infrastructure platforms - building services that pool, allocate, and reconcile compute resources at scale
  • Strong background with Kubernetes and cloud platforms (GCP, AWS, or Azure) - orchestration, automation, and operating distributed systems in production
  • Experience with distributed state management and control systems - modeling resource and system lifecycle, reconciling desired vs. actual state, and handling failure gracefully across a large fleet
  • Experience with efficiency, capacity, or performance engineering - characterizing system behavior, identifying bottlenecks, and driving measurable improvements in utilization or availability
  • A player-coach approach to management: hands-on enough to make technical calls, structured enough to grow a team and ship through them
  • Track record of hiring strong infrastructure engineers and helping them grow into more senior roles
  • Comfort operating in a fast-moving environment where the path isn't fully paved, and a willingness to drive ambiguity to clarity


Bonus Points
  • Experience operating Kubernetes on bare-metal infrastructure as well as on managed cloud services (GKE, EKS, AKS)
  • Familiarity with the operational challenges of GPU clusters, AI training, and inference workloads
  • Working knowledge of platform security and trust concepts - secure boot, measured boot, TPMs, and hardware attestation
  • Experience with capacity forecasting, demand modeling, or allocation optimization at scale
  • Hands-on background with telemetry and observability platforms at scale (Prometheus, OpenTelemetry, Grafana)
  • Prior experience building infrastructure platforms at hyperscalers or cloud providers where internal engineers are the primary customer
  • Familiarity with hardware-software co-design - understanding how platform choices affect physical infrastructure utilization


Benefits
  • Competitive compensation and equity packages
  • Restricted Stock Units
  • Paid time off, paid holidays & leave of absence programs
  • Comprehensive health, dental & vision insurance
  • Employer contributions to HSA
  • Paid parental leave
  • Paid life insurance, short-term and long-term disability
  • Professional development & tuition reimbursement
  • Mental health & wellness support
  • Commuter benefits (parking & transit)
  • Cell phone stipend
  • 401(k) retirement plan with company match up to 4% of salary
  • Volunteer time off
  • Global travel insurance & emergency assistance
  • Daily meals allowance
  • Additional perks & programs specific to location


Compensation Range

Compensation will be paid in the range of $245,000 - $295,000 + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.
