Staff Technical Program Manager, Managed Intelligence

Crusoe

$193K — $234K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years as a Technical Program Manager in fast-paced technical environments, owning complex programs end-to-end.
  • Knowledge of LLM inference, batching strategies, and production scaling tradeoffs.
  • Experience with multi-tenant systems and SLA enforcement across workloads.
  • Familiarity with fine-tuning and alignment workflows in AI models.
  • Proven ability to build execution models in low-structure environments.
  • Exceptional verbal and written communication skills for executive updates.
  • Active use of AI tools to enhance program execution and risk detection.

Responsibilities

  • Own and plan multi-quarter release schedules for the Managed Inference platform.
  • Drive model version rollouts and capacity planning from start to finish.
  • Coordinate across multiple teams to ensure project alignment and success.
  • Proactively identify and mitigate risks before they escalate.
  • Develop execution frameworks and maintain real-time dashboards for tracking.
  • Lead pre-launch planning and validation for model onboarding on new GPU generations.
  • Persuade and align engineering and product stakeholders effectively.

Full Job Description
About This Role:

Crusoe is the world's first vertically integrated, sustainable AI cloud. We build and operate GPU infrastructure powered by clean energy, from data center design through IaaS products to managed inference at scale, enabling AI-native companies to run demanding workloads without compromising on sustainability or reliability. Crusoe Cloud is 1,400 people and growing, and the TPM frameworks are still being built -- which means there is a genuine opportunity to shape how the function operates rather than inherit how it already works.

The Managed Inference platform is where customers run production LLM workloads without managing low-level infrastructure, and it is one of Crusoe's fastest-growing product areas. The Staff TPM for Managed Intelligence connects model engineering, IaaS, product, and data center operations to deliver a reliable, scalable inference platform. You will own end-to-end program delivery across multi-quarter roadmaps, model onboarding, inference optimization, and production readiness for new model versions. Deep familiarity with the model layer -- including how LLMs are served, optimized, and evaluated in production -- is essential to being effective in this role.

What You'll Be Working On:
  • End-to-end program delivery: Own multi-quarter release planning, dependency governance, and executive communication across the Managed Inference platform.
  • Complex, high-risk program management: Drive model version rollouts, inference optimization campaigns, SLA readiness for new GPU hardware, and multi-tenant capacity planning from kickoff through delivery.
  • Cross-functional alignment: Coordinate across Model Engineering, IaaS, Cloud Foundations, Data Center Operations, and external model providers to keep programs on track and unblocked.
  • Proactive risk identification: Surface risks across model serving, reliability, capacity constraints, and vendor timelines before they become program-level problems.
  • Execution frameworks and dashboards: Build lightweight, scalable TPM frameworks suited to Crusoe's pace; maintain real-time execution dashboards and deliver crisp, data-driven executive updates.
  • Phase 0 planning for model onboarding: Own pre-launch planning for model onboarding on new GPU generations, including firmware and driver readiness, CUDA and ROCm stack validation, and commissioning criteria for inference workloads.
  • Stakeholder leadership: Drive alignment and push back effectively across engineering, product, and operations leadership -- including highly technical stakeholders who have not previously worked with a TPM.


What You'll Bring to the Team:
  • 7+ years of experience as a Technical Program Manager in fast-paced technical environments, with a track record of owning complex programs end-to-end across engineering and product organizations.
  • LLM inference and model serving knowledge: Working familiarity with batching strategies, quantization approaches, and the tradeoffs that govern latency, throughput, and cost at production scale.
  • Multi-tenant systems experience: Familiarity with isolation, quota management, and SLA enforcement across concurrent workloads.
  • Fine-tuning and alignment awareness: Sufficient familiarity with fine-tuning and alignment workflows to govern program timelines, identify technical risks, and coordinate across the teams that own them.
  • Low-structure execution: Proven ability to build execution models in environments where the process did not yet exist, and make them stick with teams that didn't ask for them.
  • Executive communication: Exceptional written and verbal communication for delivering clear, data-driven, decision-oriented updates to executive stakeholders.
  • AI tool integration: Active, daily use of AI tools to improve program execution, risk detection, and communication -- not just personal productivity.
  • Cross-functional influence: Proven ability to drive alignment across engineering, product, and infrastructure leadership without direct authority, including with highly technical stakeholders.


Bonus Points:
  • 1+ years of experience working with teams building platforms or services for AI inference and/or training.
  • Direct experience governing model onboarding programs across GPU generations, including firmware, driver, and stack validation.
  • Experience coaching or mentoring junior TPMs in a high-growth technical environment.
  • Exposure to multi-site or globally distributed engineering teams.
  • Background at a Series D to Series F company or a high-performing team within a hyperscaler focused on AI infrastructure.


Benefits:
  • Competitive compensation and equity packages
  • Restricted Stock Units
  • Paid time off, paid holidays & leave of absence programs
  • Comprehensive health, dental & vision insurance
  • Employer contributions to HSA
  • Paid parental leave
  • Paid life insurance, short-term and long-term disability
  • Professional development & tuition reimbursement
  • Mental health & wellness support
  • Commuter benefits (parking & transit)
  • Cell phone stipend
  • 401(k) Retirement plan with company match up to 4% of salary
  • Volunteer time off
  • Global travel insurance & emergency assistance
  • Daily meals allowance
  • Additional perks & programs specific to location


Compensation Range

Compensation will be paid in the range of $193,050 - $234,000 + Bonus. Restricted Stock Units are included in all offers. Compensation will be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.
