Sr. Engineering Manager, Managed Platform Services

Crusoe

$245K — $295K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in engineering management or related field
  • Hands-on expertise in observability, machine learning, and automated remediation
  • Demonstrated ability to lead and manage teams with empathy and effectiveness
  • Experience owning and delivering complex engineering projects from start to finish
  • Strong cross-functional communication skills, both verbal and written
  • Background in operating global services at scale
  • Highly organized and able to juggle multiple initiatives simultaneously

Responsibilities

  • Drive the roadmap for insight generation and automated actions based on telemetry data
  • Influence long-term team goals and metrics through strategic planning
  • Collaborate to refine product requirements for complex problems early in the scoping phase
  • Foster cross-functional partnerships to ensure integrated solution delivery
  • Lead critical engineering projects, ensuring auditable customer outcomes
  • Champion technical excellence and best practices across the team
  • Coach and mentor engineers, defining career paths and performance expectations

Benefits

  • Competitive compensation and equity packages
  • Comprehensive health, dental, and vision insurance
  • Paid time off, holidays, and leave of absence programs
  • Employer contributions to HSA accounts
  • Paid parental leave and life insurance
  • 401(k) retirement plan with company match
  • Professional development opportunities including tuition reimbursement
  • Mental health and wellness support
  • Daily meal allowances and commuter benefits
  • Global travel insurance and emergency assistance programs
Full Job Description
About the Role:

Join Crusoe as a Senior Engineering Manager and lead a talented team focused on revolutionizing our cloud infrastructure. In this pivotal role, you'll lead the Command Center Insights & Actions team - building the systems that translate raw infrastructure telemetry into human-readable diagnostics and automated remediation workflows. You'll own a technical roadmap spanning alerting engines, heuristic development, node health systems, and state machines that trigger proactive maintenance without impacting customer workloads, while exploring the integration of Large Language Models (LLMs) to build cutting-edge AI solutions within our Command Center product. This is a full-time opportunity for a passionate leader who thrives on building high-performing teams, fostering innovation, and delivering impactful, data-driven solutions in a dynamic environment.

What You'll Be Working On:
  • Drive the Insights & Actions Roadmap: Own and execute across alerting infrastructure, control plane APIs, automated action systems, and telemetry-derived insights such as straggler node detection and GPU profiling.
  • Influence Strategic Roadmaps: Contribute significantly to the team's roadmap, impacting long-term team goals and operational performance metrics.
  • Refine Early Product Requirements: Collaborate with product and engineering leadership to bring clarity to ambiguous problems early in the scoping process.
  • Collaborate Cross-Functionally: Partner with product, design, and engineering teams inside and outside the organization to align on goals and deliver integrated solutions.
  • Manage Complex Projects: Lead critical initiatives involving multiple engineers, including those outside your direct report structure, ensuring customer outcomes are auditable and decisions are data-driven.
  • Drive Technical Excellence: Champion process improvements, operational excellence, and best practices across the team.
  • Cultivate Team Growth: Coach and mentor engineers from new grad to Staff level, setting clear performance expectations and defining career paths to build a high-performing, sustainable team.


What You'll Bring to the Team:
  • Technical Expertise in Observability & Intelligence Systems: Hands-on background in ML, heuristics, or rule-based systems - with the ability to engage deeply on problems like anomaly detection, threshold design, and automated remediation logic.
  • Proven Leadership: Demonstrated track record of people management, leading with empathy, and maintaining a sustainable workload for your teams.
  • Technical Acumen: Ability to lead effectively in spaces where problems, opportunities, and strategies are not yet fully defined - driving clarity, direction, and execution.
  • Cross-Functional Collaboration: Excellent technical communication skills, both verbal and written, to work effectively across diverse roles and functions.
  • Project Ownership: Proven experience owning and delivering complex projects end-to-end, with measurable quality and data-driven decision-making.
  • Global Scale Experience: Background building and operating global services at scale.
  • Organizational Prowess: Highly organized and capable of managing multiple complex initiatives and team priorities in parallel.


Bonus Points
  • Background in data platforms and data science
  • Background in observability platforms or products
  • Familiarity with GPU profiling tools (Nsight, NCCL Inspector) or infrastructure diagnostics at the hardware layer
  • Highly motivated and proactive in identifying process improvements and boosting team efficiency
  • Passion for coaching and mentoring engineers into high-performing individuals
  • Enthusiasm for building team culture with a high quality of life for engineers
  • A true "people-person" who thrives in collaborative environments and is energized by teamwork


Benefits:
  • Competitive compensation and equity packages
  • Restricted Stock Units
  • Paid time off, paid holidays & leave of absence programs
  • Comprehensive health, dental & vision insurance
  • Employer contributions to HSA account
  • Paid parental leave
  • Paid life insurance, short-term and long-term disability
  • Professional development & tuition reimbursement
  • Mental health & wellness support
  • Commuter benefits (parking & transit)
  • Cell phone stipend
  • 401(k) Retirement plan with company match up to 4% of salary
  • Volunteer time off
  • Global travel insurance & emergency assistance
  • Daily meals allowance
  • Additional perks & programs specific to location


Compensation Range

Compensation will be paid in the range of up to $245,000 -$295,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.

Similar Jobs

More Jobs at Crusoe

More Enterprise Technology Jobs

Find similar Sr. Engineering Manager, Managed Platform Services jobs: