Senior Technical Product Manager, Observability

Vultr

$130K - $165K
US-Anywhere, Remote in United States
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years in product management focusing on cloud infrastructure and observability
  • Expertise in observability and monitoring systems (metrics, logging, tracing, telemetry)
  • Proven ability to define product strategies for large-scale platform products
  • Solid technical background for effective collaboration with engineering on telemetry and data architecture
  • Experience with the observability challenges of GPU, AI/ML, or HPC systems
  • History of delivering impactful developer and operator-facing products
  • Demonstrated ability to work in cross-functional teams within high-velocity environments
  • Strong communication skills for diverse audiences
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)

Responsibilities

  • Own the Observability Platform roadmap across multiple telemetry functions
  • Define the observability strategy aligned with infrastructure and reliability goals
  • Drive customer-facing observability features like dashboards and APIs
  • Translate low-level system signals into actionable health views and debugging workflows
  • Collaborate with engineering on technical aspects of telemetry and data models
  • Understand and build products for distributed AI workloads
  • Create health models to identify performance anomalies at scale
  • Ensure observability is integrated into new infrastructure and platform launches
  • Stay updated on observability technology trends and AI infrastructure needs

Benefits

  • Opportunity to influence the future of AI Infrastructure
  • Work in a high-growth technology company with a visible role
  • Collaborative environment with strong teamwork across multiple teams
  • Chance to build products that address unique observability challenges
  • Engagement in a fast-paced and dynamic work culture
Full Job Description
Join Vultr

Vultr is seeking a highly skilled and experienced Senior Technical Product Manager to own the Observability Platform, the system that provides telemetry ingestion, querying, visualization, alerting, and retention for large-scale GPU clusters and multi-tenant cloud environments. The ideal candidate brings deep technical fluency in observability infrastructure, distributed systems monitoring, and cloud-native telemetry, combined with a strong product instinct for developer and operator experiences. This is a highly visible role in a high-growth technology company and requires close partnership with the Compute, Networking, and Platform teams to ensure every new infrastructure launch is observable by design. This is your opportunity to join our fast-growing team and leave your mark on Vultr and the future of AI Infrastructure.

Key Responsibilities
  • Own the end-to-end Observability Platform roadmap across telemetry ingestion, querying, visualization, alerting, and retention for large-scale GPU clusters and multi-tenant cloud environments
  • Define Vultr's observability strategy across bare metal, VMs, Kubernetes, and managed services, aligned to infrastructure roadmap, reliability goals, and customer experience
  • Drive the customer-facing observability surface across dashboards, APIs, telemetry pipelines, and topology-aware insights
  • Translate low-level signals across GPU, CPU, memory, storage, and network into actionable health views, alerts, and debugging workflows for customers
  • Work closely with engineering on technical tradeoffs across metrics agents, collectors, data models, telemetry pipelines, APIs, and retention architecture
  • Build products for distributed AI environments by understanding how training and inference workloads behave across nodes, clusters, schedulers, and network fabrics
  • Define health models that help customers quickly identify degraded nodes, performance anomalies, and cluster bottlenecks at fleet scale
  • Ensure new infrastructure and platform launches are observable by design through strong partnership with compute, network, and platform teams
  • Stay current on modern observability stacks and AI infrastructure trends, including how GPU workloads change performance analysis, cost attribution, and operational workflows

Qualifications
  • 7+ years of product management experience in cloud infrastructure, observability, monitoring, or developer platforms
  • Deep understanding of observability and monitoring systems, including metrics, logging, tracing, alerting, and telemetry pipeline architecture
  • Experience defining product strategy and roadmaps for platform or infrastructure products at scale
  • Strong technical background, with the ability to engage with engineering on telemetry agents, data models, query engines, retention, and distributed systems
  • Experience with GPU, AI/ML, or HPC infrastructure monitoring and the unique observability challenges of training and inference workloads
  • Track record of shipping developer- and operator-facing products with measurable impact on reliability, time-to-detect, or operational efficiency
  • Experience working across cross-functional teams (engineering, design, marketing, sales) in a fast-paced environment
  • Excellent written and verbal communication skills, with the ability to translate complex technical concepts for diverse audiences
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)


Compensation

$130,000 - $165,000
Final compensation will vary depending on years of experience, background/skill set, location, and applicable laws.
