Product Manager - AI Inference & Model Serving

Mirantis • $120K — $160K *

Austin, TX 78745In-Person

Consumer Technology

5 - 7 years of experience

1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

7+ years in product management or a senior technical role focused on AI/ML and inference products.
Strong knowledge of production AI inference concepts like model serving and serverless execution.
Ability to assess performance trade-offs across various infrastructure layers including GPU and network.
Experience with modern inference runtimes and their optimization techniques.
Comfort and credibility in discussions with engineering and production architecture teams.

Responsibilities

Own product strategy and roadmap for AI inference and model serving.
Lead technical discovery and convert insights into product requirements.
Collaborate with engineering on performance-oriented system design.
Define product positioning based on measurable performance outcomes.
Drive go-to-market strategies including pricing and customer engagement.

Benefits

Work with a leading Silicon Valley cloud infrastructure company.
Collaborate with passionate and talented colleagues on innovative projects.
Engage in cutting-edge, open-source technology advancements.
Experience a dynamic environment valuing collaboration and growth.
Access to professional development and training opportunities.
Opportunities to attend industry conferences and working groups.
Customized workstation options provided.

Full Job Description

Job Description

Job Summary

Mirantis is looking for a commercially driven, deeply technical Product Manager to own AI inference and model serving for k0rdent AI, our control plane for GPU infrastructure and distributed AI workloads. This role sits at the intersection of AI inference, cloud-native infrastructure, distributed systems, and performance engineering. You will define how NeoClouds and Enterprise customers deploy, scale, and operate production inference services while extracting maximum performance from the underlying GPU, network, and storage infrastructure.

This role owns product strategy and solution development for inference products across on-premises, cloud, and edge environments. The scope includes serverless inference, dedicated endpoints, workload placement, autoscaling, routing, lifecycle management, observability, and full-stack performance optimization. This person will define how customers run production model-serving workloads at scale while improving latency, throughput, utilization, reliability, cost, and operational control.

The ideal candidate has experience with high-performance infrastructure products and understands how production systems behave under real-world load. They should be comfortable reasoning across the full stack, identifying performance bottlenecks, evaluating system design trade-offs, and translating technical insight into clear product requirements, architecture direction, and customer-facing solutions.

Responsibilities

Own product strategy, roadmap, and lifecycle for inference and model serving, including serverless inference, dedicated endpoints, autoscaling, routing, KV cache management, and the related observability
Lead deep technical discovery with NeoClouds, sovereign clouds, and enterprise platform teams, and translate findings into prioritized requirements and architecture direction
Partner with engineering on system design trade-offs across runtime integration, GPU scheduling, network, storage, and serving topology, including disaggregated serving and multi-model serving
Define positioning grounded in measurable outcomes: latency distributions, throughput per GPU, utilization, tail reliability, and cost per tokens
Drive go-to-market execution: pricing and packaging, reference architectures, sizing guides, PoC playbooks, and direct engagement with customers, analysts, and ecosystem partners

Qualifications

7+ years in product management, technical product management, or a senior technical role owning AI/ML and inference product(s)
Strong understanding of production AI inference, including model serving, serverless execution, dedicated endpoints, autoscaling, routing, workload placement, observability, and reliability
Proven capability to reason about performance trade-offs across GPU, network, storage, orchestration, and runtime layers, and to translate low-level technical capability into business value such as TTFT, throughput per GPU, and TCO
Working knowledge of modern inference runtimes (vLLM, SGLang, TensorRT-LLM, Dynamo, Triton) and the optimization patterns that matter in production: continuous batching, KV cache management, cold starts, prefill versus decode, disaggregated serving, and multi-model serving
Credibility with engineering leaders and infrastructure operators, including comfort in production architecture reviews and technical commercial conversations with platform engineering buyers

Why you'll love Mirantis

Build the token factory foundation for the AI cloud era, working directly with leading GPU cloud operators, NeoClouds, sovereign clouds, and AI-first enterprises
Collaborate with a world-class, distributed team committed to openness and technical excellence
Shape the product narrative and influence go-to-market success

Additional Information

What does Mirantis offer you?

Work with an established Silicon Valley leader in the cloud infrastructure industry.
Work with exceptionally passionate, talented and engaging colleagues, helping Fortune 500 and Global 2000 customers implement next-generation cloud technologies.
Be a part of cutting-edge, open-source innovation.
Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued.
Professional development and training.
Attend conferences and working groups.
Customized workstation (macOS, Windows).
A competitive compensation package with strong benefits plan and stock options.

#remote

We are a Leader for Container Management in G2 (#2 after AWS)!

About Mirantis

Mirantis is a software company that provides cloud computing services and solutions. The company was founded in 2011 and is headquartered in Sunnyvale, California. Mirantis offers a range of cloud computing services, including OpenStack, Kubernetes, and Docker. The company's solutions are used by a variety of industries, including telecommunications, finance, and healthcare. Mirantis has over 1,000 employees and offices in the United States, Russia, Ukraine, and the United Kingdom.

Learn more about Mirantis

Size

1,000 employees

Industry

Enterprise Technology

Founded

2011

* Ladders Estimates

Similar Jobs

AI Experience Manager
$100K — $130K *
First United Bank
Plano, TX 75025 (Collin County)
Reposted Today
Lead Associate, AI Platform Product Management
$141K — $184K *
Fannie Mae
Plano, TX 75025 (Collin County)
Today
Director, AI Transformation
$120K — $180K *
Human Agency
Austin, TX 78745 (Travis County)
Today
Sr. Manager, AI Platform Enablement (Remote)
$145K — $220K *
CrowdStrike Holdings, Inc.
Remote
Today
Senior AI Solution Manager
$120K — $150K *
CohnReznick
Austin, TX 78745 (Travis County)
Today
Product Manager/Owner AI
$100K — $130K *
Sedgwick
Remote
Reposted Yesterday

Get Ready For Your
Next Interview

More Jobs at Mirantis

Senior AI Infrastructure & Platform Operations Engineer (remote in the US)
$120K — $160K *
Remote
3 days ago
Enterprise Technology
Remote in United States
Technical Product Marketer, k0rdent AI - remote in the US
$90K — $130K *
Remote
3 days ago
Enterprise Technology
Remote in United States
AI Infrastructure & Platform Operations Engineer (remote in the US)
$100K — $140K *
Remote
3 days ago
Information Technology
Remote in United States
Technical Product Manager, AI Cloud Networking
$120K — $160K *
Remote
6 days ago
Enterprise Technology
Remote in Austin, TX
Technical Product Manager, AI Cloud Networking
$120K — $160K *
Austin, TX 78745 (Travis County)
6 days ago
Information Technology
In-Person

More Consumer Technology Jobs

Performance Growth Marketer
$100K — $200K *
Aria InsurTech
Hollywood, FL 33027 (Broward County)
Reposted 2 weeks ago
Senior Associate Brand Manager
$116K — $143K *
Kimberly-Clark Corporation
Chicago, IL 60629 (Cook County)
Reposted Today
Senior Innovation Manager
$105K — $115K *
Rubicon Organics, Inc.
Delta, BC V4G 0A1
Today
iOS Engineer II
$114K — $132K *
FOX News Network, LLC
New York, NY 10025 (New York County)
Reposted Today
Staff Product Designer
$140K — $190K *
Headspace
San Francisco, CA 94112 (San Francisco County)
Today

Find similar Product Manager - AI Inference & Model Serving jobs:

Nationwide Austin, TX

Product Manager - AI Inference & Model Serving

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Product Manager - AI Inference & Model Serving jobs:

Get Ready For Your
Next Interview