Quadric.io

Senior Product Manager, Software & Developer Platform

Quadric.io, $200K - $250K
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years in product management, focusing on developer-facing AI/ML or compute platforms.
  • Experience in setting and owning software release cadence.
  • Proven track record of shipping products to real users.
  • Expertise in AI workloads and quantization techniques.
  • Daily use of agentic AI tools to produce work. Having read about agentic AI without hands-on use is not sufficient.
  • Customer-oriented approach, prioritizing workable solutions over elegant designs.

Responsibilities

  • Own the monthly SDK release and quarterly major updates, including customer communications.
  • Decide the sequence of graph patterns for the compiler based on customer needs.
  • Lead the market-driven demo strategy and manage publication timelines.
  • Engage directly with customers during technical reviews to align on the roadmap.
  • Coordinate the quantization roadmap with the hardware PM and customer software teams ahead of silicon tape-out.
  • Define the integration strategy for popular frameworks and runtimes, choosing between deep partnership and thin reference integration.
  • Maintain a customer-confidence model repository to ensure compiler completeness.

Benefits

  • Medical, dental, and vision plans from day one.
  • Flexible paid time off (unlimited, non-accrual).
  • Company-provided lunches and a stocked kitchen for on-site employees.
  • 401(k) retirement plan options available.
  • Convenient downtown Burlingame office location, close to Caltrain.

Full Job Description

Quadric is seeking a Senior Principal Product Manager to own the software roadmap for the Chimera Graph Compiler (CGC) - the developer-facing platform customers live in from pre-silicon through production. This role drives the monthly SDK release train, sets pattern coverage and quantization strategy, and works directly with anchor customers to convert model gaps into engineering roadmap entries. You'll partner with the CPO on strategy and with SW engineering on execution.

This role is based in Burlingame (on-site), with quarterly travel to Japan, U.S. East Coast, and customer SW teams worldwide.

Responsibilities
  • Software release train. Own the monthly SDK release and quarterly major: contents, release sync, go/no-go, release notes, and customer communications.
  • Pattern coverage roadmap. Decide which graph patterns CGC compiles next - attention variants, quantization schemes, normalization patterns - and sequence them against customer model requirements each quarter.
  • Market-driven demo strategy. Lead with the market story and push it through every layer: demo, model zoo, pattern coverage, compiler work. Own what we publish and when.
  • Customer engagement. Present in technical reviews with anchor customers. Convert model gap lists into engineering-ready roadmap entries.
  • Quantization and numerics. Own the roadmap for INT4 (W4A8/W4A16), FP8, OCP MX, and KV cache compression. Coordinate with HW PM on MAC capability and with customer SW teams on model format decisions ahead of silicon tape-out.
  • Framework and runtime integrations. Define the integration strategy for GGML/llama.cpp, vLLM, ONNX Runtime, ExecuTorch, and HF Optimum - deep partnership vs. thin reference.
  • Model zoo. Maintain a set of customer-confidence models (LLM chat, BEVFormer, VLA, ADAS perception) that serve as forcing functions for compiler completeness.
  • Quarterly roadmap tours. Take the roadmap to anchor customers, prospects, and the field. Brief the PMM monthly on what shipped and how to position it.
  • Competitive intelligence. Track Synopsys MetaWare, Arm KleidiAI, Ceva NeuPro Studio, and NVIDIA TensorRT-LLM. Brief exec and sales quarterly.
  • Safety and quality. Coordinate with the safety lead on ISO 26262 traceability and qualification artifacts.

Requirements
  • Domain - non-negotiable. Shipped product on at least one of: NPU or AI accelerator IP/silicon stack; graph or ML compiler (TVM, MLIR, XLA, or proprietary); developer-facing AI inference runtime or agent framework. "Adjacent" does not count.
  • Modern AI workload fluency - non-negotiable. Ready conversation, no prep, on: agentic workflows and LLM serving, KV cache optimization, quantization schemes (AWQ, GPTQ, SmoothQuant, QAT vs. PTQ), datatypes (INT4, FP8, BF16, OCP MX), and inference platforms (vLLM, llama.cpp, TensorRT-LLM, ExecuTorch, ORT).
  • Shipping bar - non-negotiable. You shipped a developer-facing AI or compute product - SDK, runtime, compiler, or inference service - with real users and a release cadence you owned.
  • Agent-pilled - non-negotiable. You use agentic AI tools daily (Claude Code, Cursor, or equivalent) to produce work. Having read about agentic AI without integrating it is not sufficient.
  • Customer first - non-negotiable. When engineering wants to build the elegant thing and the customer needs the workable thing, you take the workable thing every time.
  • 5+ years in PM, with 3+ years on a developer-facing AI/ML or compute platform
  • Owned a release cadence - picked what ships, what slips, and defended the call
  • Experience with in-person technical customer reviews
  • Bay Area resident or willing to relocate to Burlingame


Preferred
  • ML background: graduate degree, published work, trained model, or OSS contribution
  • Automotive Tier 1 engagement; ISO 26262 awareness
  • Prior product work on a competing NPU/GPU/AI accelerator stack (MetaWare, Arm ML, Ceva, TensorRT, Hailo, Tenstorrent, etc.)
  • OSS contributions to vLLM, llama.cpp, TVM, MLIR, ONNX Runtime, or ExecuTorch

Benefits

  • Competitive salary and meaningful equity
  • Medical, dental, and vision plan options starting on day one
  • 401(k) retirement plan
  • Flexible paid time off (unlimited, non-accrual)
  • Company-provided lunches and a stocked kitchen
  • Monthly parking or Caltrain pass
  • Downtown Burlingame office, walking distance from Caltrain

The base salary range for this position is $200,000 to $250,000. This range reflects the full span of levels and geographies at which Quadric hires for this role. The actual base salary offered will depend on a number of factors, including the specific level of the role, years and depth of relevant experience, technical skills and competencies, the criticality of the role to the business, internal equity, and work location. In addition to base salary, this role is eligible for equity and a discretionary annual performance bonus as applicable to the role and level.

Founded in 2016 and based in downtown Burlingame, California, Quadric is building the world's first supercomputer designed for the real-time needs of edge devices. The company was co-founded by Veerbhan Kheterpal (CEO) and Nigel Drego (CTO).

About Quadric.io

Quadric.io is a computer hardware company that specializes in developing high-performance processors for artificial intelligence applications. The company was founded in 2016 and is based in Burlingame, California. Quadric.io's processors are designed to be highly efficient and scalable, making them ideal for use in data centers and other large-scale computing environments. The company has received significant investment from venture capital firms and has attracted attention for its innovative approach to processor design.
Size: 50 employees
