Scale AI

Technical Program Manager, Platform

Scale AI
$211K - $264K *
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years as a TPM, Product Manager, or Software Engineer with experience delivering technical products/platforms from scratch.
  • 3+ years managing core engineering infrastructure, cloud-native ecosystems, or distributed systems.
  • Foundational knowledge of AI/ML infrastructure for Generative AI workflows.
  • Proven ability to communicate technical complexities to executive stakeholders.
  • Proficiency in iterative development methodologies and project management tools.

Responsibilities

  • Lead strategic planning and execution for core capabilities of the Scale Generative AI Platform.
  • Drive execution and manage technical dependencies across cross-functional teams.
  • Translate infrastructure metrics into actionable roadmaps for platform features.
  • Identify and mitigate technical risks in large-scale AI deployments.
  • Establish processes to enhance developer productivity while maintaining system integrity.
  • Track and report on system metrics and adoption to executive leadership.

Benefits

  • Comprehensive health, dental, and vision coverage.
  • Retirement benefits.
  • Learning and development stipend.
  • Generous PTO.
  • Potential commuter stipend.

Full Job Description
As a Technical Program Manager for the Platform team, you will partner with engineering teams to directly accelerate the development and maturity of the Scale Generative AI Platform (SGP). We are looking for a TPM who has actively built and shipped products in the past and understands how to deliver robust, scalable developer tooling and distributed systems.

In this role, you will own the strategic alignment and end-to-end execution of our most critical infrastructure initiatives, from initial scoping to measurable company-wide and customer-ready adoption. You will serve as the core communication backbone and connective tissue between platform engineering, product teams, and executive leadership. Operating in a hyper-growth, demanding AI environment, you will translate SGP's architectural complexities into clear execution strategies, unblock engineering bottlenecks, proactively mitigate deployment risks, and ensure our foundational platforms deliver reliable, performant, and secure systems capable of global-scale deployment.
Key Responsibilities
  • Lifecycle & Platform Delivery: Lead strategic planning and high-velocity execution for SGP core capabilities (orchestration layers, model serving, APIs). Manage features from technical scoping and architecture design through production launch.
  • Cross-Functional GenAI Alignment: Drive execution and manage complex technical dependencies across systems engineering, Core ML, Research, and Product teams to deliver unified SGP capabilities with architectural consistency.
  • Technical Translation & Requirements: Translate complex infrastructure metrics (LLM inference optimization, GPU utilization, compute orchestration) into actionable roadmaps. Map demands like multi-tenancy, data privacy, and isolation into platform features.
  • Risk & Dependency Mitigation: Proactively identify, track, and mitigate technical risks unique to massive-scale GenAI infrastructure and global SGP deployments, maintaining momentum despite fast-evolving AI frameworks.
  • Developer Velocity & Operational Excellence: Establish lightweight agile processes that empower engineers to ship fast without breaking core systems. Define and enforce clear SLOs and performance benchmarks to guarantee production-grade reliability for clients.
  • Metrics-Driven Adoption: Track and report on SGP adoption metrics, system reliability, delivery forecasts, and engineering bottlenecks directly to executive leadership to ensure the platform scales responsibly.
Minimum Qualifications
  • 5+ years of experience as a Technical Program Manager, Product Manager, or Software Engineer, with a proven track record of having built and shipped technical products or platforms from scratch (e.g., internal cloud infrastructure, developer APIs, distributed systems, or ML platforms).
  • Platform Domain Expertise: 3+ years of dedicated experience managing programs focused directly on core engineering infrastructure, cloud-native ecosystems (AWS/GCP), container orchestration (Kubernetes), or distributed systems.
  • AI/ML Infrastructure Literacy: Foundational understanding of the infrastructure required for the Generative AI lifecycle, including high-throughput data pipelines, GPU/CPU cluster utilization, or model training/evaluation setups.
  • Masterful Communication: Proven track record of presenting to and influencing executive-level stakeholders, with the ability to translate complex technical/architectural challenges into clear business impacts.
  • Execution Excellence: Advanced proficiency with iterative development methodologies and modern project management tooling (Linear, Jira, etc.) applied to foundational infrastructure environments.
Nice-to-Have Qualifications
  • Engineering Roots: Strong software engineering fundamentals, with prior professional experience as a Software Engineer, DevOps Engineer, or Data Developer before transitioning into program management.
  • Platform Adoption Track Record: Proven success driving the internal adoption of technical platforms, SDKs, or APIs across disparate, fast-moving product lines.
  • Data-Centric AI Familiarity: Direct experience working with large-scale data quality pipelines, distributed vector databases, or specialized AI inference engines (e.g., Triton, Ray).


Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Directors approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for an equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental, and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.

Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in San Francisco, New York, and Seattle is:

$211,200-$264,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Scale AI

Scale AI is an artificial intelligence company that provides data annotation services to improve machine learning algorithms. The company's platform offers a range of services including image annotation, text annotation, and 3D annotation. Scale AI was founded in 2016 and is headquartered in San Francisco, California.
Size
500 employees
Industry
Founded
2016
