Inference Infrastructure Engineer

Rhoda AI

• $120K — $160K *

Mountain View, CA 94040In-Person

Information Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

3+ years of experience in ML infrastructure, MLOps, or distributed systems
Strong proficiency with Kubernetes and containerized deployment tools
Experience in GPU orchestration and resource scheduling
Knowledge of cloud providers like AWS or GCP and hybrid cloud infrastructures
Familiarity with ML frameworks such as PyTorch and model serving tools like Triton
Strong problem-solving skills and ownership mentality

Responsibilities

Design and operate scalable infrastructure for model workloads
Build and maintain Kubernetes-based deployment pipelines
Manage resource scheduling and orchestration across GPU clusters
Integrate ML frameworks and model serving systems for different use cases
Develop tools for model deployment, versioning, and monitoring
Enhance the reliability and scalability of the infrastructure stack

Benefits

Direct impact on robotics through infrastructure work
Opportunity to work with a highly ambitious technical team
Possibility to shape future deployments and optimizations

Full Job Description

Were looking for an Inference Infrastructure Engineer to help build and operate the systems that power our model deployment stack. Youll be responsible for running large foundation models efficiently and reliably across cloud and on-prem environments, with a focus on resource management, scheduling, and infrastructure scalability.

What Youll Do

Design and operate large-scale infrastructure to run model workloads across cloud and on-prem environments
Build and maintain Kubernetes-based deployment pipelines for managing distributed ML workloads
Own resource scheduling and orchestration across GPU clusters - optimizing utilization, workload balancing, and cost-performance tradeoffs
Integrate and manage ML frameworks and model serving systems (e.g., Triton, Ray Serve, TorchServe) across research and production use cases
Build tooling for model deployment, versioning, and observability to support fast iteration cycles
Contribute to the reliability and scalability of the infrastructure stack as model complexity and deployment footprint grow

What Were Looking For

3+ years of experience in ML infrastructure, MLOps, or distributed systems
Strong proficiency with Kubernetes and containerized deployment pipelines
Experience with GPU orchestration and resource scheduling across large distributed jobs
Experience with cloud providers (e.g., AWS, GCP) and hybrid cloud/on-prem infrastructure
Familiarity with ML frameworks (e.g., PyTorch, JAX) and model serving tools (e.g., Triton, Ray Serve, TorchServe)
Strong debugging instincts and ownership mentality - comfortable driving issues to resolution across the stack

Nice to Have (But Not Required)

Experience with streaming systems or high-throughput data transport (e.g., Kafka, gRPC, NATS)
Background in networking, low-latency systems, or network-aware scheduling
Experience with edge/cloud hybrid deployment patterns and the latency constraints that come with them
Familiarity with on-robot or embedded inference environments
Experience with large-scale cluster topology and scheduling systems (e.g., SLURM, Ray, Volcano)

Why This Role

Own the infrastructure layer that connects our foundation models to real robot behavior - a direct line between your work and what the robot does in the world
Be part of building the infrastructure stack for one of the most technically ambitious robotics companies in the world

* Ladders Estimates

Similar Jobs

Camera Architect
$130K — $180K *
Apple
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Today
Staff Software Engineer, Platform
$120K — $160K *
Ditto
Remote
Today
Business Systems Analyst I/II/III
$85K — $128K *
Santa Clara Family Health Plan
San Jose, CA 95123 (Santa Clara County)
Reposted Today
Sr. Project Engineer, SDS
$100K — $130K *
Fujifilm Manufacturing USA, Inc
Remote
Reposted Today
Quantum Topological Qubits Research Scientist
$143K — $275K *
GlobalFoundries
Santa Clara, CA 95051 (Santa Clara County)
Today
Camera Architect
$130K — $180K *
Apple
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Yesterday

Get Ready For Your
Next Interview

More Jobs at Rhoda AI

Inference Infrastructure Engineer
$120K — $160K *
Mountain View, CA 94040 (Santa Clara County)
Today
Information Technology
In-Person
Robotics Software Test Engineer
$120K — $150K *
Mountain View, CA 94040 (Santa Clara County)
Today
Technical Services
In-Person
Product Manager, Rhoda Platform
$120K — $160K *
Mountain View, CA 94040 (Santa Clara County)
4 days ago
Telecommunications & Hardware
In-Person
Inference Optimization ML Engineer
$130K — $180K *
Mountain View, CA 94040 (Santa Clara County)
5 days ago
Information Technology
In-Person
Research Scientist / Engineer - Efficient Modeling
$120K — $160K *
Mountain View, CA 94040 (Santa Clara County)
5 days ago
Enterprise Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
4 days ago
Cybersecurity Engineer
$100K — $180K *
The National Renewable Energy Laboratory (NREL)
Remote
Today
Software Developer - Ruby on Rails - Professional III
$83K — $150K *
The National Renewable Energy Laboratory (NREL)
Remote
Today
Staff Software Engineer, AI Native
$200K — $240K *
PlayStation
Remote
Today
Sr. Software Engineer, AI Native
$180K — $205K *
PlayStation
New York, NY 10002 (New York County)
Today

Find similar Inference Infrastructure Engineer jobs:

Nationwide Mountain View, CA

Inference Infrastructure Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Inference Infrastructure Engineer jobs:

Get Ready For Your
Next Interview