Software Engineer, ML Serving - Rime Ai

Unusual Ventures • $130K — $180K *

San Francisco, CA 94112 • In-Person

Enterprise Technology

Less than 5 years of experience

Today

Job Overview by Ladders

Qualifications

5-7 years of hands-on experience in real-time multinode ML serving infrastructure.
Proficiency with ML serving frameworks such as NVIDIA Dynamo/Triton, or similar.
Solid understanding of distributed model serving methodologies.
Strong fundamentals in cloud infrastructure: Linux, networking, Docker, Kubernetes.
Experience with Infrastructure as Code (IaC) tools like Terraform or Packer.

Responsibilities

Architect and implement TTS serving infrastructure connecting inference engines to APIs.
Optimize models for serving that ranges from a single node to a disaggregated fleet.
Ensure compatibility with various NVIDIA hardware for both cloud and on-prem deployments.
Develop and oversee CI/CD workflows for the model serving pipeline.
Manage site reliability tasks, including monitoring and on-call duties.

Benefits

Opportunity to shape the serving infrastructure for a leading voice AI company.
Ability to set vision and direction based on your unique experience.
Collaborate directly with inference, platform, and ML teams without handoffs.
Influence which experiences customers can deploy at scale.
Equity opportunities at an early-stage company.
High ownership and standards, with minimal bureaucracy.
Located in the dynamic SF/Bay Area.

Full Job Description

Role Overview

We're hiring a Software Engineer to own the serving infrastructure that connects Rime's inference engines to the world. This role sits at the intersection of ML systems and cloud infrastructure - you'll work directly on model inference and cloud infrastructure to build, harden, and scale the systems that stream voice at real-time latency. As Rime moves toward its next-generation architecture, you'll be a core architect of how our models get served.

What You'll Own

Architecture and implementation of Rime's TTS serving infrastructure, from GPU-backed inference engines to the API surface.
Model optimization from single-node to disaggregated fleet serving.
Compatibility with different NVIDIA hardware generations, from Hopper to Blackwell and beyond, for on-prem and cloud deployments.
Continuous integration and deployment workflows for the model serving pipeline.
Site reliability: on-call rotation, monitoring, alerting, and observability across the serving stack.
Resource provisioning and cost management across our GPU fleet.

What We're Looking For

Hands-on experience with real-time multinode ML serving infrastructure.
ML serving framework experience: NVIDIA Dynamo/Triton, vLLM, SGLang, or equivalent.
Experience with distributed or disaggregated model serving (Tensor Parallel, Pipeline Parallel, or equivalent).
Strong cloud infrastructure fundamentals: Linux internals, networking, containerization (Docker, Kubernetes).
IaC experience - Terraform, Packer, or comparable. You should have opinions about how to do this right.
On-call is part of the job. You treat production reliability as a shared responsibility.

Nice to Have

Experience with multinode training (DDP, FSDP, etc.).
Experience with gRPC or other bidirectional binary streaming protocols.
Experience with audio streaming and related technologies (WebRTC, WebSockets, etc.).
Experience with a multi-language monorepo where you pick the best programming language on merit rather than personal familiarity.
Experience with multi-cloud infrastructures (AWS, GCP, OCI, etc.).
Comfort with configuration management tooling (Ansible, Chef, Puppet, or similar).
SRE, DevOps, or platform engineering background at a startup.
Experience at an early-stage company.

Why Join Rime

Build the serving infrastructure behind a category-defining voice AI company from the ground up.
You will bring in experience that no one else currently has at the company: you can help us set the vision.
Direct collaboration with the inference, platform, and ML teams - no handoff culture.
The systems you build determine what experiences our customers can deploy at scale.
Meaningful equity upside at an early stage.
High ownership, high standards, low bureaucracy.
SF / Bay Area.

At Rime, we...

Are outliers
Cut through the hype to focus on the craft
Move fast with agency and freedom
Maintain a growth mindset, finding joy in the struggle
Do the right things, knowing that it'll lead to making money

If that sounds like you too, you'll be a great fit for Rime!

About Unusual Ventures

Unusual Ventures is a venture capital firm that invests in early-stage startups. The firm was founded in 2018 by John Vrionis and Jyoti Bansal. Unusual Ventures focuses on investing in companies that are using technology to disrupt traditional industries. The firm has invested in companies such as Freenome, Figma, and Guild Education.

Size

10 employees

Industry

Business Services

Founded

2018

* Ladders Estimates
