Member of Technical Staff, Performance and Scale

Inferact

• $200K — $400K *

San Francisco, CA 94112In-Person

Information Technology

Less than 5 years of experience

6 days ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Bachelor's degree or equivalent experience in computer science, engineering, or a similar field.
Strong systems programming skills in Rust, Go, or C++.
Experience designing and building high-performance distributed systems at scale.
Understanding of network protocols and high-performance I/O.
Ability to debug complex distributed systems issues.
Preferred: Experience with ML serving infrastructure and disaggregated inference architecture.
Preferred: Familiarity with GPU programming models and memory hierarchies.

Responsibilities

Design and implement foundational layers for distributed systems enabling global inference at scale.
Build infrastructure that allows seamless deployment of frontier models.
Ensure minimal latency and maximum reliability in serving models across accelerators.
Absorb complexity into the infrastructure to simplify overall system use.
Work collaboratively to improve system reliability and performance.

Benefits

Generous health, dental, and vision benefits.
401(k) company match.
Flexible remote working options for exceptional candidates.

Full Job Description

About the Role

We're looking for an infrastructure engineer to build the distributed systems that power inference at global scale. You'll design and implement the foundational layers that enable vLLM to serve models across thousands of accelerators with minimal latency and maximum reliability. Tomorrow, deploying a frontier model at scale should be as straightforward as spinning up a serverless database. The complexity doesn't disappear as it gets absorbed into the infrastructure you're building.

Skills and Qualifications

Minimum qualifications:

Bachelor's degree or equivalent experience in computer science, engineering, or similar.
Strong systems programming skills in Rust, Go, or C++.
Experience designing and building high-performance distributed systems at scale.
Understanding of network protocols and high-performance I/O.
Ability to debug complex distributed systems issues.

Preferred qualifications:

Experience with ML serving infrastructure and disaggregated inference architecture.
Familiarity with GPU programming models and memory hierarchies.
Knowledge of GPU interconnects (NVLink, InfiniBand, RoCE) and their performance characteristics.
Track record of improving system reliability and performance at scale.

Bonus points if you have:

Prior experience in supporting large-scale model training or inference environments.

Logistics

Location: This role is based in San Francisco, California. Will consider remote in the US for exceptional candidates.
Compensation: Depending on background, skills, and experience, the expected annual salary range for this position is $200,000 - $400,000 USD + equity.
Visa sponsorship: We sponsor visas on a case-by-case basis.
Benefits: Inferact offers generous health, dental, and vision benefits as well as 401(k) company match.

* Ladders Estimates

Similar Jobs

Staff Cluster Infrastructure Engineer
$224K — $284K *
CloudKitchens
San Francisco, CA 94112 (San Francisco County)
Today
Principal Platform Engineer
$96K — $207K *
Fifth Third Bancorp
Remote
Reposted Today
Staff HPC Engineer
$214K — $268K *
Biohub
San Francisco, CA 94112 (San Francisco County)
Reposted Today
Senior AV Operational Safety Engineer (GPSSC)
$129K — $269K *
General Motors
San Francisco, CA 94112 (San Francisco County)
Yesterday
Sr. Principal Field Support Responsible Electronics Engineer
$135K — $500K+*
Northrop Grumman
Beale Afb, CA 95903 (Yuba County)
Yesterday
Senior Optomechanical Engineer
$175K — $267K *
LLNL
Livermore, CA 94550 (Alameda County)
Yesterday

Get Ready For Your
Next Interview

More Jobs at Inferact

Member of Technical Staff, AMD GPU Performance Engineering
$200K — $400K *
San Francisco, CA 94112 (San Francisco County)
Yesterday
Technical Services
In-Person
Member of Technical Staff, TPU & AMD GPU Performance Engineering
$200K — $400K *
San Francisco, CA 94112 (San Francisco County)
5 days ago
Information Technology
In-Person
Member of Technical Staff, Cloud Orchestration
$200K — $400K *
San Francisco, CA 94112 (San Francisco County)
6 days ago
Information Technology
In-Person
Member of Technical Staff, Kernel Engineering
$200K — $400K *
San Francisco, CA 94112 (San Francisco County)
6 days ago
Consumer Technology
In-Person
Member of Technical Staff, Cluster Administration
$200K — $400K *
San Francisco, CA 94112 (San Francisco County)
6 days ago
Information Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
1 week ago
AI Director - Machine Learning | North America | Canada | Europe | Fully Remote
$130K — $180K *
Escape Velocity Entertainment Inc
Remote
Today
Principal Software Architect (C#/SQL/Azure Services)
$182K — $251K *
loanDepot
Irvine, CA 92620 (Orange County)
Today
Solution Architect - Mulesoft
$150K — $230K *
Tata Consultancy Services
Dallas, TX 75217 (Dallas County)
Today
Senior GenAI Engineer - AI Enablement & Agentic Systems-1
$140K — $170K *
Samsung Electronics Co., Ltd.
New York, NY 10025 (New York County)
Reposted Today

Find similar Member of Technical Staff, Performance and Scale jobs:

Nationwide San Francisco, CA

Member of Technical Staff, Performance and Scale

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Member of Technical Staff, Performance and Scale jobs:

Get Ready For Your
Next Interview