Software Engineer, Infrastructure

Exa

$130K - $180K *

San Francisco, CA 94112

In-Person

Information Technology

Less than 5 years of experience

Today

Job Overview by Ladders

Qualifications

Experience with large-scale infrastructure design and operation
Knowledge of GPU clusters, Kubernetes, or cloud batch job systems
Strong focus on reliability, observability, and optimization
Experience with production systems at scale
Ability to work in a fast-paced engineering environment

Responsibilities

Build Kubernetes orchestration for a $20M GPU cluster
Scale the AWS batch job system for large map-reduce jobs
Design software for optimal GPU scheduling
Implement observability features in production systems
Collaborate with team members to enhance infrastructure capabilities

Benefits

Premium healthcare benefits (medical, dental, vision)
Fertility benefits
16 weeks of fully paid parental leave for all new parents
Monthly wellness stipend
Visa sponsorship available for international candidates

Full Job Description

Our Infrastructure Team builds the underlying tooling and infrastructure that powers all of Exa's systems. Basically, we need more infra engineers to build the machine that builds the machine, so that we can move as fast as possible as an engineering org. That could mean building GPU cluster orchestration in Kubernetes, map-reduce batch jobs on Ray, or the best observability tooling in the world.

Who You Are

You have experience designing and operating large-scale infrastructure: GPU clusters, large Kubernetes clusters, or cloud batch job systems.
You bring an obsessive mindset, always thinking about reliability, observability, and optimization across the entire stack.

What You'll Do

Build the Kubernetes orchestration on a $20M GPU cluster
Scale our AWS batch job system to handle map-reduce jobs over tens of thousands of machines
Design GPU scheduling software so we max out our cluster utilization
Build observability into our production systems

Logistics

Location: This is an in-person opportunity in San Francisco.
Visas: We're happy to sponsor international candidates (e.g., STEM OPT, OPT, H1B, O1, E3). While we cannot guarantee your visa, we have historically been successful in sponsoring candidates from all over the world. If you receive an offer, our team will work hard to get you a visa.
Benefits: We offer premium healthcare benefits (medical, dental, vision), fertility benefits, 16 weeks of fully paid parental leave for all new parents, and a monthly wellness stipend to all of our employees.

* Ladders Estimates
