Software Engineer (AI Infrastructure / Training / Inference)

SpreeAI

• $120K — $160K *

San Francisco, CA 94112 (Hybrid)

Information Technology

Less than 5 years of experience

2 months ago

Job Overview by Ladders

Qualifications

Degree in Computer Science, Engineering, or relevant experience.
Strong object-oriented programming skills in languages like Python, C++, Java, or Go.
Solid understanding of data structures and algorithms.
Experience in developing production-grade backend or distributed systems.
Familiarity with cloud infrastructure and containerization techniques.

Responsibilities

Design and build scalable infrastructure for training and inference workflows.
Develop high-performance APIs for AI model serving.
Optimize GPU utilization for multimodal workloads.
Build distributed systems for large-scale generative models.
Enhance the observability and reliability of AI systems.
Collaborate with Applied Science teams to implement research systems effectively.
Drive improvements in deployment workflows and platform automation.

Benefits

Opportunity to work at the intersection of systems engineering and AI.
Access to cutting-edge technologies in AI infrastructure development.
Collaborative environment with applied scientists to influence AI research outcomes.
Focus on performance optimization and cost efficiency in large-scale systems.

Full Job Description

About the Role

We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable frontier multimodal AI to operate reliably at production scale. This role exists because modern generative and vision models require infrastructure beyond traditional backend engineering, including GPU orchestration, large-scale inference systems, performance optimization, and developer platforms that allow applied scientists to move fast without sacrificing reliability or cost efficiency.

You will work on:

Scalable model serving and inference pipelines.
Distributed GPU infrastructure.
Performance and cost optimization.
Reliability, observability, and production readiness.

You will operate at the boundary between systems engineering and machine learning, building the "paved roads" that allow advanced AI systems to scale safely and efficiently.

What you'll do

Design and build scalable infrastructure supporting training and inference workflows.
Develop high-performance APIs and backend services for AI model serving.
Optimize GPU utilization, latency, and throughput for multimodal workloads.
Build distributed systems supporting large-scale generative models.
Improve observability, monitoring, and reliability of AI systems.
Partner closely with Applied Science teams to productionize research systems.
Drive improvements in deployment workflows, automation, and platform usability.

Qualifications

Degree in Computer Science, Engineering, or a comparable combination of education and practical experience.
Strong object-oriented programming skills (Python, C++, Java, Go, or similar).
Strong data structures and algorithms foundations.
Experience building production backend or distributed systems.
Understanding of cloud infrastructure concepts and containerized systems.

Preferred Qualifications

Experience with Kubernetes, Docker, or container orchestration.
Familiarity with GPU-based ML workloads or distributed training/inference systems.
Experience with model serving frameworks (vLLM, Triton, Ray Serve, or similar).
Experience with observability tools and performance debugging.
Familiarity with PyTorch or ML workflows.
Interest in optimizing systems for efficiency, scalability, and developer velocity.

* Ladders Estimates
