Software Engineer, Inference

Pulse Software Corp

• $120K — $180K *

San Francisco, CA 94112In-Person

Enterprise Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

3+ years in performance engineering or ML systems
Strong Python programming skills
Exposure to C++ or CUDA
Experience with GPU profiling
Familiarity with model serving techniques

Responsibilities

Build inference services with smart batching and caching
Optimize kernels, tokenization, and model graphs
Evaluate tradeoffs of vLLM, TensorRT LLM, and Triton
Implement autoscaling and admission control with clear SLOs
Own performance dashboards and manage capacity planning

Benefits

Competitive base salary plus equity
Performance-based bonus
Relocation assistance for Bay Area moves
Daily meal stipend
Medical, vision, and dental coverage

Full Job Description

About the RoleSpecialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and autoscaling across single-tenant and multi-tenant environments.

Responsibilities

Build inference services with smart batching and caching
Optimize kernels, tokenization, and model graphs
Evaluate vLLM, TensorRT LLM, and Triton tradeoffs
Implement autoscaling and admission control with clear SLOs
Own performance dashboards and capacity planning

Requirements

3+ years in performance engineering or ML systems
Strong Python, plus C++ or CUDA exposure
Experience with GPU profiling and model serving

Nice to have

Experience reducing p95 and cost in production ML systems

SponsorshipSponsorship available.

Compensation and benefitsCompetitive base salary plus equity, performance-based bonus, relocation assistance for Bay Area moves, daily meal stipend, medical, vision, and dental coverage.

* Ladders Estimates

Similar Jobs

AI Software Engineer - Remote
$120K — $150K *
Azumo
Remote
Reposted Today
Software Engineer, Applied AI
$120K — $180K *
OpenEvidence
San Francisco, CA 94112 (San Francisco County)
Today
Design Engineer
$100K — $150K *
Attention Engineering, Inc
San Francisco, CA 94112 (San Francisco County)
Today
AI Software Engineer
$90K — $130K *
Synergy Pet Group
Remote
Today
ML Perception Software Engineer
$125K — $222K *
Applied Intuition
Sunnyvale, CA 94087 (Santa Clara County)
Today
Senior AI Engineer
$130K — $180K *
Citizen Health
San Francisco, CA 94112 (San Francisco County)
Today

Get Ready For Your
Next Interview

More Jobs at Pulse Software Corp

Account Executive
$90K — $130K *
San Francisco, CA 94112 (San Francisco County)
Today
Enterprise Technology
In-Person
Software Engineer, Inference
$120K — $180K *
San Francisco, CA 94112 (San Francisco County)
Today
Enterprise Technology
In-Person
Solutions Engineer
$120K — $150K *
San Francisco, CA 94112 (San Francisco County)
Today
Information Technology
In-Person
Software Engineer
$120K — $160K *
San Francisco, CA 94112 (San Francisco County)
Today
Information Technology
In-Person
Design Engineer
$120K — $150K *
San Francisco, CA 94112 (San Francisco County)
Today
Consumer Technology
In-Person

More Enterprise Technology Jobs

Enterprise Account Executive, South East
$130K — $150K *
GitGuardian
Washington, DC 20011 (District Of Columbia County)
Reposted Today
Senior Technology Consultant
$111K — $211K *
Hewlett Packard Enterprise Development LP
Fort Collins, CO 80525 (Larimer County)
Reposted Today
Enterprise Account Executive, DEX Specialist - Remote, USA
$90K — $130K *
TeamViewer Germany GmbH
Chicago, IL 60629 (Cook County)
Reposted Today
Strategic Account Executive
$120K — $180K *
Valon Technologies, Inc
San Francisco, CA 94112 (San Francisco County)
Today
AI Engineer (Staff/Principal) New York
$100K — $300K *
Fifth Dimension AI
New York, NY 10025 (New York County)
Reposted Today

Find similar Software Engineer, Inference jobs:

Nationwide San Francisco, CA

Software Engineer, Inference

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Software Engineer, Inference jobs:

Get Ready For Your
Next Interview