Software Engineer, Inference

Pulse Software Corp

$120K — $180K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years in performance engineering or ML systems
  • Strong Python programming skills
  • Exposure to C++ or CUDA
  • Experience with GPU profiling
  • Familiarity with model serving techniques

Responsibilities

  • Build inference services with smart batching and caching
  • Optimize kernels, tokenization, and model graphs
  • Evaluate tradeoffs of vLLM, TensorRT LLM, and Triton
  • Implement autoscaling and admission control with clear SLOs
  • Own performance dashboards and manage capacity planning

Benefits

  • Competitive base salary plus equity
  • Performance-based bonus
  • Relocation assistance for Bay Area moves
  • Daily meal stipend
  • Medical, vision, and dental coverage
Full Job Description
About the RoleSpecialize in low-latency, high-throughput inference for OCR and multimodal models. Own profiling, batching, and autoscaling across single-tenant and multi-tenant environments.

Responsibilities
  • Build inference services with smart batching and caching
  • Optimize kernels, tokenization, and model graphs
  • Evaluate vLLM, TensorRT LLM, and Triton tradeoffs
  • Implement autoscaling and admission control with clear SLOs
  • Own performance dashboards and capacity planning

Requirements
  • 3+ years in performance engineering or ML systems
  • Strong Python, plus C++ or CUDA exposure
  • Experience with GPU profiling and model serving

Nice to have
  • Experience reducing p95 and cost in production ML systems

SponsorshipSponsorship available.

Compensation and benefitsCompetitive base salary plus equity, performance-based bonus, relocation assistance for Bay Area moves, daily meal stipend, medical, vision, and dental coverage.

Similar Jobs

More Jobs at Pulse Software Corp

  • Account Executive
    $90K — $130K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person
  • Software Engineer, Inference
    $120K — $180K *
    San Francisco, CA 94112 (San Francisco County)
    Enterprise Technology
    In-Person
  • Solutions Engineer
    $120K — $150K *
    San Francisco, CA 94112 (San Francisco County)
    Information Technology
    In-Person
  • Software Engineer
    $120K — $160K *
    San Francisco, CA 94112 (San Francisco County)
    Information Technology
    In-Person
  • Design Engineer
    $120K — $150K *
    San Francisco, CA 94112 (San Francisco County)
    Consumer Technology
    In-Person

More Enterprise Technology Jobs

Find similar Software Engineer, Inference jobs: