Applications Engineer (GPU-Accelerated)

Alembic

• $120K — $180K *

San Francisco, CA 94112In-Person

Information Technology

Less than 5 years of experience

More than 3 months ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

4-7 years of software engineering experience with Python and C++
Hands-on experience with GPU programming (CUDA, Triton, Numba)
Familiarity with Python data stack (Pandas, NumPy, PyArrow)
Ability to create high-performance, memory-efficient C++ code
Experience collaborating with cross-functional teams

Responsibilities

Translate ML research into reliable, performant software
Optimize GPU-accelerated workloads using CUDA and related frameworks
Develop core libraries for hybrid Python/C++ environments
Create modular infrastructure for scalable ML deployment
Build APIs to integrate ML components into the platform
Implement performance tracking tools for models

Benefits

A key role in developing GPU-accelerated AI software
Collaboration with leading ML scientists and engineers
Opportunity to influence infrastructure for enterprise decision-making
A culture emphasizing technical excellence and impact

Full Job Description

About Alembic

Alembic is pioneering a revolution in marketing, proving the true ROI of marketing activities. The Alembic Marketing Intelligence Platform applies sophisticated algorithms and AI models to finally solve this long-standing problem. When you join the Alembic team, you'll help build the tools that provide unprecedented visibility into how marketing drives revenue, helping a growing list of Fortune 500 companies make more confident, data-driven decisions.

About the Role

We're looking for a Machine Learning Applications Engineer with GPU, Python, and C++ expertise to help productionize cutting-edge causal AI models. You'll work closely with ML scientists to turn experimental research code into optimized, scalable, and well-structured software that powers Alembic's real-time analytics and inference systems.

This is a hands-on, performance-focused role where you'll operate at the intersection of applied ML, systems engineering, and high-performance computing.

Key Responsibilities

Translate early-stage ML research and prototypes into reliable, testable, and performant software components
Use CUDA, Triton, and Numba to optimize GPU-accelerated workloads for inference and preprocessing
Contribute to core libraries and performance-critical routines using modern C++ in hybrid Python/C++ environments
Develop modular, reusable infrastructure that supports deployment of ML workloads at scale
Collaborate with data scientists and engineers to optimize data structures, memory usage, and execution paths
Build interfaces and APIs to integrate ML components into Alembic's broader platform
Implement logging, profiling, and observability tools to track performance and model behavior

Must-Have Qualifications

4-7 years of software engineering experience, including substantial time in Python and C++
Hands-on experience with GPU programming, including CUDA, Triton, Numba, or related frameworks
Strong familiarity with the Python data stack (Pandas, NumPy, PyArrow) and low-level performance tuning
Experience writing high-performance, memory-efficient code in C++
Demonstrated ability to work cross-functionally with researchers, platform engineers, and product teams
Comfort transforming research-grade ML code into maintainable, production-grade software

Nice-to-Have

Experience with hybrid Python/C++ or Python/CUDA extension development (e.g., Pybind11, Cython, custom ops)
Familiarity with ML serving or inference tools (e.g., TorchServe, ONNX Runtime, Triton Inference Server)
Exposure to structured data modeling, causal inference, or large-scale statistical computation
Background in distributed systems or parallel processing is a plus

What You'll Get

A pivotal role building GPU-accelerated software at the heart of a real-world AI product
Collaboration with an elite team of ML scientists, engineers, and product leaders
The opportunity to shape performance-critical infrastructure powering enterprise decision-making
A culture rooted in technical rigor, curiosity, and product impact

* Ladders Estimates

Similar Jobs

Senior Manager, Quoting
$120K — $125K *
Owens & Minor
Remote
Today
Sr. Manager, Clinical Consultant
$140K — $150K *
Owens & Minor
Remote
Today
Senior Manager, Data Engineering
$140K — $155K *
Owens & Minor
Remote
Today
Project Manager, Construction (Data Center)
$90K — $130K *
Clark Construction
Reno, NV 89502 (Washoe County)
Reposted Today
Remote Commercial Underwriting Counsel
$97K — $130K *
First American
Remote
Today
Sr. Underwriting Counsel (Remote/Hybrid)
$129K — $172K *
First American
Remote
Today

Get Ready For Your
Next Interview

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Customer Support
Confidential Company
Austin, TX 78701 (Travis County)
2 weeks ago
Sr Assoc, Cyber Sec ThreatMgmt - Detection Engineer
$88K — $151K *
Northern Trust
Naperville, IL 60540 (Dupage County)
Today
Global Director – Vulnerability Management & Security Configuration
$164K — $288K *
Northern Trust
Chicago, IL 60629 (Cook County)
Today

Find similar Applications Engineer (GPU-Accelerated) jobs:

Nationwide San Francisco, CA

Applications Engineer (GPU-Accelerated)

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Applications Engineer (GPU-Accelerated) jobs:

Get Ready For Your
Next Interview