Senior Software Development Engineer - LLM Inference Framework

Advanced Micro Devices, Inc • $130K — $180K *

Santa Clara, CA 95051In-Person

Enterprise Technology

Less than 5 years of experience

Reposted Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5-7 years of experience in Python development within Linux environments.
Strong background in GPU kernel development and LLM inference frameworks.
Expert in debugging, performance tuning, and test design.
Hands-on experience with frameworks like TensorFlow and PyTorch for deep learning integration.
Master's or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field.

Responsibilities

Enhance and optimize deep learning frameworks like PyTorch for AMD GPUs.
Design multi-GPU inference strategies to improve performance.
Analyze and improve training and inference performance with GPU library teams.
Engage with open-source maintainers to align code changes and ensure integration.
Optimize deep learning performance across multi-GPU and multi-node systems.
Leverage advanced compiler technologies for performance optimization.
Enhance the deep learning pipeline including integrating graph compilers.

Benefits

Comprehensive health coverage and wellness programs.
Flexible work arrangements to suit individual needs.
Professional development and career advancement opportunities.
Diverse and inclusive work environment that values collaboration.
Employee assistance and support services.

Full Job Description

THE ROLE:

As a senior member of the LLM inference framework team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPUs. You will work at the framework and runtime layer, driving performance, scalability, and reliability, enabling tensor parallelism, pipeline parallelism, expert parallelism (MoE), and single-node or multi-node inference at scale. Your work will directly power customer-facing deployments and benchmarking platforms (e.g., InferenceMax, MLPerf, strategic partners, and cloud providers) and will be upstreamed into open-source inference frameworks such as vLLM and SGLang to make AMD a first-class platform for LLM serving.

This role sits at the intersection of inference engines, distributed systems, and GPU runtime and kernel backends.

THE PERSON:

You are a systems-minded ML engineer who thinks in terms of throughput, latency, memory movement, and scheduling, not just model code.

You are comfortable reading and modifying large-scale inference frameworks, debugging performance across GPUs and nodes, and collaborating with kernel, compiler, and networking teams to close end-to-end performance gaps.

You enjoy working in open source and driving architecture-level improvements in inference platforms.

KEY RESPONSIBILITIES:

Inference Framework & Runtime

Architect and optimize distributed LLM inference runtimes based on in-house LLM engines or open-source stacks such as vLLM, SGLang, and llm-d

Design and improve TP / PP / EP (MoE) hybrid execution, including KV-cache management, attention dispatch, and token scheduling

Implement and optimize multi-node inference pipelines using RCCL, RDMA, and collective-based execution

Performance & Scalability

Drive throughput, latency, and memory efficiency across single-GPU and multi-GPU clusters

Optimize continuous batching, speculative decoding, KV-cache paging, prefix caching, and multi-turn serving

GPU & Backend Integration

Work with AMD GPU libraries (AITER, HIPBLAS-LT, RCCL, ROCm runtime) to ensure inference frameworks efficiently use FP8 / FP4 GEMM and FlashAttention / MLA

Collaborate with compiler teams (Triton, LLVM, ROCm) to unblock framework-level performance

Open Source & Customer Enablement

Upstream features and performance fixes into vLLM, SGLang, and llm-d

Enable customer PoCs and production deployments on AMD platforms

Build and maintain benchmark-grade inference pipelines

PREFERRED EXPERIENCE:

Inference Stack Knowledge

Hands-on understanding of vLLM, SGLang, or similar inference stacks

Experience with distributed inference scaling and a proven track record of contributing to upstream open-source projects

Deep Learning Integration

Strong experience integrating optimized GPU performance into machine-learning frameworks (e.g., PyTorch, TensorFlow) for high-throughput and scalable inference

Kernel & Inference Frameworks

Strong background in NVIDIA, AMD, or similar GPU architectures and kernel development

Software Engineering

Expertise in Python and preferably experience in C/C++, including debugging, performance tuning, and test design for large-scale systems

High-Performance Computing

Experience running large-scale workloads on heterogeneous GPU clusters, optimizing for efficiency and scalability

Compiler & Runtime Optimization

Understanding of compiler and runtime systems, including LLVM, ROCm, and GPU code generation

ACADEMIC CREDENTIALS:

Master's or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field.

#LI-JG1

Benefits offered are described: AMD benefits at a glance.

About Advanced Micro Devices, Inc

Advanced Micro Devices, Inc. Careers

Join the innovative forefront of technology with a career at Advanced Micro Devices, Inc. (AMD), a leader in semiconductor development. As part of our global team, you will contribute to an organization renowned for its dedication to innovation, leadership, and diversity in the tech industry.

Work You’ll Do

At AMD, we offer job opportunities that push the boundaries of what is possible. Our team is composed of professionals who lead the way in microprocessor and graphics technology, driving industry standards and innovation. With AMD, you will be part of a culture that values growth and professional development, ensuring that every team member has the opportunity to excel.

Transform Your Career

AMD is not just about advancing technology, but also about advancing careers. Whether you are looking for an internship, a full-time position, or leadership roles, AMD provides the platform to propel your career to new heights. Our commitment to professional growth is matched by our dedication to diversity and inclusion, making AMD a place where everyone can thrive.

Innovative Work Environment

Join a team of over 12,000 dedicated professionals at the intersection of technology, industry expertise, and digital innovation. At AMD, you will work on groundbreaking projects that shape the future of computing and graphics. Our collaborative environment encourages networking and the sharing of ideas across teams and disciplines.

Career Development and Benefits

AMD is committed to the development of its employees. We offer robust training programs, including leadership development and diversity training, to ensure our team is equipped for both current challenges and future opportunities. Our benefits package is designed to support the well-being and financial security of our employees and their families.

Explore Job Opportunities

From engineering to marketing, AMD offers a range of career paths that cater to diverse skills and interests. Our hiring process is designed to be transparent and engaging, helping you to understand where you fit within our team and how you can contribute to our collective goals.

Stay Connected

Join Our Team Search open positions that match your skills and interest. We look for passionate, curious, creative, and solution-driven team players. Explore the opportunities to join a company that’s committed to your career growth and to innovation in the technology sector.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who work here.

Job Alert Emails

Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding career opportunities that await at Advanced Micro Devices, Inc.

Interview and Resume Tips

Prepare for your future with AMD by accessing resources that help you craft your resume and excel in interviews. Our goal is to help you showcase your best professional self and align your skills with the needs of our dynamic team. At Advanced Micro Devices, Inc., we empower our employees to innovate, lead, and grow. Join us in driving the future of technology while building a rewarding and sustainable career.

Learn more about Advanced Micro Devices, Inc

Size

15,500 employees

Market Cap

$100.9 billion

Industry

Manufacturing & Automotive

Net Income

$2.4 billion

Founded

1969

5 Year Trend

+30.9%

Revenue

$9.7 billion

NASDAQ

AMD

* Ladders Estimates

Similar Jobs

Senior Software Engineer
$105K — $149K *
Empower
Remote
Today
Staff, Software Engineer
$143K — $286K *
Walmart
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Today
Senior, Software Engineer
$117K — $234K *
Walmart
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Today
Sr. Software Engineer, Fullstack - Moveworks
$120K — $160K *
ServiceNow
Mountain View, CA 94040 (Santa Clara County)
Today
Senior Software Engineer (Backend) - AI/ML
$141K — $195K *
ClickHouse
Remote
Today
Senior, Software Engineer
$117K — $234K *
Walmart
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Today

Get Ready For Your
Next Interview

More Jobs at Advanced Micro Devices, Inc

Senior Software Development Engineer - LLM Inference Framework
$130K — $180K *
Santa Clara, CA 95051 (Santa Clara County)
Reposted Today
Enterprise Technology
In-Person
Director, Product Development Engineering-ASIC/SoC
$150K — $200K *
San Jose, CA 95123 (Santa Clara County)
Reposted Today
Telecommunications & Hardware
In-Person
Mixed Signal Custom Layout Engineer (1-Year Temporary)
$90K — $120K *
Markham, ON L3R 0G6
Today
Information Technology
In-Person
Principal Robotics Simulation Architect
$130K — $180K *
Austin, TX 78745 (Travis County)
Today
Technical Services
In-Person
Mixed Signal Custom Layout Engineer (1-Year Temporary)
$80K — $120K *
Markham, ON L3R 0G6
Today
Technical Services
In-Person

More Enterprise Technology Jobs

AI Enablement Specialist
$100K — $115K *
Axis Communications
Chelmsford, MA 01824 (Middlesex County)
Today
Configurator Developer Engineer (Oracle CPQ)
$85K — $110K *
Nidec Automatic Feed
St. Louis, MO 63129 (Saint Louis County)
Today
Manager, SAP SD Public Cloud
$100K — $130K *
KPMG
Calgary, AB T1Y 7M8
Today
Sr. ERP Developer
$160K — $165K *
Cape Cod Healthcare
Hyannis, MA 02601 (Barnstable County)
Today
Technical Program Manager - Engineering Systems Integration
$105K — $180K *
KLA Tencor
Ann Arbor, MI 48103 (Washtenaw County)
Reposted Today