Advanced Micro Devices, Inc

Senior Software Development Engineer - LLM Inference Framework

Advanced Micro Devices, Inc$130K — $180K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in Python development within Linux environments.
  • Strong background in GPU kernel development and LLM inference frameworks.
  • Expert in debugging, performance tuning, and test design.
  • Hands-on experience with frameworks like TensorFlow and PyTorch for deep learning integration.
  • Master's or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field.

Responsibilities

  • Enhance and optimize deep learning frameworks like PyTorch for AMD GPUs.
  • Design multi-GPU inference strategies to improve performance.
  • Analyze and improve training and inference performance with GPU library teams.
  • Engage with open-source maintainers to align code changes and ensure integration.
  • Optimize deep learning performance across multi-GPU and multi-node systems.
  • Leverage advanced compiler technologies for performance optimization.
  • Enhance the deep learning pipeline including integrating graph compilers.

Benefits

  • Comprehensive health coverage and wellness programs.
  • Flexible work arrangements to suit individual needs.
  • Professional development and career advancement opportunities.
  • Diverse and inclusive work environment that values collaboration.
  • Employee assistance and support services.
Full Job Description
THE ROLE:

As a senior member of the LLM inference framework team, you will be responsible for building and optimizing production-grade single-node and distributed inference runtimes for large language models on AMD GPUs. You will work at the framework and runtime layer, driving performance, scalability, and reliability, enabling tensor parallelism, pipeline parallelism, expert parallelism (MoE), and single-node or multi-node inference at scale. Your work will directly power customer-facing deployments and benchmarking platforms (e.g., InferenceMax, MLPerf, strategic partners, and cloud providers) and will be upstreamed into open-source inference frameworks such as vLLM and SGLang to make AMD a first-class platform for LLM serving.

This role sits at the intersection of inference engines, distributed systems, and GPU runtime and kernel backends.

THE PERSON:

You are a systems-minded ML engineer who thinks in terms of throughput, latency, memory movement, and scheduling, not just model code.

You are comfortable reading and modifying large-scale inference frameworks, debugging performance across GPUs and nodes, and collaborating with kernel, compiler, and networking teams to close end-to-end performance gaps.

You enjoy working in open source and driving architecture-level improvements in inference platforms.

KEY RESPONSIBILITIES:

Inference Framework & Runtime
  • Architect and optimize distributed LLM inference runtimes based on in-house LLM engines or open-source stacks such as vLLM, SGLang, and llm-d
  • Design and improve TP / PP / EP (MoE) hybrid execution, including KV-cache management, attention dispatch, and token scheduling
  • Implement and optimize multi-node inference pipelines using RCCL, RDMA, and collective-based execution

Performance & Scalability
  • Drive throughput, latency, and memory efficiency across single-GPU and multi-GPU clusters
  • Optimize continuous batching, speculative decoding, KV-cache paging, prefix caching, and multi-turn serving

GPU & Backend Integration
  • Work with AMD GPU libraries (AITER, HIPBLAS-LT, RCCL, ROCm runtime) to ensure inference frameworks efficiently use FP8 / FP4 GEMM and FlashAttention / MLA
  • Collaborate with compiler teams (Triton, LLVM, ROCm) to unblock framework-level performance

Open Source & Customer Enablement
  • Upstream features and performance fixes into vLLM, SGLang, and llm-d
  • Enable customer PoCs and production deployments on AMD platforms
  • Build and maintain benchmark-grade inference pipelines


PREFERRED EXPERIENCE:

Inference Stack Knowledge
  • Hands-on understanding of vLLM, SGLang, or similar inference stacks
  • Experience with distributed inference scaling and a proven track record of contributing to upstream open-source projects

Deep Learning Integration
  • Strong experience integrating optimized GPU performance into machine-learning frameworks (e.g., PyTorch, TensorFlow) for high-throughput and scalable inference

Kernel & Inference Frameworks
  • Strong background in NVIDIA, AMD, or similar GPU architectures and kernel development

Software Engineering
  • Expertise in Python and preferably experience in C/C++, including debugging, performance tuning, and test design for large-scale systems

High-Performance Computing
  • Experience running large-scale workloads on heterogeneous GPU clusters, optimizing for efficiency and scalability

Compiler & Runtime Optimization
  • Understanding of compiler and runtime systems, including LLVM, ROCm, and GPU code generation


ACADEMIC CREDENTIALS:
  • Master's or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field.


#LI-JG1

Benefits offered are described: AMD benefits at a glance.

About Advanced Micro Devices, Inc

Advanced Micro Devices, Inc. Careers

Join the innovative forefront of technology with a career at Advanced Micro Devices, Inc. (AMD), a leader in semiconductor development. As part of our global team, you will contribute to an organization renowned for its dedication to innovation, leadership, and diversity in the tech industry.

Work You’ll Do

At AMD, we offer job opportunities that push the boundaries of what is possible. Our team is composed of professionals who lead the way in microprocessor and graphics technology, driving industry standards and innovation. With AMD, you will be part of a culture that values growth and professional development, ensuring that every team member has the opportunity to excel.

Transform Your Career

AMD is not just about advancing technology, but also about advancing careers. Whether you are looking for an internship, a full-time position, or leadership roles, AMD provides the platform to propel your career to new heights. Our commitment to professional growth is matched by our dedication to diversity and inclusion, making AMD a place where everyone can thrive.

Innovative Work Environment

Join a team of over 12,000 dedicated professionals at the intersection of technology, industry expertise, and digital innovation. At AMD, you will work on groundbreaking projects that shape the future of computing and graphics. Our collaborative environment encourages networking and the sharing of ideas across teams and disciplines.

Career Development and Benefits

AMD is committed to the development of its employees. We offer robust training programs, including leadership development and diversity training, to ensure our team is equipped for both current challenges and future opportunities. Our benefits package is designed to support the well-being and financial security of our employees and their families.

Explore Job Opportunities

From engineering to marketing, AMD offers a range of career paths that cater to diverse skills and interests. Our hiring process is designed to be transparent and engaging, helping you to understand where you fit within our team and how you can contribute to our collective goals.

Stay Connected

Join Our Team Search open positions that match your skills and interest. We look for passionate, curious, creative, and solution-driven team players. Explore the opportunities to join a company that’s committed to your career growth and to innovation in the technology sector.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who work here.

Job Alert Emails

Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding career opportunities that await at Advanced Micro Devices, Inc.

Interview and Resume Tips

Prepare for your future with AMD by accessing resources that help you craft your resume and excel in interviews. Our goal is to help you showcase your best professional self and align your skills with the needs of our dynamic team. At Advanced Micro Devices, Inc., we empower our employees to innovate, lead, and grow. Join us in driving the future of technology while building a rewarding and sustainable career.
Learn more about Advanced Micro Devices, Inc
Size
15,500 employees
Market Cap
$100.9 billion
Industry
Net Income
$2.4 billion
Founded
1969
5 Year Trend
+30.9%
Revenue
$9.7 billion
NASDAQ

Similar Jobs

More Jobs at Advanced Micro Devices, Inc

More Enterprise Technology Jobs

Find similar Senior Software Development Engineer - LLM Inference Framework jobs: