Advanced Micro Devices, Inc

Principal AI Performance Engineer

Advanced Micro Devices, Inc
$140K - $180K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years in GPU computing, AI systems, or high-performance computing.
  • Extensive experience with AI serving frameworks like vLLM or TensorRT-LLM.
  • Strong end-to-end workload profiling and performance diagnosis skills.
  • Deep understanding of GPU performance characteristics (occupancy, memory coalescing, etc.).
  • Proficiency in Python and C++ for optimization tasks.
  • Customer-facing technical leadership experience with strong communication skills.
  • Familiarity with AI-assisted development tools and workflows.

Responsibilities

  • Optimize AI inference performance across the stack and serving configurations.
  • Profile and diagnose complex cross-stack performance bottlenecks.
  • Translate kernel-level performance issues into actionable optimizations.
  • Lead discussions with customers on performance findings and recommendations.
  • Integrate and optimize custom kernels within AI serving frameworks.
  • Enhance multi-node distributed inference strategies for performance improvements.
  • Develop methodologies for performance optimization shared within the team.

Benefits

  • Comprehensive health and wellness benefits package.
  • Flexible work arrangements including hybrid options.
  • Opportunities for continuous learning and professional development.
  • Inclusive work environment that values diverse perspectives.
  • Engagement in innovative projects at the cutting edge of technology.
Full Job Description
PRINCIPAL AI PERFORMANCE ENGINEER

THE ROLE:

AMD is looking for a performance-obsessed engineer to drive AI inference performance to the absolute limit on AMD GPUs. You will lead a small, highly technical team and work end-to-end across the stack: profiling, diagnosing, and optimizing leading models on customer-relevant serving configurations (e.g. agentic coding, long-context, high-throughput serving). You move from challenge to challenge, tackling the hardest performance problems across our most strategic customer engagements and leaving behind measurable uplifts and reusable methodology. This is not a sustaining role: every engagement is different, every optimization leaves a lasting impact.

THE PERSON:

You can take any AI workload, understand it top to bottom, and make it faster. You are equally comfortable profiling a distributed serving deployment, diagnosing a kernel-level bottleneck, and presenting optimization results to a customer's VP of Engineering. You understand GPU kernel performance deeply: not just how to use profiling tools, but how to reason about occupancy, cache behavior, memory coalescing, and instruction-level bottlenecks from first principles. You lead through technical depth: you set the standard for your team by doing the hardest work yourself and pulling others up along the way. You are AI-fluent, not just in the models you optimize, but in how you work: you leverage AI agents and tools daily to accelerate your workflows, and you actively define new ways of using them to make yourself and your team more effective. You thrive under pressure, move fast, and measure everything.

KEY RESPONSIBILITIES:
  • Drive performance optimization end-to-end across the stack on leading models and customer-relevant serving configurations, closing competitive gaps through kernel and systems-level optimizations
  • Profile, diagnose, and resolve the hardest cross-stack performance bottlenecks, from GPU kernels and operator dispatch to framework-level scheduling and multi-node communication
  • Diagnose kernel-level performance issues using profiling tools: identify occupancy limitations, L2 cache thrashing, register pressure, memory coalescing issues, etc., and translate findings into actionable optimizations
  • Lead customer-facing technical engagements: present findings, recommend optimizations, and deliver measurable performance uplifts
  • Integrate and optimize custom kernels (Triton, Gluon, CK, PyDSL, ASM, AITER) within serving frameworks, understanding dispatch paths, shape extraction, and backend selection
  • Optimize multi-node distributed inference: communication-compute overlap, parallelism strategies, and scale-out performance
  • Develop and refine shared performance optimization methodology that raises the bar across the broader team
  • Leverage AI agents to accelerate daily work and define best practices for AI-assisted performance engineering
  • Upstream optimizations into open-source frameworks such as vLLM, SGLang, and PyTorch


PREFERRED EXPERIENCE:
  • 7+ years of software development experience in GPU computing, AI systems, or high-performance computing
  • Deep hands-on experience with AI serving frameworks (vLLM, SGLang, TensorRT-LLM, or similar) and their internals
  • Strong background in end-to-end workload profiling and bottleneck diagnosis: you can trace from user request to GPU kernel and back
  • Understanding of GPU kernel performance characteristics: occupancy, register and LDS pressure, memory coalescing, cache utilization, wavefront scheduling, and instruction-level bottlenecks
  • Ability to read and reason about kernel-level profiling data and translate it into concrete optimization actions. You may not write kernels from scratch daily, but you can tell exactly why one is slow and what needs to change
  • Understanding of model architectures (transformers, MoE, diffusion), inference paradigms (speculative decoding, prefill-decode disaggregation, continuous batching), and how they map to hardware
  • Experience with custom kernel development or integration (HIP, CUDA, Triton, CK, or similar)
  • Understanding of multi-GPU and multi-node distributed systems: scale-up and scale-out topologies, RCCL/NCCL, RDMA, and communication-compute overlap
  • System and rack-level design awareness: understanding performance tradeoffs across the full deployment stack
  • Strong proficiency in Python and C++
  • Customer-facing technical leadership experience: ability to engage with customers, present findings, and drive decisions
  • Fluent in AI-assisted development: daily user of AI agents and tools, with a mindset toward defining new AI-powered workflows
  • Strong Linux systems knowledge
  • Excellent written and verbal English communication skills


ACADEMIC CREDENTIALS:

Bachelor's, Master's, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent. An advanced degree is preferred, but exceptional industry experience is valued equally.

LOCATION:

San Jose, CA, preferred

Benefits offered are described in AMD benefits at a glance.

About Advanced Micro Devices, Inc

Advanced Micro Devices, Inc. Careers

Join the innovative forefront of technology with a career at Advanced Micro Devices, Inc. (AMD), a leader in semiconductor development. As part of our global team, you will contribute to an organization renowned for its dedication to innovation, leadership, and diversity in the tech industry.

Work You’ll Do

At AMD, we offer job opportunities that push the boundaries of what is possible. Our team is composed of professionals who lead the way in microprocessor and graphics technology, driving industry standards and innovation. With AMD, you will be part of a culture that values growth and professional development, ensuring that every team member has the opportunity to excel.

Transform Your Career

AMD is not just about advancing technology, but also about advancing careers. Whether you are looking for an internship, a full-time position, or leadership roles, AMD provides the platform to propel your career to new heights. Our commitment to professional growth is matched by our dedication to diversity and inclusion, making AMD a place where everyone can thrive.

Innovative Work Environment

Join a team of over 12,000 dedicated professionals at the intersection of technology, industry expertise, and digital innovation. At AMD, you will work on groundbreaking projects that shape the future of computing and graphics. Our collaborative environment encourages networking and the sharing of ideas across teams and disciplines.

Career Development and Benefits

AMD is committed to the development of its employees. We offer robust training programs, including leadership development and diversity training, to ensure our team is equipped for both current challenges and future opportunities. Our benefits package is designed to support the well-being and financial security of our employees and their families.

Explore Job Opportunities

From engineering to marketing, AMD offers a range of career paths that cater to diverse skills and interests. Our hiring process is designed to be transparent and engaging, helping you to understand where you fit within our team and how you can contribute to our collective goals.

Stay Connected

Join Our Team

Search open positions that match your skills and interests. We look for passionate, curious, creative, and solution-driven team players. Explore the opportunities to join a company that’s committed to your career growth and to innovation in the technology sector.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who work here.

Job Alert Emails

Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding career opportunities that await at Advanced Micro Devices, Inc.

Interview and Resume Tips

Prepare for your future with AMD by accessing resources that help you craft your resume and excel in interviews. Our goal is to help you showcase your best professional self and align your skills with the needs of our dynamic team. At Advanced Micro Devices, Inc., we empower our employees to innovate, lead, and grow. Join us in driving the future of technology while building a rewarding and sustainable career.

Learn more about Advanced Micro Devices, Inc

Size: 15,500 employees
Market Cap: $100.9 billion
Industry:
Net Income: $2.4 billion
Founded: 1969
5 Year Trend: +30.9%
Revenue: $9.7 billion
NASDAQ
