Principal AI Performance Engineer

Advanced Micro Devices, Inc • $140K — $180K *

San Jose, CA 95123In-Person

Information Technology

5 - 7 years of experience

2 months ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

7+ years in GPU computing, AI systems, or high-performance computing.
Extensive experience with AI serving frameworks like vLLM or TensorRT-LLM.
Strong end-to-end workload profiling and performance diagnosis skills.
Deep understanding of GPU performance characteristics (occupancy, memory coalescing, etc.).
Proficiency in Python and C++ for optimization tasks.
Customer-facing technical leadership experience with strong communication skills.
Familiarity with AI-assisted development tools and workflows.

Responsibilities

Optimize AI inference performance across stack and configurations.
Profile and diagnose complex performance bottlenecks in systems.
Translate kernel-level performance issues into actionable optimizations.
Lead discussions with customers on performance findings and recommendations.
Integrate and optimize custom kernels within AI serving frameworks.
Enhance multi-node distributed inference strategies for performance improvements.
Develop methodologies for performance optimization shared within the team.

Benefits

Comprehensive health and wellness benefits package.
Flexible work arrangements including hybrid options.
Opportunities for continuous learning and professional development.
Inclusive work environment that values diverse perspectives.
Engagement in innovative projects at the cutting edge of technology.

Full Job Description

PRINCIPAL AI PERFORMANCE ENGINEER

THE ROLE:

AMD is looking for a performance-obsessed engineer to drive AI inference performance to the absolute limit on AMD GPUs. You will lead a small, highly technical team and work end-to-end across the stack: profiling, diagnosing, and optimizing leading models on customer-relevant serving configurations (e.g. agentic coding, long-context, high-throughput serving). You move from challenge to challenge, tackling the hardest performance problems across our most strategic customer engagements and leaving behind measurable uplifts and reusable methodology. This is not a sustaining role: every engagement is different, every optimization leaves a lasting impact.

THE PERSON:

You can take any AI workload, understand it top to bottom, and make it faster. You are equally comfortable profiling a distributed serving deployment, diagnosing a kernel-level bottleneck, and presenting optimization results to a customer's VP of Engineering. You understand GPU kernel performance deeply: not just how to use profiling tools, but how to reason about occupancy, cache behavior, memory coalescing, and instruction-level bottlenecks from first principles. You lead through technical depth: you set the standard for your team by doing the hardest work yourself and pulling others up along the way. You are AI-fluent, not just in the models you optimize, but in how you work: you leverage AI agents and tools daily to accelerate your workflows, and you actively define new ways of using them to make yourself and your team more effective. You thrive under pressure, move fast, and measure everything.

KEY RESPONSIBILITIES:

Drive performance optimization end-to-end across the stack on leading models and customer-relevant serving configurations, closing competitive gaps through kernel and systems-level optimizations
Profile, diagnose, and resolve the hardest cross-stack performance bottlenecks, from GPU kernels and operator dispatch to framework-level scheduling and multi-node communication
Diagnose kernel-level performance issues using profiling tools: identify occupancy limitations, L2 cache thrashing, register pressure, memory coalescing issues, etc, and translate findings into actionable optimizations
Lead customer-facing technical engagements: present findings, recommend optimizations, and deliver measurable performance uplifts
Integrate and optimize custom kernels (Triton, Gluon, CK, PyDSL, ASM, AITER) within serving frameworks, understanding dispatch paths, shape extraction, and backend selection
Optimize multi-node distributed inference: communication-compute overlap, parallelism strategies, and scale-out performance
Develop and refine shared performance optimization methodology that raises the bar across the broader team
Leverage AI agents to accelerate daily work and define best practices for AI-assisted performance engineering
Upstream optimizations into open-source frameworks such as vLLM, SGLang, and PyTorch

PREFERRED EXPERIENCE:

7+ years of software development experience in GPU computing, AI systems, or high-performance computing
Deep hands-on experience with AI serving frameworks (vLLM, SGLang, TensorRT-LLM, or similar) and their internals
Strong background in end-to-end workload profiling and bottleneck diagnosis: you can trace from user request to GPU kernel and back
Understanding of GPU kernel performance characteristics: occupancy, register and LDS pressure, memory coalescing, cache utilization, wavefront scheduling, and instruction-level bottlenecks
Ability to read and reason about kernel-level profiling data and translate it into concrete optimization actions. You may not write kernels from scratch daily, but you can tell exactly why one is slow and what needs to change
Understanding of model architectures (transformers, MoE, diffusion), inference paradigms (speculative decoding, prefill-decode disaggregation, continuous batching), and how they map to hardware
Experience with custom kernel development or integration (HIP, CUDA, Triton, CK, or similar)
Understanding of multi-GPU and multi-node distributed systems: scale-up and scale-out topologies, RCCL/NCCL, RDMA, and communication-compute overlap
System and rack-level design awareness: understanding performance tradeoffs across the full deployment stack
Strong proficiency in Python and C++
Customer-facing technical leadership experience: ability to engage with customers, present findings, and drive decisions
Fluent in AI-assisted development: daily user of AI agents and tools, with a mindset toward defining new AI-powered workflows
Strong Linux systems knowledge
Excellent written and verbal English communication skills

ACADEMIC CREDENTIALS:

Bachelor's, Master's, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or equivalent. Advanced degree preferred but exceptional industry experience valued equally.

LOCATION:

San Jose, CA, preferred

#LI-TC1

#LI-HYBRID

Benefits offered are described: AMD benefits at a glance.

About Advanced Micro Devices, Inc

Advanced Micro Devices, Inc. Careers

Join the innovative forefront of technology with a career at Advanced Micro Devices, Inc. (AMD), a leader in semiconductor development. As part of our global team, you will contribute to an organization renowned for its dedication to innovation, leadership, and diversity in the tech industry.

Work You’ll Do

At AMD, we offer job opportunities that push the boundaries of what is possible. Our team is composed of professionals who lead the way in microprocessor and graphics technology, driving industry standards and innovation. With AMD, you will be part of a culture that values growth and professional development, ensuring that every team member has the opportunity to excel.

Transform Your Career

AMD is not just about advancing technology, but also about advancing careers. Whether you are looking for an internship, a full-time position, or leadership roles, AMD provides the platform to propel your career to new heights. Our commitment to professional growth is matched by our dedication to diversity and inclusion, making AMD a place where everyone can thrive.

Innovative Work Environment

Join a team of over 12,000 dedicated professionals at the intersection of technology, industry expertise, and digital innovation. At AMD, you will work on groundbreaking projects that shape the future of computing and graphics. Our collaborative environment encourages networking and the sharing of ideas across teams and disciplines.

Career Development and Benefits

AMD is committed to the development of its employees. We offer robust training programs, including leadership development and diversity training, to ensure our team is equipped for both current challenges and future opportunities. Our benefits package is designed to support the well-being and financial security of our employees and their families.

Explore Job Opportunities

From engineering to marketing, AMD offers a range of career paths that cater to diverse skills and interests. Our hiring process is designed to be transparent and engaging, helping you to understand where you fit within our team and how you can contribute to our collective goals.

Stay Connected

Join Our Team Search open positions that match your skills and interest. We look for passionate, curious, creative, and solution-driven team players. Explore the opportunities to join a company that’s committed to your career growth and to innovation in the technology sector.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who work here.

Job Alert Emails

Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding career opportunities that await at Advanced Micro Devices, Inc.

Interview and Resume Tips

Prepare for your future with AMD by accessing resources that help you craft your resume and excel in interviews. Our goal is to help you showcase your best professional self and align your skills with the needs of our dynamic team. At Advanced Micro Devices, Inc., we empower our employees to innovate, lead, and grow. Join us in driving the future of technology while building a rewarding and sustainable career.

Learn more about Advanced Micro Devices, Inc

Size

15,500 employees

Market Cap

$100.9 billion

Industry

Manufacturing & Automotive

Net Income

$2.4 billion

Founded

1969

5 Year Trend

+30.9%

Revenue

$9.7 billion

NASDAQ

AMD

* Ladders Estimates

Similar Jobs

Senior AI Engineer - Health Intelligence
$172K — $203K *
Oura
San Francisco, CA 94112 (San Francisco County)
Today
Staff AI Software Engineer, Siri Core Modeling
$130K — $180K *
Apple
Cupertino, CA 95014 (Santa Clara County)
Reposted Today
Software Engineer III, Generative AI, Payments Risk
$147K — $211K *
Google
Mountain View, CA 94040 (Santa Clara County)
Today
AI Engineer
$130K — $175K *
AI Fund
Mountain View, CA 94040 (Santa Clara County)
Today
AI/ML Engineer
$100K — $150K *
VXForward LLC
Remote
Today
Senior OpenAI Forward Deployed Engineer - GPS
$155K — $306K *
Deloitte
Sacramento, CA 95823 (Sacramento County)
Today

Get Ready For Your
Next Interview

More Jobs at Advanced Micro Devices, Inc

Senior Software Development Engineer - LLM Inference Framework
$130K — $180K *
Santa Clara, CA 95051 (Santa Clara County)
Reposted Today
Enterprise Technology
In-Person
Director, Product Development Engineering-ASIC/SoC
$150K — $200K *
San Jose, CA 95123 (Santa Clara County)
Reposted Today
Telecommunications & Hardware
In-Person
Mixed Signal Custom Layout Engineer (1-Year Temporary)
$90K — $120K *
Markham, ON L3R 0G6
Today
Information Technology
In-Person
Principal Robotics Simulation Architect
$130K — $180K *
Austin, TX 78745 (Travis County)
Today
Technical Services
In-Person
Mixed Signal Custom Layout Engineer (1-Year Temporary)
$80K — $120K *
Markham, ON L3R 0G6
Today
Technical Services
In-Person

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Customer Support
Confidential Company
Austin, TX 78701 (Travis County)
2 weeks ago
Sr Assoc, Cyber Sec ThreatMgmt - Detection Engineer
$88K — $151K *
Northern Trust
Naperville, IL 60540 (Dupage County)
Today
Global Director – Vulnerability Management & Security Configuration
$164K — $288K *
Northern Trust
Chicago, IL 60629 (Cook County)
Today

Find similar Principal AI Performance Engineer jobs:

Nationwide San Jose, CA