NVIDIA Corporation

Engineering Manager, LLM Performance

NVIDIA Corporation$224K — $431K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • MS, PhD, or equivalent experience in Computer Science, Computer Engineering, AI, or a related technical field.
  • 7+ years of software engineering experience, with 3+ years in technical leadership.
  • Proven ability to lead and scale engineering teams across distributed and cross-functional groups.
  • Strong programming skills in C++ or Python with a focus on production-quality software libraries.
  • Expertise in large language models (LLM) or vision language models (VLM) and inference systems.

Responsibilities

  • Lead and develop a team focused on enhancing LLM inference performance across various LLM frameworks.
  • Design, implement, and optimize features essential for LLM inference performance.
  • Enhance LLM inference performance on current and future NVIDIA datacenter architectures.
  • Work with benchmark teams to fine-tune performance for significant workloads.
  • Integrate advanced NVIDIA technologies to improve the developer experience for LLM deployment.
  • Oversee software development processes, including project planning and cross-functional coordination.

Benefits

  • Eligible for equity incentives.
  • Opportunity to work in a hybrid environment.
  • Access to professional growth opportunities.
  • Collaboration with leading experts in AI and GPU architecture.
Full Job Description
We're seeking a highly skilled and driven Engineering Manager to take the lead in accelerating the next generation of LLM/VLM/VLA inference software technologies that will define the future of AI. This is a high-impact, hands-on leadership role at the intersection of deep technical expertise and world-class management. You won't just manage; you'll architect and guide a brilliant team of engineers who are pushing the performance of LLM inference. Your work will be highly collaborative, interfacing directly with NVIDIA Researchers, GPU Architects, and other teams across the company to ensure we ship production-grade, lightning-fast software that sets the global standard for AI performance.

What You'll Be Doing:

  • Lead and grow a team responsible for pushing the performance of LLM inference across multiple LLM frameworks, including TensorRT LLM, vLLM, SGLang and Dynamo on our datacenter products.
  • Drive the design, implementation and optimization of features that are key to performance in LLM inference.
  • Continuously improve the performance of LLM inference on current and upcoming NVIDIA datacenter architectures and GPUs.
  • Continuously improve the performance of LLM inference of important foundation models.
  • Work with inference benchmark teams to help tune performance for key workloads.
  • Integrating cutting-edge technologies developed at NVIDIA and offering an intuitive developer experience for LLM deployment.
  • Lead software development execution, with responsibility for project planning, milestone delivery, and cross-functional coordination.


What We Need to See:

  • MS, PhD, or equivalent experience in Computer Science, Computer Engineering, AI, or a related technical field.
  • 7+ overall years of overall software engineering experience, including 3+ years of technical leadership experience.
  • Proven ability to lead and scale high-performing engineering teams, especially across distributed and cross-functional groups.
  • Strong background in C++ or Python, with expertise in software design and delivering production-quality software libraries.
  • Demonstrated expertise in large language models (LLM) and/or vision language models (VLM) and/or inference in general.


Ways to Stand Out from the Crowd:

  • Deep understanding of GPU architecture, CUDA programming, and system-level performance tuning.
  • Background in LLM inference or working with frameworks such as TensorRT-LLM, vLLM, or SGLang.
  • Passion for building scalable, user-friendly APIs and enabling developers in the AI ecosystem.
  • Have a proven track record of growing and managing a team that encourages idea sharing, empowers team members, and provides opportunities for professional growth.


#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 3, and 272,000 USD - 431,250 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 27, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

About NVIDIA Corporation

Nvidia, a global leader in graphics, gaming, and AI technology, offers Nvidia careers and internship opportunities for those passionate about driving innovation in the tech industry. you'll find a company committed to growth, teamwork, and leadership in computer science and machine learning domains.

About Nvidia

A Pioneer in Technology and Innovation

Nvidia has cemented its reputation as a powerhouse in developing advanced graphics processing units (GPUs) and has significantly contributed to the gaming industry's evolution. Moreover, its foray into AI and machine learning has opened new frontiers in technology, making Nvidia a beacon of innovation and a desirable workplace for ambitious tech professionals.

Job Opportunities

Diverse Positions in a Dynamic Field

Nvidia is continuously on the lookout for talented individuals across various domains, including hardware and software engineering, product design, marketing, and sales. Employment opportunities at Nvidia are vast, catering to a wide range of expertise and career aspirations.

Employment in Hardware and Graphics

For those fascinated by the intricacies of hardware and graphics technology, Nvidia offers positions that sit at the forefront of gaming and computing advancements.

Growth in Machine Learning and AI

Nvidia's leadership in AI and machine learning has created numerous vacancies for specialists eager to contribute to groundbreaking projects.

Recruitment in Computer Science

With the constant demand for innovation, Nvidia's recruitment efforts focus on computer science experts capable of pushing the boundaries of what's possible.

Internship Program

Opening Doors to Future Innovators

Nvidia's internship program is designed to nurture the next generation of technology leaders, offering hands-on experience in a culture that celebrates creativity and teamwork.

Benefits and Culture

Interns at Nvidia enjoy a plethora of benefits, from competitive stipends to mentorship opportunities, all within an environment that values growth and learning.

Opportunities for Students

Whether you're an undergraduate, a master's student, or a Ph.D. candidate, Nvidia's internships provide a real-world glimpse into the tech industry, offering valuable experience in various technology fields.

Pathways to Full-Time Employment

Many interns have transitioned into full-time positions, marking the start of successful careers at Nvidia. The internship program is more than a stepping stone into the company; it’s an investment in the professional development of interns. The goal is to ensure that interns are well-equipped for future challenges.

Nvidia Careers: More Than Just a Job

Nvidia offers more than just a job to its employees; it provides a front-row seat on the journey into the future of technology. Nvidia stands as a pillar of innovation with its vast opportunities in hardware, graphics, gaming, machine learning, and computer science. Nvidia careers serve as a launching pad for talented workers who aim to redefine the technological landscape. Whether through full-time positions or internships, joining Nvidia means contributing to a legacy of breakthroughs and becoming part of a global community dedicated to pushing the boundaries of what's possible.
Learn more about NVIDIA Corporation
Size
22,473 employees
Market Cap
$350.4 billion
Industry
Net Income
$4.3 billion
Founded
1993
5 Year Trend
+31.3%
Revenue
$16.6 billion
NASDAQ

Similar Jobs

More Jobs at NVIDIA Corporation

More Information Technology Jobs

Find similar Engineering Manager, LLM Performance jobs: