Principal AI Performance Engineer

ARM • $262K — $355K *

San Jose, CA 95123In-Person

Enterprise Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Experience optimizing DNNs using Triton, CUDA, or similar programming languages.
Deep understanding of parallel computing and performance optimization for DNNs.
Proficiency in Python and C++, with familiarity in AI frameworks and profiling tools.
Strong communication and interpersonal skills.

Responsibilities

Develop optimized solutions for AI workloads at both the kernel and system levels.
Create production quality reference implementations and performance-focused documentation.
Act as a liaison between customers and engineering teams to resolve performance issues.
Influence Arm's IP and software roadmap with insights from customer use cases.

Benefits

Central role in impactful initiatives within the company.
Opportunities for innovation, scaling, and global partnerships.
Engage in customer-focused projects, enhancing skills and experience.

Full Job Description

Job Summary;

Are you passionate about optimizing AI workloads and delivering real-world performance improvements on edge devices?

We're looking for an experienced engineer to help customers achieve best-in-class inference performance for production AI models running on Arm technology.

This role is based in San Jose, with significant time spent working directly with customers across the Bay Area.

Job Description:

In this role, you will work closely with customers to optimize AI workloads targeting Arm technology, focusing on achieving best-in-class performance and power efficiency.

Using your experience with Arm's AI optimization tools and your understanding of hardware architectures, you will develop kernel level implementations across a range of DNN models, optimizing for power and performance.

In collaboration with multiple teams across Arm's engineering organization, you will diagnose and resolve performance challenges, and use these insights to influence Arms IP and tooling roadmaps.

This role requires strong coding and communication skills; you'll translate complex technical challenges into clear insights, presenting progress and recommendations to audiences ranging from engineers to senior leadership.

Responsibilities:

Develop highly optimized solutions for AI workloads, from kernel level to system level, to meet the needs of the customer application.
Create production quality reference implementations, documentation, and performance focused technical content
Act as a technical bridge between customers and internal teams, driving resolution of complex performance issues
Influence Arm's IP and software roadmap through insights gained from real-world customer use cases

Required Skills and Experience :

Experience optimizing DNNs in Triton, CUDA or other kernel level programming language
Deep understanding of parallel computing, memory hierarchies and performance optimization techniques for DNNs
Strong programming skills in Python and C++, a solid experience with modern AI frameworks and execution models, experience with profiling and analysis tools
Strong communication and interpersonal skills

"Nice To Have":

Experience in a customer facing or field engineering environment
Experience within the Arm ecosystem
Background in AI performance optimization for edge devices

In Return:

Joining Arm means stepping into a career-defining opportunity. You'll occupy a central role in the company's most critical initiatives. These initiatives build how Arm innovates, scales, and partners globally.

Additional Information

Please note this role does not meet the eligibility requirements for sponsorship, and therefore the successful candidate must have the right to work in the US without relying on sponsorship by Arm.

About ARM

ARM Holdings is a British multinational semiconductor and software design company, owned by SoftBank Group and its Vision Fund. With its headquarters in Cambridge, England, the company designs microprocessors, physical intellectual property (IP) and related technology and software, and sells development tools to deliver complete solutions for the digital world. ARM's technology is used in a wide range of applications, including automotive, consumer electronics, and Internet of Things (IoT) devices. The company was founded in 1990 and has grown to become one of the world's leading semiconductor IP companies.

Learn more about ARM

Size

6,000 employees

Industry

Manufacturing & Automotive

* Ladders Estimates

Similar Jobs

Senior AI Application Developer - GPU and SOC Architecture Modeling
$152K — $287K *
NVIDIA Corporation
Santa Clara, CA 95051 (Santa Clara County)
Today
Distinguished AI Engineer
$293K — $335K *
Capital One Financial Corporation
San Francisco, CA 94112 (San Francisco County)
Reposted Today
Distinguished AI Engineer
$293K — $335K *
Capital One Financial Corporation
San Jose, CA 95123 (Santa Clara County)
Reposted Today
Senior Machine Learning Systems Engineer, Ads ML Experience Platform
$216K — $303K *
Reddit
Remote
Today
Senior Lead AI Engineer (GenAI Platform, Agentic Infrastructure)
$250K — $286K *
Capital One Financial Corporation
San Francisco, CA 94112 (San Francisco County)
Today
Senior Lead AI Engineer (GenAI Platform, Agentic Infrastructure)
$250K — $286K *
Capital One Financial Corporation
San Jose, CA 95123 (Santa Clara County)
Today

Get Ready For Your
Next Interview

More Jobs at ARM

Principal AI Performance Engineer
$262K — $355K *
San Jose, CA 95123 (Santa Clara County)
Today
Enterprise Technology
In-Person
Principal Engineering Lead
$262K — $355K *
San Jose, CA 95123 (Santa Clara County)
Today
Consumer Technology
In-Person
CPU Performance - Principal Microarchitecture Exploration Engineer
$249K — $338K *
Austin, TX 78745 (Travis County)
Reposted Today
Information Technology
In-Person
Director, SW Product Management
$262K — $355K *
San Jose, CA 95123 (Santa Clara County)
5 days ago
Enterprise Technology
In-Person
Staff Verification Engineer
$191K — $258K *
Austin, TX 78745 (Travis County)
6 days ago
Technical Services
In-Person

More Enterprise Technology Jobs

Business Analyst
$80K — $110K *
Allied Consultants
Austin, TX 78745 (Travis County)
Today
Dynamics 365 CE Functional Lead
$90K — $120K *
Cedar Rapids, IA 52402 (Linn County)
Reposted Today
Application Development Operations Manager
$100K — $130K *
Edgewater Federal Solutions, Inc.
Albuquerque, NM 87121 (Bernalillo County)
Today
Data Architect SME
$135K — $216K *
Peraton
Bowie, MD 20721 (Prince Georges County)
Today
Lead Developer
$100K — $130K *
Intercontinental Exchange Holdings, Inc.
Atlanta, GA 30349 (Fulton County)
Today

Find similar Principal AI Performance Engineer jobs:

Nationwide San Jose, CA

Principal AI Performance Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Principal AI Performance Engineer jobs:

Get Ready For Your
Next Interview