ARM

Principal AI Performance Engineer

ARM$262K — $355K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Experience optimizing DNNs using Triton, CUDA, or similar programming languages.
  • Deep understanding of parallel computing and performance optimization for DNNs.
  • Proficiency in Python and C++, with familiarity in AI frameworks and profiling tools.
  • Strong communication and interpersonal skills.

Responsibilities

  • Develop optimized solutions for AI workloads at both the kernel and system levels.
  • Create production quality reference implementations and performance-focused documentation.
  • Act as a liaison between customers and engineering teams to resolve performance issues.
  • Influence Arm's IP and software roadmap with insights from customer use cases.

Benefits

  • Central role in impactful initiatives within the company.
  • Opportunities for innovation, scaling, and global partnerships.
  • Engage in customer-focused projects, enhancing skills and experience.
Full Job Description
Job Summary;

Are you passionate about optimizing AI workloads and delivering real-world performance improvements on edge devices?

We're looking for an experienced engineer to help customers achieve best-in-class inference performance for production AI models running on Arm technology.

This role is based in San Jose, with significant time spent working directly with customers across the Bay Area.

Job Description:

In this role, you will work closely with customers to optimize AI workloads targeting Arm technology, focusing on achieving best-in-class performance and power efficiency.

Using your experience with Arm's AI optimization tools and your understanding of hardware architectures, you will develop kernel level implementations across a range of DNN models, optimizing for power and performance.

In collaboration with multiple teams across Arm's engineering organization, you will diagnose and resolve performance challenges, and use these insights to influence Arms IP and tooling roadmaps.

This role requires strong coding and communication skills; you'll translate complex technical challenges into clear insights, presenting progress and recommendations to audiences ranging from engineers to senior leadership.

Responsibilities:

  • Develop highly optimized solutions for AI workloads, from kernel level to system level, to meet the needs of the customer application.
  • Create production quality reference implementations, documentation, and performance focused technical content
  • Act as a technical bridge between customers and internal teams, driving resolution of complex performance issues
  • Influence Arm's IP and software roadmap through insights gained from real-world customer use cases


Required Skills and Experience :

  • Experience optimizing DNNs in Triton, CUDA or other kernel level programming language
  • Deep understanding of parallel computing, memory hierarchies and performance optimization techniques for DNNs
  • Strong programming skills in Python and C++, a solid experience with modern AI frameworks and execution models, experience with profiling and analysis tools
  • Strong communication and interpersonal skills


"Nice To Have":

  • Experience in a customer facing or field engineering environment
  • Experience within the Arm ecosystem
  • Background in AI performance optimization for edge devices


In Return:

Joining Arm means stepping into a career-defining opportunity. You'll occupy a central role in the company's most critical initiatives. These initiatives build how Arm innovates, scales, and partners globally.

Additional Information

Please note this role does not meet the eligibility requirements for sponsorship, and therefore the successful candidate must have the right to work in the US without relying on sponsorship by Arm.

About ARM

ARM Holdings is a British multinational semiconductor and software design company, owned by SoftBank Group and its Vision Fund. With its headquarters in Cambridge, England, the company designs microprocessors, physical intellectual property (IP) and related technology and software, and sells development tools to deliver complete solutions for the digital world. ARM's technology is used in a wide range of applications, including automotive, consumer electronics, and Internet of Things (IoT) devices. The company was founded in 1990 and has grown to become one of the world's leading semiconductor IP companies.
Learn more about ARM
Size
6,000 employees
Industry

Similar Jobs

More Jobs at ARM

More Enterprise Technology Jobs

Find similar Principal AI Performance Engineer jobs: