Job Summary;Are you passionate about optimizing AI workloads and delivering real-world performance improvements on edge devices?
We're looking for an experienced engineer to help customers achieve best-in-class inference performance for production AI models running on Arm technology.
This role is based in San Jose, with significant time spent working directly with customers across the Bay Area.
Job Description:In this role, you will work closely with customers to optimize AI workloads targeting Arm technology, focusing on achieving best-in-class performance and power efficiency.
Using your experience with Arm's AI optimization tools and your understanding of hardware architectures, you will develop kernel level implementations across a range of DNN models, optimizing for power and performance.
In collaboration with multiple teams across Arm's engineering organization, you will diagnose and resolve performance challenges, and use these insights to influence Arms IP and tooling roadmaps.
This role requires strong coding and communication skills; you'll translate complex technical challenges into clear insights, presenting progress and recommendations to audiences ranging from engineers to senior leadership.
Responsibilities:- Develop highly optimized solutions for AI workloads, from kernel level to system level, to meet the needs of the customer application.
- Create production quality reference implementations, documentation, and performance focused technical content
- Act as a technical bridge between customers and internal teams, driving resolution of complex performance issues
- Influence Arm's IP and software roadmap through insights gained from real-world customer use cases
Required Skills and Experience :- Experience optimizing DNNs in Triton, CUDA or other kernel level programming language
- Deep understanding of parallel computing, memory hierarchies and performance optimization techniques for DNNs
- Strong programming skills in Python and C++, a solid experience with modern AI frameworks and execution models, experience with profiling and analysis tools
- Strong communication and interpersonal skills
"Nice To Have":- Experience in a customer facing or field engineering environment
- Experience within the Arm ecosystem
- Background in AI performance optimization for edge devices
In Return:Joining Arm means stepping into a career-defining opportunity. You'll occupy a central role in the company's most critical initiatives. These initiatives build how Arm innovates, scales, and partners globally.
Additional InformationPlease note this role does not meet the eligibility requirements for sponsorship, and therefore the successful candidate must have the right to work in the US without relying on sponsorship by Arm.