Please Note:
1. If you are a first-time user, please create your candidate login account before you apply for a job. (Click Sign In > Create Account)
2. If you already have a candidate account, please sign in before you apply.
Job Description:
Job Summary:
We are seeking an experienced Principal Software Engineer with a track record of leading technical initiatives. As a Principal Engineer, you will focus on developing and integrating our AI Virtualization Stack to provide hardware-agnostic acceleration for AI/ML workloads on Virtual Machines. This role is critical in enabling multi-vendor GPU and XPU support using ML compilation technologies.
Responsibilities:
Research, design, and develop the AI Virtualization Stack for our ESXi server product.
Implement and optimize PyTorch and JAX backends using the OpenXLA framework to ensure high-performance AI/ML workload execution across GPUs and XPUs.
Analyze and re-architect performance-critical sections of the ML acceleration code, focusing on optimization techniques for LLM inference such as KV-caching and FlashAttention.
Troubleshoot and address bugs related to AI/ML acceleration functionality.
Deliver software that meets the coding guidelines and quality standards set by the VCF.
Develop and maintain technical documentation for delivered features.
Work closely with the larger team, including the virtual driver and device teams, as well as external GPU/XPU vendors, to provide end-to-end support for ML frameworks.
Stay up to date with the latest GPU/XPU hardware architectures and AI/ML compiler technologies.
Qualifications:
Bachelor's degree in Computer Science or a related field and 12+ years of related experience, or a Master's degree and 10+ years of related experience.
5+ years of experience in ML framework/runtime development, GPU/XPU backend engineering.
Strong understanding of and direct experience with ML frameworks (PyTorch, JAX) and graph/ML compiler technologies (e.g., OpenXLA).
Experience with C++ and Python programming languages.
Strong problem-solving skills and ability to troubleshoot complex issues.
Excellent communication and collaboration skills.
Experience with version control systems such as Git.
Ability to thrive in a fast-paced and dynamic work environment.
Familiarity with enterprise coding standards and best practices.
Nice to Have:
Experience with inference servers such as vLLM or Triton.
Experience with low-level GPU kernel development and writing custom kernels (e.g., CUDA, ROCm, or similar).
Must have legal authorization to work in the US.
Additional Job Description:
Compensation and Benefits
The annual base salary range for this position is $127,100 - $226,000.
As a valued member of our team, you'll be eligible for a discretionary annual bonus and the opportunity to receive not only a competitive new hire equity grant, but also annual equity awards, connecting your success directly to the company's growth. All of these are subject to the relevant plan documents and award agreements.
Broadcom offers a competitive and comprehensive benefits package: medical, dental and vision plans, 401(k) participation including company matching, Employee Stock Purchase Program (ESPP), Employee Assistance Program (EAP), company-paid holidays, paid sick leave and vacation time. The company follows all applicable laws for Paid Family Leave and other leaves of absence.
If you are located outside the USA, please be sure to provide a home address, as it will be used for future correspondence.