About the RoleWe're looking for a seasoned AI/ML Staff Software Engineer to lead workload-driven architecture strategy across hardware and software boundaries. You will define how we study, model, and optimize AI/ML workloads for current and next-generation products, drive alignment across HW and SW engineering organizations, and serve as a technical authority on performance and architecture tradeoffs. This is a senior individual contributor role with significant cross-functional scope and organizational influence.
Summary of RoleWe're looking for a seasoned AI/ML Staff Software Engineer to lead workload-driven architecture strategy across hardware and software boundaries. You will define how we study, model, and optimize AI/ML workloads for current and next-generation products, drive alignment across HW and SW engineering organizations, and serve as a technical authority on performance and architecture tradeoffs. This is a senior individual contributor role with significant cross-functional scope and organizational influence.
Essential ResponsibilitiesOwn workload characterization and hardware performance analysis for AI/ML systems - selecting representative workloads, defining measurement methodology, building support for MIPS products (e.g., the S8200), and projecting system-level KPIs. Your findings will directly inform SoC architecture decisions, memory subsystem design, and HW/SW co-optimization strategy.
Define the software frameworks across the product portfolio: what metrics matter, how to measure them accurately, how to estimate them pre-silicon, and how to use them to make architectural bets. Leverage open-source infrastructure like MLIR and IREE to implement and validate this work. Set the standard for how the team approaches this and mentor junior engineers in applying it.
Represent software in architectural discussions with hardware teams (CPU, SoC, memory, interconnect) and software teams (compilers, runtimes, ML frameworks). Identify critical bottlenecks - compute throughput, DRAM bandwidth, on-chip memory, data movement latency, or software overhead - and build the case for specific architectural changes or optimization investments.
Present findings and recommendations to senior engineering leadership and product stakeholders. You should be as comfortable writing a one-page architectural recommendation as a detailed technical memo.
Other Responsibilities: - Perform all activities in a safe and responsible manner and support all Environmental, Health, Safety & Security requirements and programs.
Required Qualifications: BS or MS (preferred) in EE, CE, CS, or equivalent, with 5+ years in systems engineering, hardware architecture, ML systems, or performance engineering, and a track record of technical leadership.
Deep expertise in CPU and SoC architecture - memory hierarchies, out-of-order execution, vector/SIMD pipelines, power management - and how these interact with AI/ML workloads. Strong command of system-level memory bandwidth constraints (DDR/LPDDR bandwidth, channel configuration, utilization efficiency) and the ability to reason quantitatively about memory-bound vs. compute-bound workloads.
Experience with AI/ML acceleration on edge devices - NPUs, dedicated inference accelerators, DSP-based pipelines - and the HW/SW co-design challenges involved. Familiarity with model quantization, sparsity, or other efficiency techniques and their hardware interaction is a strong plus.
Familiarity with AI compiler infrastructure: MLIR-based toolchains, IREE, TVM, TFLite, or equivalent. Understanding how graph representations are transformed, tiled, scheduled, and lowered to hardware will improve your ability to identify where compiler strategy and hardware architecture must be co-designed. Prior contributions to such toolchains are a significant differentiator.
Effective cross-functional collaborator who can drive technical consensus without direct authority, writes clearly, and calibrates technical depth for different audiences.
Preferred Qualifications- Prior implementation of CPU hardware features such as vector extensions (AVX, NEON, RVV) or matrix extensions (AMX, SME)
- Experience defining or co-defining SoC architecture requirements from workload analysis
- Contributions to graph lowering in MLIR/IREE or similar compiler infrastructure
- Internal or external publications or contributions to technical standards
- Experience mentoring junior systems engineers
- Knowledge of RISC-V architecture and Vector/Matrix extensions
Other Requirements- English fluency (written and verbal)
- Up to 10% travel
- US work authorization
- 100% in-office (Dallas, Austin, or San Jose)
Expected Salary Range$106,000.00 - $184,000.00
The exact Salary will be determined based on qualifications, experience and location.