Performance Modeling Engineer

DensityAI

$180K — $250K *
Consumer Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Strong computer architecture fundamentals including memory hierarchy and dataflow.
  • Experience in performance modeling or analysis with real workloads influencing design decisions.
  • Deep knowledge of ML workloads and their hardware mapping, including GEMM and attention mechanisms.
  • Proficiency in Python for building analytical models; C++ knowledge is a plus.
  • 5+ years in performance architecture or analysis for CPUs, GPUs, or accelerators.

Responsibilities

  • Own pre-silicon performance modeling and analysis for architectural decision-making.
  • Translate ML workloads into performance projections across critical hardware components.
  • Drive PPA trade-off analysis, advising architecture and design teams on critical resources.
  • Define and manage performance KPIs and track methodologies from architecture through silicon.
  • Correlate model projections with real data as it becomes available and refine predictive accuracy.

Benefits

  • Equity grant per company guidelines.
  • Medical, dental, and vision benefits.
  • 401(k) plan for retirement savings.
  • Standard paid time off (PTO).
Full Job Description
ITAR Notice: This role involves access to ITAR-controlled information. Applicants must be U.S. persons (U.S. citizens, U.S. permanent residents, asylees, or refugees) per 22 CFR 120.62.

About the role

Own the pre-silicon performance modeling and analysis that sets the architectural targets for our AI accelerator silicon. You'll characterize target ML workloads, build the analytical and roofline models that project performance onto proposed hardware, and turn that analysis into the PPA trade-off guidance the architecture, RTL, and compiler teams design against - well before first silicon.

What you'll do
  • Own pre-silicon performance modeling and analysis - workload characterization, roofline / analytical models, and what-if trade-off studies that guide microarchitecture decisions before RTL is committed
  • Translate target ML workloads (transformer training/inference, attention, GEMM/conv, collectives) into performance projections across compute, memory-bandwidth, and interconnect bottlenecks
  • Drive PPA (performance / power / area) trade-off analysis with architecture, RTL, and software/compiler teams - recommend where to spend area, bandwidth, and power for the most performance
  • Define and own the performance KPIs and the methodology for tracking them from architecture through silicon
  • Correlate model projections against RTL, emulation, and post-silicon data as it arrives, and feed the deltas back into the model to keep it predictive
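For readers unfamiliar with the term, the roofline / analytical modeling named in the list above can be illustrated with a minimal Python sketch. It is not DensityAI's model or methodology: the `Hardware` class, the `gemm_roofline` function, and the peak-compute and bandwidth figures are hypothetical, and the sketch assumes ideal on-chip reuse (each operand read once, the output written once).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hardware:
    """Hypothetical accelerator parameters (illustrative, not a real part)."""
    peak_flops: float     # peak compute throughput, FLOP/s
    mem_bandwidth: float  # off-chip memory bandwidth, bytes/s


def gemm_roofline(m: int, n: int, k: int, bytes_per_elem: int, hw: Hardware) -> dict:
    """Project an (m x k) @ (k x n) GEMM onto a simple two-parameter roofline.

    Assumes each operand is read once and the output is written once,
    so the resulting time is an optimistic lower bound.
    """
    flops = 2 * m * n * k                               # multiply + add per MAC
    traffic = bytes_per_elem * (m * k + k * n + m * n)  # bytes moved to/from memory
    intensity = flops / traffic                         # arithmetic intensity, FLOP/byte
    attainable = min(hw.peak_flops, intensity * hw.mem_bandwidth)
    return {
        "intensity": intensity,
        "attainable_flops": attainable,
        "time_s": flops / attainable,
        "bound": "compute" if attainable >= hw.peak_flops else "memory",
    }


if __name__ == "__main__":
    # Assumed numbers: 100 TFLOP/s peak and 1 TB/s bandwidth (ridge point at 100 FLOP/byte).
    hw = Hardware(peak_flops=100e12, mem_bandwidth=1e12)
    for shape in [(4096, 4096, 4096), (1, 4096, 4096)]:
        print(shape, gemm_roofline(*shape, bytes_per_elem=2, hw=hw))
```

Under these assumed numbers the large square GEMM lands on the compute roof, while the single-row case (typical of batch-1 inference) lands on the bandwidth roof. That kind of split is what a what-if study on area versus bandwidth would start from.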
What we're looking for
  • Strong computer-architecture fundamentals - memory hierarchy, compute/bandwidth roofline, dataflow, on-chip interconnect/NoC, and accelerator/GPU/TPU-class datapaths
  • Demonstrated performance modeling or analysis experience: analytical or simulation-based projection of real workloads onto hardware, where your results drove actual design decisions
  • Deep understanding of how ML workloads map to hardware - GEMM/conv/attention, quantization, parallelism (data/tensor/pipeline), and collective communication
  • Fluency in Python (C++ a plus) for building models, analysis pipelines, and trace/data analysis at scale
  • 5+ years in performance architecture, modeling, or analysis for CPUs, GPUs, accelerators, or complex SoCs
Compensation

Final offers depend on level, location, and skills relevant to the role. Additional compensation: equity grant per company guidelines; medical / dental / vision; 401(k); standard PTO.
Visa Sponsorship

DensityAI sponsors qualified candidates for H-1B, O-1, TN, E-3, and other employment-based visas, and we welcome applicants on F-1 OPT and STEM-OPT. Work authorization is required at start; we provide immigration support to secure or transfer status.

Full compensation packages are based on candidate experience and relevant certifications.

California pay range

$180,000-$250,000 USD
