Rivian

Sr. Staff ML Engineer, Quantization & Compression

Rivian$265K — $331K *
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Ph.D. or M.S. in Computer Engineering, Electrical Engineering, Computer Science, or related field focused on ML compilers, embedded systems, or hardware-aware AI.
  • Hands-on experience with quantized model deployment and ML design stacks for embedded systems.
  • Strong understanding of computer vision models and optimization for edge inference.
  • Proficiency in deep learning frameworks like PyTorch and TensorFlow, including their low-level IRs.
  • Solid programming skills in C++ and Python; familiarity with accelerator programming models like CUDA/OpenCL.

Responsibilities

  • Research state of the art perception models with ADAS SW teams.
  • Lead optimizations for quantized perception models across hardware platforms.
  • Design hardware-aware optimizations such as quantization strategies and model compression.
  • Collaborate with hardware teams to optimize model architecture under real-time constraints.
  • Benchmark and analyze system performance to enhance deployment efficiency.
  • Align model optimization with hardware roadmap based on real-world autonomy needs.

Benefits

  • Comprehensive insurance benefits including life, medical, dental, and vision coverage.
  • Paid vacation and paid sick leave for all employees.
  • 401(k) plan participation and an Employee Stock Purchase Program.
  • Full-time employee coverage effective on the first day of employment.
Full Job Description
Role Summary

We are looking for an Engineer / Research Scientist with deep expertise in quantized deep learning models for hardware acceleration in autonomous systems. In this cross-disciplinary role, you will bridge perception model design and hardware-aware deployment, enabling efficient execution of high-performance perception algorithms across embedded compute platforms. You will focus on researching state of the art perception models and develop optimization pipelines for the quantized versions of these models customized to provide real-time performance and energy efficiency on next-generation autonomy hardware.

Responsibilities

  • Research state of the art perception models in collaboration with the ADAS SW teams
  • Lead the development of optimizations for mapping quantized perception models (e.g., CNNs, Transformers, LLMs) to embedded and heterogeneous hardware platforms.
  • Design and implement hardware-aware optimizations, including quantization strategies, model compression, memory-efficient representations, and operator fusion, targeted to custom accelerators.
  • Collaborate with hardware teams to co-optimize model architecture and compute pipeline under real-time constraints (latency, throughput, power).
  • Benchmark and analyze system performance across platforms and iterate to achieve optimal deployment efficiency.
  • Partner with perception, systems, and autonomy teams to align model optimization efforts with hardware roadmap and real-world autonomy requirements.

Qualifications

  • Ph.D. or M.S. in Computer Engineering, Electrical Engineering, Computer Science, or related field with a focus on ML compilers, embedded systems, or hardware-aware AI.
  • Hands-on experience with quantized model deployment, ML design stacks, and code generation for embedded or heterogeneous compute systems.
  • Strong understanding of computer vision models (e.g., object detection, segmentation) and their optimization for edge inference.
  • Proficiency in deep learning frameworks (e.g., PyTorch, TensorFlow) and their low-level IRs or export formats (e.g., ONNX).
  • Solid programming skills in C++, Python • Familiarity with CUDA/OpenCL (or other accelerator programming models).

Preferred Qualifications:
  • Prior experience working with hardware-software co-design, especially for autonomous or robotics platforms.
  • Deep knowledge of numerical precision trade-offs, quantization-aware training (QAT), and dynamic/static quantization flows
  • Familiarity with embedded real-time constraints and hardware profiling/debugging tools.
  • Familiarity with rearchitecting models to best suit hardware capabilities.
  • Publication record in top-tier ML/Systems conferences (e.g., MLSys, NeurIPS, DAC, ICCAD).

Pay Disclosure

Salary Range: The salary range for this role is $265,000 - $331,000. for San Francisco Bay Area based applicants. This is the lowest to highest salary we in good faith believe we would pay for this role at the time of this posting. An employee's position within the salary range will be based on several factors including, but not limited to, specific competencies, relevant education, qualifications, certifications, experience, skills, geographic location, shift, and organizational needs.

The successful candidate may be eligible for annual performance bonus and equity awards.

Benefits Summary: Rivian offers a comprehensive package of benefits for full-time and part-time employees, their spouse or domestic partner, and children up to age 26, including but not limited to paid vacation, paid sick leave, and a competitive portfolio of insurance benefits including life, medical, dental, vision, short-term disability insurance, and long-term disability insurance to eligible employees. You may also have the opportunity to participate in Rivian's 401(k) Plan and Employee Stock Purchase Program if you meet certain eligibility requirements. Full-time employee coverage is effective on their first day of employment. Part-time employee coverage is effective the first of the month following 90 days of employment. More information about benefits is available at rivianbenefits.com.

You can apply for this role through careers.rivian.com (or through internal-careers-rivian.icims.com if you are a current employee). This job is not expected to be closed any sooner than 6/30/2026.

About Rivian

Rivian is an American automaker and automotive technology company. Founded in 2009, the company develops vehicles, products and services related to sustainable transportation. Rivian has raised over $10.5 billion since 2019, with investments from Amazon, Ford, and Cox Automotive. The company's first two vehicles, the R1T and R1S, are electric vehicles that are expected to be released in 2021. Rivian has also announced plans to produce electric delivery vans for Amazon. The company has received praise for its focus on sustainability and its commitment to using recycled materials in its vehicles.
Learn more about Rivian
Size
10,000 employees
Market Cap
$16.8 billion
Industry
Founded
2009
NASDAQ

Similar Jobs

More Jobs at Rivian

More Consumer Technology Jobs

Find similar Sr. Staff ML Engineer, Quantization & Compression jobs: