Sr. Staff ML Engineer, Quantization & Compression

Rivian • $265K — $331K *

Palo Alto, CA 94303In-Person

Consumer Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Ph.D. or M.S. in Computer Engineering, Electrical Engineering, Computer Science, or related field focused on ML compilers, embedded systems, or hardware-aware AI.
Hands-on experience with quantized model deployment and ML design stacks for embedded systems.
Strong understanding of computer vision models and optimization for edge inference.
Proficiency in deep learning frameworks like PyTorch and TensorFlow, including their low-level IRs.
Solid programming skills in C++ and Python; familiarity with accelerator programming models like CUDA/OpenCL.

Responsibilities

Research state of the art perception models with ADAS SW teams.
Lead optimizations for quantized perception models across hardware platforms.
Design hardware-aware optimizations such as quantization strategies and model compression.
Collaborate with hardware teams to optimize model architecture under real-time constraints.
Benchmark and analyze system performance to enhance deployment efficiency.
Align model optimization with hardware roadmap based on real-world autonomy needs.

Benefits

Comprehensive insurance benefits including life, medical, dental, and vision coverage.
Paid vacation and paid sick leave for all employees.
401(k) plan participation and an Employee Stock Purchase Program.
Full-time employee coverage effective on the first day of employment.

Full Job Description

Role Summary

We are looking for an Engineer / Research Scientist with deep expertise in quantized deep learning models for hardware acceleration in autonomous systems. In this cross-disciplinary role, you will bridge perception model design and hardware-aware deployment, enabling efficient execution of high-performance perception algorithms across embedded compute platforms. You will focus on researching state of the art perception models and develop optimization pipelines for the quantized versions of these models customized to provide real-time performance and energy efficiency on next-generation autonomy hardware.

Responsibilities

Research state of the art perception models in collaboration with the ADAS SW teams
Lead the development of optimizations for mapping quantized perception models (e.g., CNNs, Transformers, LLMs) to embedded and heterogeneous hardware platforms.
Design and implement hardware-aware optimizations, including quantization strategies, model compression, memory-efficient representations, and operator fusion, targeted to custom accelerators.
Collaborate with hardware teams to co-optimize model architecture and compute pipeline under real-time constraints (latency, throughput, power).
Benchmark and analyze system performance across platforms and iterate to achieve optimal deployment efficiency.
Partner with perception, systems, and autonomy teams to align model optimization efforts with hardware roadmap and real-world autonomy requirements.

Qualifications

Ph.D. or M.S. in Computer Engineering, Electrical Engineering, Computer Science, or related field with a focus on ML compilers, embedded systems, or hardware-aware AI.
Hands-on experience with quantized model deployment, ML design stacks, and code generation for embedded or heterogeneous compute systems.
Strong understanding of computer vision models (e.g., object detection, segmentation) and their optimization for edge inference.
Proficiency in deep learning frameworks (e.g., PyTorch, TensorFlow) and their low-level IRs or export formats (e.g., ONNX).
Solid programming skills in C++, Python • Familiarity with CUDA/OpenCL (or other accelerator programming models).

Preferred Qualifications:

Prior experience working with hardware-software co-design, especially for autonomous or robotics platforms.
Deep knowledge of numerical precision trade-offs, quantization-aware training (QAT), and dynamic/static quantization flows
Familiarity with embedded real-time constraints and hardware profiling/debugging tools.
Familiarity with rearchitecting models to best suit hardware capabilities.
Publication record in top-tier ML/Systems conferences (e.g., MLSys, NeurIPS, DAC, ICCAD).

Pay Disclosure

Salary Range: The salary range for this role is $265,000 - $331,000. for San Francisco Bay Area based applicants. This is the lowest to highest salary we in good faith believe we would pay for this role at the time of this posting. An employee's position within the salary range will be based on several factors including, but not limited to, specific competencies, relevant education, qualifications, certifications, experience, skills, geographic location, shift, and organizational needs.

The successful candidate may be eligible for annual performance bonus and equity awards.

Benefits Summary: Rivian offers a comprehensive package of benefits for full-time and part-time employees, their spouse or domestic partner, and children up to age 26, including but not limited to paid vacation, paid sick leave, and a competitive portfolio of insurance benefits including life, medical, dental, vision, short-term disability insurance, and long-term disability insurance to eligible employees. You may also have the opportunity to participate in Rivian's 401(k) Plan and Employee Stock Purchase Program if you meet certain eligibility requirements. Full-time employee coverage is effective on their first day of employment. Part-time employee coverage is effective the first of the month following 90 days of employment. More information about benefits is available at rivianbenefits.com.

You can apply for this role through careers.rivian.com (or through internal-careers-rivian.icims.com if you are a current employee). This job is not expected to be closed any sooner than 6/30/2026.

About Rivian

Rivian is an American automaker and automotive technology company. Founded in 2009, the company develops vehicles, products and services related to sustainable transportation. Rivian has raised over $10.5 billion since 2019, with investments from Amazon, Ford, and Cox Automotive. The company's first two vehicles, the R1T and R1S, are electric vehicles that are expected to be released in 2021. Rivian has also announced plans to produce electric delivery vans for Amazon. The company has received praise for its focus on sustainability and its commitment to using recycled materials in its vehicles.

Learn more about Rivian

Size

10,000 employees

Market Cap

$16.8 billion

Industry

Manufacturing & Automotive

Founded

2009

NASDAQ

RIVN

* Ladders Estimates

Similar Jobs

Physical Design Engineer
$230K — $280K *
Cerebras Systems
Sunnyvale, CA 94087 (Santa Clara County)
Today
Sr. Staff ML Engineer, Hardware Software Co-Design
$265K — $331K *
Rivian
Palo Alto, CA 94303 (Santa Clara County)
Today
Sr Staff Engineer, Device Drivers (San Diego or Boulder, CO)
$162K — $271K *
Qualcomm
Santa Clara, CA 95051 (Santa Clara County)
Today
Senior SoC Architect
$170K — $315K *
Intel
Santa Clara, CA 95051 (Santa Clara County)
Today
Hardware Engineer 3
$135K — $275K *
Hewlett Packard Enterprise Development LP
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Yesterday
Principal Physical Design Engineer
$174K — $352K *
Hewlett Packard Enterprise Development LP
Sunnyvale, CA 94087 (Santa Clara County)
Yesterday

Get Ready For Your
Next Interview

More Jobs at Rivian

Staff Machine Learning Compiler Engineer
$206K — $258K *
Palo Alto, CA 94303 (Santa Clara County)
Today
Technical Services
In-Person
Sr. Staff ML Engineer, Hardware Software Co-Design
$265K — $331K *
Palo Alto, CA 94303 (Santa Clara County)
Today
Telecommunications & Hardware
In-Person
Sr. Cell Engineering Technician
$77K — $86K *
Irvine, CA 92620 (Orange County)
Today
Manufacturing & Automotive
In-Person
Sr. Staff ML Engineer, Quantization & Compression
$265K — $331K *
Palo Alto, CA 94303 (Santa Clara County)
Today
Consumer Technology
In-Person
Autonomy Technical Program Manager
$116K — $145K *
Palo Alto, CA 94303 (Santa Clara County)
Yesterday
Manufacturing & Automotive
In-Person

More Consumer Technology Jobs

AI-First Product Manager
$90K — $120K *
Team Centro
Montreal, QC H1A 0A1
Reposted Today
Senior Copywriter
$81K — $129K *
Starcom Mediavest Group Germany Gmbh
San Francisco, CA 94112 (San Francisco County)
Today
Sr. Technical Program Manager, Core Engineering, Core Engineering
$171K — $231K *
Amazon
Los Gatos, CA 95032 (Santa Clara County)
Reposted Today
Principal Product Manager, AI Feed Relevance
$185K — $299K *
LinkedIn
Mountain View, CA 94040 (Santa Clara County)
Today
Product Operations Manager
$121K — $149K *
LVMH
Remote
Today

Find similar Sr. Staff ML Engineer, Quantization & Compression jobs:

Nationwide Palo Alto, CA

Sr. Staff ML Engineer, Quantization & Compression

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Sr. Staff ML Engineer, Quantization & Compression jobs:

Get Ready For Your
Next Interview