Principal Applied Scientist, ML Codesign

Amazon • $228K — $309K *

Sunnyvale, CA 94087 • In-Person

Information Technology

8 - 10 years of experience

Today

Job Overview by Ladders

Qualifications

Master's or PhD in Computer Science, Electrical Engineering, or related field, or equivalent experience.
8+ years of industry experience with first/senior author publications in machine learning systems or computer architecture.
Experience in defining or co-defining operational hardware architecture, including silicon or FPGA.
Deep expertise in low-bit quantization, structured/unstructured pruning, or knowledge distillation.
Working knowledge of computer architecture fundamentals like memory hierarchy and dataflow architectures.

Responsibilities

Define the compression roadmap for next-gen accelerators based on accuracy targets.
Own joint optimization of compression algorithms and hardware design.
Represent applied science in silicon architecture reviews influencing the compute and memory subsystems.
Set and validate the scientific roadmap for compression techniques in new architecture.
Mentor a team of applied scientists in compression and hardware-aware training.
Lead the technical agenda for codesign, accountable to senior leadership.

Benefits

Comprehensive health insurance (medical, dental, and vision).
401(k) matching.
Paid time off and parental leave.
Adoption and surrogacy reimbursement coverage.
Mental health support and flexible spending accounts.

Full Job Description

Define the joint optimization of model compression and silicon architecture for Amazon's next generation of edge and cloud inference accelerators. Your work will set the technical targets that propagate across the model, compiler, runtime, and silicon stack.

We are hiring a Principal Applied Scientist to be the technical leader who closes the loop between compression science and silicon design. Today's generation ships advanced quantization and large-model distillation in production, running multi-billion parameter language models at inference economics typical of much larger systems. Future generations target significantly larger models at the edge and in the cloud. You will be a principal architect of the next-generation accelerator and of the compression algorithms it executes natively. Few roles in the industry let one technical leader influence the model, the compiler, the runtime, and the silicon without organizational friction. This is one of them.

You have spent the last several years thinking about why hardware decisions and accuracy decisions live in different teams, and you want to be the person who owns both. You have published at MLSys, ISCA, MICRO, NeurIPS, or ICML on quantization, pruning, or hardware-aware training, and you want your next paper to ship in a chip rather than in a benchmark suite. You want a vertical stack (model, compression, compiler, runtime, operating system, silicon) where the same engineering organization owns every layer and a principal architect can move all of them.

Key job responsibilities
• Define the hardware-aware compression roadmap for next-generation accelerators, working backward from accuracy targets on standard language and reasoning benchmarks including Massive Multitask Language Understanding (MMLU), GSM8K, HumanEval, and Instruction Following Evaluation (IFEval).
• Own the joint optimization of compression algorithms (post-training quantization, quantization-aware training, knowledge distillation, structured pruning) with the underlying hardware.
• Represent applied science in silicon architecture reviews and influence decisions across the memory and compute subsystems of the accelerator.
• Set the science roadmap for the compression techniques the next architecture must support; validate that compression algorithms achieve target accuracy on the benchmarks our products are evaluated against.
• Mentor a team of senior and mid-level applied scientists working on compression and hardware-aware training.
• Serve as a single-threaded technical leader for the codesign agenda, accountable to senior leadership review.

About the team

Amazon's Devices and Services organization has shipped multiple generations of first-party silicon for consumer devices. The differentiating intellectual property across this portfolio is a custom machine learning processor co-designed with the compression algorithms it runs.

This role sits at the intersection of three teams. The Applied Science team produces compressed model checkpoints. The Silicon Engineering team designs the Application-Specific Integrated Circuits (ASICs). The Compiler and Runtime team lowers compressed models to silicon. You will be the principal architect who closes the loop across all three.

BASIC QUALIFICATIONS
• Master's or PhD in Computer Science, Electrical Engineering, or a related field, or equivalent industry experience.
• Eight or more years of industry experience with a track record of first-author or senior-author publications at top-tier venues in machine learning systems, computer architecture, or efficient machine learning.
• Demonstrated experience defining or co-defining a hardware architecture that shipped, including silicon, Field Programmable Gate Array (FPGA), or large-scale software accelerator.
• Deep expertise in at least two of the following: low-bit quantization, structured and unstructured pruning, knowledge distillation, sparse computation, hardware-aware neural architecture search.
• Working knowledge of computer architecture fundamentals: memory hierarchy, dataflow architectures, on-chip interconnect.

PREFERRED QUALIFICATIONS
• Direct experience contributing to silicon architecture for machine learning inference.
• Published work demonstrating hardware-software codesign, where the compression algorithm and the hardware were optimized jointly rather than sequentially.
• Experience applying compression techniques at large-model scale (tens of billions of parameters).
• Familiarity with Application-Specific Integrated Circuit (ASIC) development flow, Register Transfer Level (RTL) review, or compiler intermediate representations including Multi-Level Intermediate Representation (MLIR) and OpenXLA.
• Experience with Mixture-of-Experts (MoE) inference architectures.
• Track record of mentoring senior applied scientists and shaping a multi-year research agenda.
• Prior experience with vertically integrated stacks where the same team owns model, compiler, runtime, and silicon.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, CA, Sunnyvale - 228,700.00 - 309,400.00 USD annually

About Amazon

Audible is a provider of spoken audio information and entertainment on the Internet. They provide premium spoken audio content, such as audio versions of books and newspapers and radio programs, that is delivered over the Internet and played back on personal computers and hand-held electronic devices. The Audible service allows consumers to purchase and download their content from their Website, store it in digital files and play it back on personal computers and electronic devices. More than 15,000 hours of audio content are available on their Website, including audio versions of books, periodicals and radio programs. Several manufacturers have agreed to support and promote the playback of their content on their hand-held audio-enabled electronic devices.

Amazon Careers

Joining Amazon presents an unparalleled opportunity to become part of a vibrant team pushing the boundaries of innovation and growth in the global marketplace. As a leader in e-commerce, technology, and logistics, Amazon offers a variety of job opportunities that cater to a range of skills and professional interests.

Work You’ll Do

At Amazon, every day is an opportunity to collaborate with the brightest minds in technology and business to redefine what’s possible. Whether you’re interested in software development, marketing, human resources, or customer service, Amazon has a position waiting for you. Transform the way the world shops and innovates with our diverse and inclusive team. Amazon is not just a company; it’s a community where you can drive real change and contribute to projects impacting millions globally.

Lead with Innovation and Leadership

Amazon is the perfect place to enhance your leadership and innovation skills. Our culture encourages pushing the envelope and imagining the unimaginable. Here, you will lead projects that challenge the status quo and define new industry standards. Work with a team that values diversity and is committed to creating an inclusive environment. Our leadership is focused on harnessing the collective power of unique perspectives to foster growth and innovation.

Explore Amazon’s Employment Benefits

Amazon’s commitment to its employees extends beyond just career growth. We offer competitive benefits, including health care, parental leave, and diversity training, ensuring that our team not only excels professionally but also enjoys well-being and security.

Internship and Networking Opportunities

Start your career with an Amazon internship and gain hands-on experience that matters. Our internships provide a gateway to full-time employment and an opportunity to network with professionals across various sectors of the company.

Future-Proof Your Career

With Amazon, your career path is filled with numerous opportunities for advancement. Our learning and development programs are designed to nurture your professional growth and keep you at the forefront of industry trends.

Stay Connected

Join Our Team

Discover the job opportunities at Amazon that match your skills and interests. We are constantly on the lookout for passionate, curious, and innovative team players ready to make a difference.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today, all from the people who work here.

Job Alert Emails

Customize your subscription to receive job alerts, the latest news, and insider tips tailored to your preferences.

Explore the exciting and rewarding career opportunities that await at Amazon. Amazon is more than just a company; it’s a platform for building a promising future. Whether you’re starting or looking to advance your career, Amazon offers the resources, support, and network you need to succeed. Join us, and be a part of our continuing mission to be Earth's most customer-centric company.

Learn more about Amazon

Size

1,608 employees

Market Cap

$832.6 billion

Industry

Retail & Consumer Goods

Net Income

$21.3 billion

Founded

1994

5 Year Trend

+28.1%

Revenue

$386 billion

NASDAQ

AMZN

* Ladders Estimates
