Sr Software Engineer, AI Tools - On-Device Generative AI Model Optimization

Qualcomm • $140K — $211K *

San Diego, CA 92154 • In-Person

Information Technology

Less than 5 years of experience

1 month ago

Job Overview by Ladders

Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, or related field with relevant work experience
2+ years in ML systems, model optimization, or inference engineering
Proficiency in Python within large, typed codebases
Strong communication skills for cross-team collaboration
Deep knowledge of generative AI architectures and inference optimization preferred

Responsibilities

Reauthor generative AI architectures for efficient hardware execution
Translate hardware constraints into model-level transformations
Integrate inference acceleration techniques into model preparation
Collaborate with research teams to develop OEM-specific deployments
Partner with compiler and quantization teams for optimization strategies
Contribute to multi-stage model preparation pipeline

Benefits

Competitive annual bonus program
Opportunity for annual RSU grants
Comprehensive benefits package supporting work-life balance
Onsite full-time work in San Diego, CA or Raleigh, NC

Full Job Description

Job Area:
Engineering Group > Machine Learning Engineering

General Summary:

As a Qualcomm Machine Learning Engineer, you will develop and implement cutting-edge tools and solutions to enable state-of-the-art AI solutions across various technology verticals.

This role is open to both San Diego, CA and Raleigh, NC and will be onsite full-time.

Minimum Qualifications:
Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR
Master's degree in Computer Science, Engineering, Information Systems, or related field and 1+ year of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR
PhD in Computer Science, Engineering, Information Systems, or related field.

What You'll Do
Model Reauthoring & Architecture Adaptation

Reauthor generative AI architectures for efficient execution on Qualcomm AI hardware. This covers LLMs (Llama, Phi, Qwen) and multimodal models (vision-language, speech, diffusion), including custom attention, normalization, positional embedding, and modality-specific components.
Translate hardware execution constraints - operator support, memory layout, dispatch behavior - into model-level transformations. These transformations need to preserve accuracy while enabling efficient on-device execution.
Build clean extension points so internal teams and external contributors can onboard new architectures without changing core pipeline code.

Inference Optimization for Edge Hardware

Integrate inference acceleration techniques into the model preparation pipeline. This includes memory-efficient attention, decode acceleration, and serving-time optimizations.
Translate end-customer deployment constraints - target SoC, context length, latency budget, memory envelope - into concrete model preparation strategies.

Custom Model & OEM Enablement

Work with research teams to develop reauthoring strategies for custom OEM models and customer-specific use cases. Take research prototypes and turn them into production deployments.

Cross-Functional Collaboration

Partner with compiler teams to understand on-target constraints. Decide on the right response: a graph-level optimization or model-level reauthoring.
Partner with quantization engineers so architectural decisions compose cleanly with the quantization stack.

Pipeline & Tooling

Contribute reauthoring and adaptation stages to a multi-stage model preparation pipeline. Build developer-facing diagnostics that give clear, actionable feedback when models fail to lower or run efficiently.

Minimum Qualifications

Bachelor's degree in Computer Science, Engineering, or related field and 4+ years of Software Engineering, ML Engineering, or related experience
OR Master's degree in Computer Science, Engineering, or related field and 3+ years of relevant experience
OR PhD in Computer Science, Engineering, or related field and 2+ years of relevant experience
2+ years in ML systems, model optimization, or inference engineering. Proficient in Python in large, typed codebases.
Strong written and verbal communication. Comfortable operating across compiler, research, and partner-facing teams.

Preferred Qualifications

Deep implementation-level knowledge of generative AI architectures across LLMs and multimodal models
Demonstrated experience optimizing inference for edge or resource-constrained deployments, with measurable latency or memory wins to point to.
Strong PyTorch internals knowledge - module customization, export flows, tracing. Familiarity with the HuggingFace transformers ecosystem.
Familiarity with on-device runtimes and SoC-level constraints (memory bandwidth, compute precision, NPU/DSP execution). Exposure to QAIRT/QNN, ONNXRuntime, LiteRT-LLM or similar is a plus.
Working understanding of how quantization interacts with model architecture decisions, even if you're not a quantization specialist.
Experience using agentic coding tools such as GitHub Copilot, Cursor, Claude Code, Codeium, or similar AI-assisted development tools to improve coding productivity and problem-solving

Level of Responsibility

Works independently on open-ended optimization challenges. Provides technical guidance and mentorship to teammates.
Decisions have broad impact on model accuracy, on-device performance, and the developer experience of teams using the preparation pipeline.
Communicates complex model architecture and inference optimization concepts to a range of audiences: hardware engineers, research scientists, compiler engineers, OEM partners, and external developers.
Has meaningful influence on the generative AI optimization roadmap, supported model strategy, and cross-team integration priorities.

Pay range and Other Compensation & Benefits:
$140,800.00 - $211,200.00

The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer - and you can review more details about our US benefits at this link.

If you would like more information about this role, please contact Qualcomm Careers.

About Qualcomm

Qualcomm Ventures is the investment arm of Qualcomm Incorporated. Founded in 2000, Qualcomm Ventures is a corporate venture capital fund with over 150 active portfolio companies and more than 20 exits over a billion dollars, including 99 Taxis, Cruise Automation, Fitbit, Invensense, NQ Mobile, Waze, and more. As a global investor, Qualcomm Ventures helps connect entrepreneurs to the resources, relationships, and deep industry expertise they need to succeed in the mobile technology ecosystem.

Qualcomm Careers

Joining Qualcomm offers more than just a job opportunity; it's a gateway to a career infused with innovation, leadership, and growth. As a pivotal leader in the world of wireless technology, Qualcomm stands at the forefront of digital communication advancements. Our team of professionals is dedicated to pushing the boundaries of what's possible, making this an ideal time to become part of our global community.

Work You’ll Do

At Qualcomm, you will collaborate with some of the brightest minds in the industry, engaging in work that transforms the way the world connects, computes, and communicates. Our diverse team is driven by a shared passion for creating path-breaking wireless technologies that empower mobile ecosystems worldwide.

Innovate and Grow

Embrace the opportunity to innovate alongside leaders in the field and contribute to projects that have a global impact. Qualcomm is committed to fostering a culture of innovation and continuous improvement, ensuring that every team member has the opportunity to make a significant impact.

Professional Growth and Development

Qualcomm is dedicated to the professional growth of its employees, offering unparalleled benefits, diverse career paths, and extensive training programs that encourage professional and personal development. Whether you're looking for leadership roles or specialized technical positions, Qualcomm provides the resources and support to help you drive your career forward.

Diversity and Inclusion

We believe that a diverse workforce fuels our innovation and reflects our commitment to making a positive impact. Qualcomm’s inclusive culture and diversity training programs are designed to promote an environment where all employees can thrive.

Internship Programs

Start your career with Qualcomm through our dynamic internship programs. These opportunities allow you to apply your skills in real-world scenarios, providing a robust foundation for future employment. Internships at Qualcomm are characterized by meaningful projects and the chance to network with industry leaders.

Join Our Team

Explore the numerous job opportunities at Qualcomm, from engineering to marketing, and discover how your skills and interests align with our mission. We are continuously hiring creative and driven individuals who are ready to contribute to our culture of innovation.

Prepare for Your Interview

Ready to join our team? Prepare your resume to highlight your relevant experience and skills. Our interview process is designed to understand your capabilities and how they align with our goals at Qualcomm. We look for passionate, curious, and innovative team players who are ready to take the next step in their careers.

Stay Connected

Keep up to date with the latest at Qualcomm by following our careers blog. Gain insider perspectives and industry-leading insights that can help you navigate your professional journey.

Career Opportunities Await

At Qualcomm, your career is what you make of it. With support for your ambitions and a network of global professionals, the opportunities to advance and excel are nearly limitless. Join us and be part of a team that’s leading the world in next-generation technology.

Search Qualcomm Jobs

Discover the positions that match your skills and interests. We are looking for individuals who are ready to make an impact and excel in a fast-paced, innovative environment.

Explore Qualcomm Careers

Whether you're seeking an internship, a first job, or a leadership position, Qualcomm offers a range of opportunities across various disciplines. Let your career journey begin here, where innovation, leadership, and growth come together to create extraordinary outcomes.

Learn more about Qualcomm

Size: 45,000 employees
Market Cap: $122.5 billion
Industry: Manufacturing & Automotive
Net Income: $6.7 billion
Founded: 1985
5 Year Trend: +14.7%
Revenue: $26.6 billion
NASDAQ: QCOM

* Ladders Estimates
