Qualcomm

Sr Software Engineer, AI Tools - On-Device Generative AI Model Optimization

Qualcomm$140K — $211K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, or related field with relevant work experience
  • 2+ years in ML systems, model optimization, or inference engineering
  • Proficiency in Python within large, typed codebases
  • Strong communication skills for cross-team collaboration
  • Deep knowledge of generative AI architectures and inference optimization preferred

Responsibilities

  • Reauthor generative AI architectures for efficient hardware execution
  • Translate hardware constraints into model-level transformations
  • Integrate inference acceleration techniques into model preparation
  • Collaborate with research teams to develop OEM-specific deployments
  • Partner with compiler and quantization teams for optimization strategies
  • Contribute to multi-stage model preparation pipeline

Benefits

  • Competitive annual bonus program
  • Opportunity for annual RSU grants
  • Comprehensive benefits package supporting work-life balance
  • Onsite full-time work in San Diego, CA or Raleigh, NC
Full Job Description
Job Area:
Engineering Group, Engineering Group > Machine Learning Engineering

General Summary:

As a Qualcomm Machine Learning Engineer, you will develop and implement cutting-edge tools and solutions to enable state-of-the-art AI solutions across various technology verticals.

This role is open to both San Diego, CA and Raleigh, NC and will be onsite full-time.

Minimum Qualifications:
• Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR
Master's degree in Computer Science, Engineering, Information Systems, or related field and 1+ year of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
OR
PhD in Computer Science, Engineering, Information Systems, or related field.

What You'll Do
Model Reauthoring & Architecture Adaptation
  • Reauthor generative AI architectures for efficient execution on Qualcomm AI hardware. This covers LLMs (Llama, Phi, Qwen) and multimodal models (vision-language, speech, diffusion), including custom attention, normalization, positional embedding, and modality-specific components.
  • Translate hardware execution constraints - operator support, memory layout, dispatch behavior - into model-level transformations. These transformations need to preserve accuracy while enabling efficient on-device execution.
  • Build clean extension points so internal teams and external contributors can onboard new architectures without changing core pipeline code.
Inference Optimization for Edge Hardware
  • Integrate inference acceleration techniques into the model preparation pipeline. This includes memory-efficient attention, decode acceleration, and serving-time optimizations.
  • Translate end-customer deployment constraints - target SoC, context length, latency budget, memory envelope - into concrete model preparation strategies.
Custom Model & OEM Enablement
  • Work with research teams to develop reauthoring strategies for custom OEM models and customer-specific use cases. Take research prototypes and turn them into production deployments.
Cross-Functional Collaboration
  • Partner with compiler teams to understand on-target constraints. Decide on the right response: a graph-level optimization or model-level reauthoring.
  • Partner with quantization engineers so architectural decisions compose cleanly with the quantization stack.
Pipeline & Tooling
  • Contribute reauthoring and adaptation stages to a multi-stage model preparation pipeline. Build developer-facing diagnostics that give clear, actionable feedback when models fail to lower or run efficiently.


Minimum Qualifications
  • Bachelor's degree in Computer Science, Engineering, or related field and 4+ years of Software Engineering, ML Engineering, or related experience
  • OR Master's degree in Computer Science, Engineering, or related field and 3+ years of relevant experience
  • OR PhD in Computer Science, Engineering, or related field and 2+ years of relevant experience
  • 2+ years in ML systems, model optimization, or inference engineering. Proficient in Python in large, typed codebases.
  • Strong written and verbal communication. Comfortable operating across compiler, research, and partner-facing teams.


Preferred Qualifications
  • Deep implementation-level knowledge of generative AI architectures across LLMs and multimodal models
  • Demonstrated experience optimizing inference for edge or resource-constrained deployments, with measurable latency or memory wins to point to.
  • Strong PyTorch internals knowledge - module customization, export flows, tracing. Familiarity with the HuggingFace transformers ecosystem.
  • Familiarity with on-device runtimes and SoC-level constraints (memory bandwidth, compute precision, NPU/DSP execution). Exposure to QAIRT/QNN, ONNXRuntime, LiteRT-LLM or similar is a plus.
  • Working understanding of how quantization interacts with model architecture decisions, even if you're not a quantization specialist.
  • Experience using agentic coding tools such as GitHub Copilot, Cursor, Claude Code, Codeium, or similar AI-assisted development tools to improve coding productivity and problem-solving


Level of Responsibility
  • Works independently on open-ended optimization challenges. Provides technical guidance and mentorship to teammates.
  • Decisions have broad impact on model accuracy, on-device performance, and the developer experience of teams using the preparation pipeline.
  • Communicates complex model architecture and inference optimization concepts to a range of audiences: hardware engineers, research scientists, compiler engineers, OEM partners, and external developers.
  • Has meaningful influence on the generative AI optimization roadmap, supported model strategy, and cross-team integration priorities.


Pay range and Other Compensation & Benefits:
$140,800.00 - $211,200.00

The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer - and you can review more details about our US benefits at this link.

If you would like more information about this role, please contact Qualcomm Careers.

About Qualcomm

Qualcomm Ventures is the investment arm of Qualcomm Incorporated. Founded in 2000, Qualcomm Ventures is a corporate venture capital fund with over 150 active portfolio companies and more than 20 exits over a billion dollars, including 99 Taxis, Cruise Automation, Fitbit, Invensense, NQ Mobile, Waze, and more. As a global investor, Qualcomm Ventures helps connect entrepreneurs to the resources, relationships, and deep industry expertise they need to succeed in the mobile technology ecosystem.

Qualcomm Careers

Joining Qualcomm offers more than just a job opportunity; it's a gateway to a career infused with innovation, leadership, and growth. As a pivotal leader in the world of wireless technology, Qualcomm stands at the forefront of digital communication advancements. Our team of professionals is dedicated to pushing the boundaries of what's possible, making this an ideal time to become part of our global community.

Work You’ll Do

At Qualcomm, you will collaborate with some of the brightest minds in the industry, engaging in work that transforms the way the world connects, computes, and communicates. Our diverse team is driven by a shared passion for creating path-breaking wireless technologies that empower mobile ecosystems worldwide.

Innovate and Grow

Embrace the opportunity to innovate alongside leaders in the field and contribute to projects that have a global impact. Qualcomm is committed to fostering a culture of innovation and continuous improvement, ensuring that every team member has the opportunity to make a significant impact.

Professional Growth and Development

Qualcomm is dedicated to the professional growth of its employees, offering unparalleled benefits, diverse career paths, and extensive training programs that encourage professional and personal development. Whether you're looking for leadership roles or specialized technical positions, Qualcomm provides the resources and support to help you drive your career forward.

Diversity and Inclusion

We believe that a diverse workforce fuels our innovation and reflects our commitment to making a positive impact. Qualcomm’s inclusive culture and diversity training programs are designed to promote an environment where all employees can thrive.

Internship Programs

Start your career with Qualcomm through our dynamic internship programs. These opportunities allow you to apply your skills in real-world scenarios, providing a robust foundation for future employment. Internships at Qualcomm are characterized by meaningful projects and the chance to network with industry leaders.

Join Our Team

Explore the numerous job opportunities at Qualcomm, from engineering to marketing, and discover how your skills and interests align with our mission. We are continuously hiring creative and driven individuals who are ready to contribute to our culture of innovation.

Prepare for Your Interview

Ready to join our team? Prepare your resume to highlight your relevant experience and skills. Our interview process is designed to understand your capabilities and how they align with our goals at Qualcomm. We look for passionate, curious, and innovative team players who are ready to take the next step in their careers.

Stay Connected

Keep up to date with the latest at Qualcomm by following our careers blog. Gain insider perspectives and industry-leading insights that can help you navigate your professional journey.

Career Opportunities Await

At Qualcomm, your career is what you make of it. With support for your ambitions and a network of global professionals, the opportunities to advance and excel are nearly limitless. Join us and be part of a team that’s leading the world in next-generation technology.

Search Qualcomm Jobs

Discover the positions that match your skills and interests. We are looking for individuals who are ready to make an impact and excel in a fast-paced, innovative environment.

Explore Qualcomm Careers

Whether you're seeking an internship, a first job, or a leadership position, Qualcomm offers a range of opportunities across various disciplines. Let your career journey begin here, where innovation, leadership, and growth come together to create extraordinary outcomes.
Learn more about Qualcomm
Size
45,000 employees
Market Cap
$122.5 billion
Industry
Net Income
$6.7 billion
Founded
1985
5 Year Trend
+14.7%
Revenue
$26.6 billion
NASDAQ

Similar Jobs

More Jobs at Qualcomm

More Information Technology Jobs

Find similar Sr Software Engineer, AI Tools - On-Device Generative AI Model Optimization jobs: