Intel

Inference Optimization Engineer (local / edge runtime)

Intel | $170K - $315K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • BS/MS in Computer Science, Electrical Engineering, Mathematics or related STEM field
  • 5+ years of software development experience
  • Proficient in C++ and/or Python, with systems-level code understanding
  • Knowledgeable in LLM inference, including attention mechanisms and KV cache
  • Experience in profiling and optimizing performance on either CPU or GPU
  • Skilled in Linux, build systems, and low-level debugging

Responsibilities

  • Profile and optimize inference for latency, throughput, and memory on edge hardware
  • Tune KV cache and scheduling for efficient interactive agent workloads
  • Drive quantization strategies and assess quality impacts with Post-Training team
  • Reduce CPU overhead and enhance engine startup and model lifecycle management
  • Benchmark performance across hardware tiers and publish results
  • Contribute fixes and patches to open-source inference engines

Benefits

  • Comprehensive health benefits
  • Retirement plans
  • Generous vacation policy
  • Stock bonuses
  • Career development opportunities

Full Job Description
Job Details:

Job Description:
Role Summary

Make models fast on the hardware people actually own. You optimize inference engines (llama.cpp, vLLM) for constrained local and edge environments: GPUs and iGPUs with Vulkan backends on PCs and edge devices, not datacenter H100 clusters. KV cache, batching, quantization, scheduling, and CPU-overhead reduction are your daily tools.

This is the rare skill that makes a hybrid, low-cost agent product viable.

What you'll do
  • Profile and optimize local inference (llama.cpp-vulkan and vLLM) for latency, throughput, and memory on edge hardware
  • Tune KV cache, continuous batching, and scheduling for interactive agent workloads
  • Drive quantization strategy (GGUF / AWQ / GPTQ) and validate quality impact with the Post-Training team
  • Cut CPU overhead and improve engine startup, model load, and lifecycle (start / stop / health)
  • Benchmark across hardware tiers and publish honest performance comparisons
  • Upstream fixes and patches to open-source engines where it helps us


What you'll learn / grow into

Curiosity is required. You will develop:
  • Knowledge of the internals of modern inference engines and where the milliseconds actually go
  • Skill in hardware-aware optimization across iGPU / CPU paths (Vulkan, SYCL, oneAPI, CUDA where relevant)
  • A feel for the quality-vs-speed-vs-memory trade space for small models
  • A deeper interest in local / edge AI and squeezing the most out of hardware


Qualifications:

Minimum qualifications are required to be initially considered for this position. Preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates.

Required Qualifications
  • BS/MS in CS, EE, Math or related STEM field
  • 5+ years of software development experience
  • Strong in C++ and/or Python; comfortable reading systems-level code
  • Understands how LLM inference works (attention, KV cache, decoding)
  • Has profiled and optimized real performance problems (CPU or GPU) and can prove the speedup
  • Linux, build systems, and low-level debugging expertise

Preferred Qualifications
  • Hands-on with llama.cpp, vLLM, ggml, or similar engines
  • Experience with GPU / accelerator programming (Vulkan, CUDA, SYCL, Metal) or SIMD / CPU kernels
  • Familiarity with quantization formats and their quality trade-offs
  • Open-source contributions to inference engines


Requirements listed would be obtained through a combination of industry-relevant job experience, internship experience, and/or schoolwork, classes, or research.

Benefits at Intel

Our total rewards package goes above and beyond just a paycheck. Whether you're looking to build your career, improve your health, or protect your wealth, we offer generous benefits to help you achieve your goals. Go to Intel Benefits | Intel Careers for details of benefits available to you. Intel reserves the right to modify, change or discontinue benefit plans at any time in its sole discretion.

Job Type:

Shift:
Shift 1 (United States of America)

Primary Location:
US, California, Santa Clara

Additional Locations:
US, Arizona, Phoenix; US, California, Folsom; US, Oregon, Hillsboro

Business group:
The Client Computing Group (CCG) is responsible for driving business strategy and product development for Intel's PC products and platforms, spanning form factors such as notebooks, desktops, 2-in-1s, and all-in-ones. Working with our partners across the industry, we intend to deliver purposeful computing experiences that unlock people's potential, allowing each person to use our products to focus, create, and connect in ways that matter most to them.

Position of Trust
N/A

Benefits

We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock bonuses, and benefit programs which include health, retirement, and vacation. Find out more about the benefits of working at Intel.

Annual Salary Range for jobs which could be performed in the US: $170,500.00 - $315,490.00 USD

The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific compensation range for your preferred location during the hiring process.

Work Model for this Role
This role will be eligible for our hybrid work model, which allows employees to split their time between working on-site at their assigned Intel site and off-site.

* Job posting details (such as work model, location or time type) are subject to change.

ADDITIONAL INFORMATION: Intel is committed to Responsible Business Alliance (RBA) compliance and ethical hiring practices. We do not charge any fees during our hiring process. Candidates should never be required to pay recruitment fees, medical examination fees, or any other charges as a condition of employment. If you are asked to pay any fees during our hiring process, please report this immediately to your recruiter.

About Intel

Intel Careers

Join Intel's dynamic team today and be part of a company that redefines the boundaries of technology and innovation. Intel offers a plethora of job opportunities that pave the way for professional growth and personal achievement. As a leader in the tech industry, Intel is the perfect place to advance your career, whether you're a seasoned professional or just starting out.

Work You’ll Do

At Intel, you will collaborate with some of the brightest minds in the industry, working together to solve complex challenges and push the limits of what's possible. Our culture of innovation fosters diversity and encourages you to bring your unique perspectives to the table. Intel is not just about hardware and software; it's about empowering people to achieve more. Join Intel’s market-leading team to help drive technological advancements in numerous fields. From semiconductor engineering to AI development, your work at Intel will have a profound impact on the world’s technological landscape.

Lead with Innovation

Intel stands at the intersection of technology leadership and industry innovation. Here, you will have the opportunity to lead projects that set industry standards and redefine how technology enhances our lives. Intel’s commitment to leadership development ensures that every team member is equipped with the skills needed to excel.

Experience the Power of Networking and Professional Growth

Intel’s global scale offers unmatched opportunities for networking and professional development. Engage with experts across different fields and participate in programs designed to hone your leadership skills and expand your professional knowledge. Intel’s dedication to career growth is evident in our robust training programs and our commitment to promoting from within.

Internship and Employment Opportunities

Start your career journey with an internship at Intel, where you can apply your academic knowledge in a real-world setting. Our internships provide a foundation for successful careers by allowing you to work on meaningful projects and gain valuable industry experience. Intel is hiring! Explore open positions that match your skills and interests. We look for passionate, curious, creative, and solution-driven team players. Whether you’re applying for an internship or a full-time position, preparing your resume and getting ready for the interview process at Intel is an exciting step towards a promising future.

Benefits and Culture

Intel is committed to fostering a diverse and inclusive workplace. We offer competitive benefits packages that promote the well-being of our employees and their families. From health and wellness to financial and education benefits, Intel ensures that our team members have what they need to succeed.

Stay Connected

Join Our Team: Search for job opportunities that align with your career aspirations. Intel’s diverse range of employment options means there’s a place for every skill set and ambition.

Keep Up to Date: Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today, all from the people who work at Intel.

Job Alert Emails: Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences.

Discover the exciting and rewarding career opportunities that await at Intel.

Learn more about Intel

Size: 121,100 employees
Market Cap: $106.5 billion
Industry:
Net Income: $20.8 billion
Founded: 1968
5 Year Trend: +5.9%
Revenue: $77.8 billion
NASDAQ:
