Senior Staff AI Software Engineer

Samsung • $189K — $301K *

San Jose, CA 95123In-Person

Enterprise Technology

8 - 10 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Bachelor's with 15+ years, Master's with 13+ years, or PhD with 10+ years of industry experience.
Experience in high-performance AI framework software for GPUs or other accelerators.
End-to-end understanding of AI infrastructure and software stack from model definition to serving.
Knowledge of LLM architectures and modern transformer-based designs.
Hands-on expertise with PyTorch and practical use of vLLM for model inference.
Understanding of memory wall issues impacting AI performance.
Familiarity with High Bandwidth Memory (HBM) architecture and memory-centric compute.

Responsibilities

Lead co-design of software and hardware solutions to optimize AI model inference.
Analyze and optimize LLM and agentic AI workloads across the software stack.
Profile model execution to identify memory wall limitations and influence architecture decisions.
Collaborate with hardware teams on memory architecture and compute strategies.
Develop and benchmark inference solutions using frameworks like PyTorch and vLLM.
Define best practices and mentor teams in software-hardware co-design.

Benefits

Comprehensive medical, dental, and vision plans.
401(k) with company match and performance incentives.
4+ weeks of paid time off annually, plus holidays and sick leave.
Support for family needs like fertility or adoption assistance.
Access to emotional wellness apps and therapy sessions.
Onsite gym, café, and virtual fitness classes for a healthy lifestyle.
A flexible work environment to achieve work-life balance.

Full Job Description

Please Note:

To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period.

The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads while minimizing energy consumption and maximizing performance. To achieve this goal, we collaborate closely with both hardware and software engineers to identify and address the unique challenges posed by AI/ML workloads and to explore new computing abstractions that can provide a better balance between the hardware and software components of our systems. Additionally, we continuously conduct research and development in emerging technologies and trends across memory, computing, interconnect, and AI/ML, ensuring that our platforms are always equipped to handle the most demanding workloads of the future. By working together as a dedicated and passionate team, we aim to revolutionize the way AI/ML applications are deployed and executed, ultimately contributing to the advancement of AGI in an affordable and sustainable manner. Join us in our passion to shape the future of computing!

Location: Daily onsite presence at our San Jose, CA office / U.S. headquarters in alignment with our Flexible Work policy.

What You'll Do

Lead the co-design of software and hardware solutions that optimize AI model inference performance, with a focus on overcoming memory bottlenecks.
Analyze and optimize LLM and agentic AI workloads across the full software stack, identifying opportunities for hardware-aware acceleration.
Profile and characterize model execution to expose memory wall limitations and guide architectural decisions for HBM and memory-centric compute.
Collaborate with hardware teams to influence memory architecture, acceleration strategies, and compute placement based on real workload behavior.
Develop, optimize, and benchmark inference and serving solutions using frameworks such as PyTorch and vLLM.
Define best practices and provide technical mentorship across software-hardware co-design efforts.

What You Bring

Bachelor's with 15+ years, or Master's with 13+ years, or PhD's with 10+ years of industry experience.
Strong experience writing high-performance AI framework software development for GPUs or other accelerators.
Strong, end-to-end understanding of the AI infrastructure, AI software stack, from model definition through deployment and serving.
Solid understanding of LLM model architectures and workflows, including modern transformer-based designs.
Solid understanding of agentic AI architecture and workflows.
Hands-on expertise with the PyTorch framework.
Practical experience with vLLM for high-throughput model inference and serving.
Solid understanding of the memory wall problem and its impact on AI system performance.
Strong knowledge of memory architecture, including High Bandwidth Memory (HBM), and familiarity with memory-centric acceleration and compute approaches.
Proficiency working in a Linux development environment.
Solid command of development tooling, including agentic coding, GitHub and Jira.

#LI-VL1

What We OfferThe pay range below is for all roles at this level across all US locations and functions. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. We also offer incentive opportunities that reward employees based on individual and company performance.

This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours.

Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.
Enjoy Time Away You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.
Care for Family Whatever family means to you, we want to support you along the way-including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.
Prioritize Emotional Wellness With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.
Stay Fit Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.
Embrace Flexibility Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.

Base Pay Range

$189,000-$301,000 USD

About Samsung

Samsung is a South Korean multinational conglomerate that specializes in electronics, appliances, and telecommunications equipment. The company was founded in 1938 and is headquartered in Suwon, South Korea. Samsung is one of the largest electronics companies in the world, with operations in over 80 countries. The company's products include smartphones, TVs, home appliances, and semiconductors. Samsung is committed to sustainability and has implemented several initiatives to reduce its environmental impact, such as using renewable energy and reducing waste. The company is also involved in several philanthropic initiatives, such as supporting education and healthcare programs.

Learn more about Samsung

Size

98,557 employees

Industry

Consumer Technology

Founded

1970

NASDAQ

SSNLF

* Ladders Estimates

Similar Jobs

FSO LABS - AI Developer - Senior - Bay Area
$122K — $213K *
Ernst & Young
San Jose, CA 95123 (Santa Clara County)
Today
Staff Research Engineer, Applied AI, DeepMind
$207K — $301K *
Google
Mountain View, CA 94040 (Santa Clara County)
Today
AI Field Engineer - AI Natives
$200K — $260K *
Fireworks AI
San Mateo, CA 94403 (San Mateo County)
Today
Software Engineer III, AI/ML, Display Ads
$147K — $211K *
Google
Mountain View, CA 94040 (Santa Clara County)
Today
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$250K — $286K *
Capital One Financial Corporation
San Francisco, CA 94112 (San Francisco County)
Today
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$250K — $286K *
Capital One Financial Corporation
San Jose, CA 95123 (Santa Clara County)
Today

Get Ready For Your
Next Interview

More Jobs at Samsung

Senior Staff AI Software Engineer
$189K — $301K *
San Jose, CA 95123 (Santa Clara County)
Today
Enterprise Technology
In-Person
Senior Staff Technical Program Manager
$189K — $301K *
San Jose, CA 95123 (Santa Clara County)
1 week ago
Enterprise Technology
In-Person
Senior Manager, Serdes Analog Design
$189K — $301K *
San Jose, CA 95123 (Santa Clara County)
Reposted 1 week ago
Telecommunications & Hardware
In-Person
Senior Engineer, RTL Design
$138K — $206K *
San Jose, CA 95123 (Santa Clara County)
2 weeks ago
Consumer Technology
In-Person
Staff Engineer, RTL Design
$163K — $253K *
San Jose, CA 95123 (Santa Clara County)
2 weeks ago
Telecommunications & Hardware
In-Person

More Enterprise Technology Jobs

Global Content Manager, AI, Gemini Enterprise, Google Cloud
$171K — $248K *
Google
Chicago, IL 60629 (Cook County)
Today
Security Sales Specialist Manager III, Google Cloud
$176K — $245K *
Google
Chicago, IL 60629 (Cook County)
Today
Senior Analytics Engineer
$100K — $150K *
Nscale
Seattle, WA 98115 (King County)
Today
Growth Enterprise Account Executive (NYC)
$100K — $130K *
Sigma Computing
New York, NY 10025 (New York County)
Today
Commercial Account Executive (CAE)
$100K *
Sigma Computing
San Francisco, CA 94112 (San Francisco County)
Today

Find similar Senior Staff AI Software Engineer jobs:

Nationwide San Jose, CA

Senior Staff AI Software Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Staff AI Software Engineer jobs:

Get Ready For Your
Next Interview