PyTorch Engineer

LTM

• $120K — $150K *

Bellevue, WA 98006In-Person

Technical Services

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Proficiency in PyTorch and Triton with practical experience in developing stress workloads.
Strong understanding of computational memory management, DMA, and execution patterns.
Experience with performance analysis and optimization of both simulator and real hardware.
Ability to design scalable test harnesses for various workloads and configurations.
Familiarity with cross-functional collaboration in a technical environment.

Responsibilities

Design and implement high-intensity stress workloads using PyTorch and Triton.
Identify and troubleshoot performance issues and system bottlenecks in simulator and real setups.
Develop complex PyTorch workloads that push model-level execution limits.
Create custom Triton kernels to assess hardware performance under stress.
Document and streamline processes for integrating workloads into CI and monitoring tools.
Maintain and update a library of reusable PyTorch stress workloads.
Collaborate with firmware and SDK teams to address risk areas and refine stress tests.

Benefits

Opportunity to work on cutting-edge machine learning and hardware integration projects.
Collaborative work environment with cross-functional teams.
Access to advanced tools for performance testing and optimization.
Possibility for innovation in stress testing methodologies and changing the tech landscape.

Full Job Description

Role description

Design and implement highintensity stress workloads using PyTorch and Triton Exercise core MAIA execution paths including compute memory DMA and collectives
Enable early detection of performance cliffs stability issues and system bottlenecks across simulator and real hardware Improve platform maturity reduce latestage escapes and increase confidence for broader internal and external adoption
Develop PyTorch workloads stressing modellevel execution such as large GEMMs attention patterns MoElike behavior mixed precision and longrunning loops
Author custom Triton kernels to stress hardware execution units memory hierarchies and synchronization paths
Build parameterized stress harnesses scalable by problem size number of devices and runtime duration Integrate workloads with existing profiling monitoring and failure triage tooling
Collaborate with platform firmware and SDK teams to target known risk areas and emerging issues
Document usage patterns and provide reproducible scripts for lab and continuous integration CI usage
Develop and maintain a library of reusable PyTorch stress workloads
Create Tritonbased micro and macrokernels designed specifically for stress and saturation testing
Build and support test harnesses and scripts for singledevice and multidevice execution
Ensure workload designs align with platform risk areas and emerging hardwaresoftware issues
Collaborate crossfunctionally with platform firmware and SDK teams to refine stress tests
Provide comprehensive documentation describing workload intent configuration options and expected stress characteristics Support profiling monitoring and failure triage by integrating stress workloads with existing tools
Deliver reproducible and scalable testing solutions for lab and CI environments

* Ladders Estimates

Similar Jobs

AIML - Sr Software Engineer in Test, Evaluation
$120K — $150K *
Apple
Seattle, WA 98115 (King County)
Today
Sr. Software Development Engineer - Agentic & Semantic System
$140K — $210K *
Workday
Vancouver, BC V5K 5J9
Today
Senior Software Engineer - Data Platform
$130K — $220K *
Samsara
Remote
Today
Senior Software Developer
$100K — $130K *
Optio Incentives
Remote
Reposted Today
Senior Software Engineer
$114K — $203K *
Microsoft
Vancouver, BC V5K 5J9
Reposted Today
Senior Software Engineer, Apple Intelligence Data Platform - Proactive
$130K — $180K *
Apple
Seattle, WA 98115 (King County)
Today

Get Ready For Your
Next Interview

More Jobs at LTM

Oracle Hyperion EPBCS Solution Architect
$120K — $150K *
Plano, TX 75025 (Collin County)
Reposted Today
Enterprise Technology
In-Person
ServiceNow ITOM SME
$90K — $120K *
Markham, ON L3R 0G6
Today
Information Technology
In-Person
ServiceNow Technical Architect
$100K — $130K *
Markham, ON L3R 0G6
Today
Information Technology
In-Person
Network Engineer
$90K — $130K *
Plano, TX 75025 (Collin County)
Today
Telecommunications & Hardware
In-Person
Specialist - Data Engineering
$90K — $120K *
Tampa, FL 33647 (Hillsborough County)
Today
Information Technology
In-Person

More Technical Services Jobs

Engineering Manager - Lifecycle Services
$90K — $145K *
Emerson Group
Chicago, IL 60629 (Cook County)
Today
Hybrid: Commissioning Agent
$100K — $110K *
Planate Management Group
Tampa, FL 33647 (Hillsborough County)
Today
Project Operations Field Implementation Manager- Calgary, Alberta
$75K — $95K *
Diversified Communications
Calgary, AB T1Y 7M8
Today
Solutions Engineer III
$90K — $120K *
Mujin Corp
Suwanee, GA 30024 (Gwinnett County)
Today
QNX- Senior Services Sales Manager
$130K — $182K *
Blackberry
Boston, MA 02115 (Suffolk County)
Reposted Today

Find similar PyTorch Engineer jobs:

Nationwide Bellevue, WA

PyTorch Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar PyTorch Engineer jobs:

Get Ready For Your
Next Interview