Staff AI Performance Engineer

Graphcore

$120K–$160K *

Austin, TX 78745

In-Person

Information Technology

Less than 5 years of experience

Today

Job Overview by Ladders

Benefits

Cutting-edge technology exposure in AI and ML infrastructures
Opportunity to work in a high-performance team environment
Collaboration across diverse technical domains
Impact on large-scale infrastructure and AI solutions
Focus on innovative approaches to enhance system performance

Full Job Description

Job Summary

Graphcore's AI/ML training and inference infrastructure is rapidly scaling to meet the growing demands of AI workloads across mobile, edge, and datacenter environments. This role focuses on optimizing performance across ARM-based architectures and large-scale distributed systems, ensuring efficiency, scalability, and reliability across the full hardware-software stack.

The Team

The System Engineering Performance team architects and optimizes high-performance infrastructure for large-scale datacenter deployments. The team works across hardware, software, networking, and system architecture to deliver cutting-edge AI solutions and ensure optimal system performance at scale.

Responsibilities and Duties

Analyze ML models' compute and memory requirements using roofline analysis and simulations
Collaborate across hardware and software teams to optimize large-scale AI workloads
Benchmark, monitor, and troubleshoot system performance across distributed systems
Optimize communication stacks including MPI, NCCL, UCX, RDMA, and networking fabrics
Profile and optimize AI workloads, focusing on performance bottlenecks
Develop high-quality, ARM-compatible code and documentation

Candidate Profile

Essential:

BS/MS in Computer Science, Electrical Engineering, or related field
Experience with distributed systems and communication libraries (MPI, NCCL, UCX, libfabric)
Strong programming skills in C++ and Python
Experience profiling and optimizing HPC or AI/ML workloads
Familiarity with ML benchmarks such as MLPerf

Desirable:

Experience with GPUs or accelerated computing architectures
Knowledge of HPC networking and interconnect technologies (InfiniBand, RoCE)
Familiarity with ML frameworks such as PyTorch or TensorFlow
Understanding of ARM architectures and toolchains
Strong debugging, profiling, and performance optimization skills

* Ladders Estimates
