Systems Engineer, HPC (US & Canada)

Mistral AI

• $90K — $130K *

Toronto, ON M3C 0E3In-Person

Information Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5-7 years of experience in Linux systems administration
Experience in large-scale HPC clusters or cloud infrastructure
Familiarity with job schedulers such as Slurm
Strong troubleshooting skills across systems, hardware, and networks
Proficient in automation tools like Ansible or Terraform

Responsibilities

Operate and maintain large-scale Linux environments including bare metal and cloud
Monitor system health and ensure high availability
Support various production and research workloads
Scale infrastructure clusters to handle hundreds to thousands of nodes
Automate operational tasks to improve system management
Contribute to system design and architecture decisions
Collaborate with cross-functional teams and act as a liaison between users and infrastructure

Benefits

Hybrid work environment offering flexibility
Opportunity to work on cutting-edge AI infrastructure
Collaborative work culture with multiple teams
Focus on professional development and learning new skills

Full Job Description

About the Role

We are looking for Systems Engineers / System Administrators to help design, operate, and scale the infrastructure behind Mistral's AI platforms.

This is a hands-on, hybrid role combining:

Systems administration (operating and troubleshooting large-scale Linux environments)

Systems engineering (automation, scalability, and performance improvements)

You'll work closely with infrastructure, HPC, and research teams to ensure our clusters and platforms run reliably at scale.

What You'll Work On
Core Systems Operations

Operate and maintain large-scale Linux environments (bare metal, clusters, cloud)
Monitor system health, troubleshoot incidents, and ensure high availability
Support production and research workloads across multiple environments

Scaling Infrastructure

Help scale clusters toward hundreds to thousands of nodes
Work on systems handling petabyte-scale storage
Improve performance, reliability, and resource utilisation

Automation & Engineering

Automate operational tasks using tools like Python, Bash, Ansible, or Terraform
Improve deployment, provisioning, and system lifecycle management
Contribute to system design and architecture decisions

Cross-Functional Collaboration

Work closely with:
- HPC / infrastructure teams
- Platform / DevOps engineers
- Research teams
Act as a bridge between users and infrastructure

What We're Looking For
Must-have

Strong Linux systems administration experience (core requirement)
Experience working in large-scale environments:
- HPC clusters or cloud infrastructure
Experience with Job schedulers (e.g. Slurm)
Solid troubleshooting skills across systems, hardware, and networks

Nice-to-have (any of these)

We are not expecting everything - strong depth in one area is valuable.

Containers / orchestration (e.g. Kubernetes)
Storage systems (e.g. Ceph, Lustre, NFS)
Networking fundamentals (Ethernet; InfiniBand is a plus)
Infrastructure as Code / automation tooling
GPU or AI/ML experience

Profile We Value

Pragmatic problem solver who can operate in fast-scaling environments
Comfortable working across multiple domains ("Swiss army knife" mindset)
Able to go deep in one area while learning others
Low-ego, collaborative, and hands-on

* Ladders Estimates

Similar Jobs

Senior Site Reliability Engineer
$100K — $130K *
Royal Bank of Canada
Mississauga, ON L4T 0A1
Today
Senior Platform Engineer
$100K — $140K *
Wagepoint
Remote
Reposted Today
Technical Engineer - Storage Engineering
$116K — $194K *
M&T Bank Corporation
Buffalo, NY 14221 (Erie County)
Today
System Engineer (Linux- RedHat)
$90K — $120K *
Intelerad
Remote
Today
MMEL Senior Engineer
$90K — $120K *
Government of Canada
Ottawa, ON K1G 3J6
Today
Sr. Mainframe Systems Engineer - Remote
$100K — $130K *
A.C. Coy
Remote
Today

Get Ready For Your
Next Interview

More Jobs at Mistral AI

Systems Engineer, HPC (US & Canada)
$90K — $130K *
Toronto, ON M3C 0E3
Today
Information Technology
In-Person
IT Specialist - Palo Alto
$90K — $120K *
Palo Alto, CA 94303 (Santa Clara County)
Yesterday
Information Technology
In-Person
AI Deployment Strategist - USA
$120K — $150K *
New York, NY 10025 (New York County)
1 week ago
Enterprise Technology
Hybrid
Technical Support Expert
$80K — $120K *
New York, NY 10025 (New York County)
1 week ago
Technical Services
Hybrid

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
2 days ago
Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
2 weeks ago
Principal Security Technology Strategist
$172K — $258K *
Citrix
Los Angeles, CA 90011 (Los Angeles County)
Today
Senior Data Center Support Services Technician
$57K — $118K *
Oracle Corporation
Harwood, ND 58042 (Cass County)
Today
VP of Engineering (Remote)
$180K — $220K *
Crystal Intelligence
Remote
Reposted Today

Find similar Systems Engineer, HPC (US & Canada) jobs:

Nationwide Toronto, ON

Systems Engineer, HPC (US & Canada)

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Systems Engineer, HPC (US & Canada) jobs:

Get Ready For Your
Next Interview