AI Infrastructure Engineer

$151K — $332K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, AI, ML, Cognitive Science, or related field
  • 5+ years of software engineering experience focused on infrastructure and systems
  • Expertise in GPU programming and CUDA optimization
  • Experience with Docker, Kubernetes, distributed systems, and cloud computing
  • Experience in building large-scale distributed systems and optimizing neural network performance
  • Programming skills in Python, C++, and CUDA; familiarity with deep learning frameworks like PyTorch and Transformers
  • Deep understanding of neural network architectures and training methodologies

Responsibilities

  • Design and develop scalable AI infrastructure solutions for training and deploying LLMs
  • Build and optimize distributed training platforms using advanced technologies
  • Implement and maintain containerized AI environments using Docker and Kubernetes
  • Optimize CUDA kernels for maximum GPU utilization and performance
  • Develop platform software to support AI/ML workflows
  • Collaborate with AI researchers to implement efficient training and inference pipelines

Benefits

  • Award-winning workplace culture focused on employee well-being
  • Variety of perks promoting physical, mental, emotional, and financial health
  • Support for work-life balance and community contributions
  • Structured hybrid working model allowing flexibility
Full Job Description

What you can expect

We are seeking an experienced AI Infrastructure Engineer to join our AI Incubation team. You will be focused on building and optimizing large-scale training infrastructure for Large Language Models (LLMs). The ideal candidate will combine engineering fundamentals with practical experience in AI infrastructure development, demonstrating both technical depth and the ability to deliver scalable solutions for complex AI systems.

About the Team

The AI incubation team is dedicated to incubating AI breakthroughs, including foundational AI techniques and AI native applications that will largely improve people’s work productivity.

Responsibilities:

  • Designing and developing scalable AI infrastructure solutions for training and deploying large language models

  • Building and optimizing distributed training platforms using cutting-edge technologies

  • Implementing and maintaining containerized AI environments using Docker and Kubernetes

  • Optimizing CUDA kernels for maximum GPU utilization and performance

  • Developing platform software to support AI/ML workflows

  • Collaborating with AI researchers to implement efficient training and inference pipelines

What we’re looking for

  • Have a bachelor's degree in Computer Science, Engineering, AI, Machine Learning, Distributed System or related field

  • 5+ years of software engineering experience with focus on infrastructure and systems

  • Have expertise in GPU programming and CUDA optimization

  • Have experience with container technologies (Docker, Kubernetes), distributed systems and cloud computing

  • Demonstrate experience building large-scale distributed systems and optimizing neural network performance

  • Possess programming skills in Python, C++, and CUDA, with deep learning frameworks (PyTorch, Transformers)

Salary Range or On Target Earnings:

Minimum:

$151,800.00

Maximum:

$332,200.00

In addition to the base salary and/or OTE listed Zoom has a Total Direct Compensation philosophy that takes into consideration; base salary, bonus and equity value.

Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.

We also have a location based compensation structure; there may be a different range for candidates in this and other locations.

Ways of Working
Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In-Person is indicated in the job description/posting.

Benefits
As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways. Click for more information.

Similar Jobs

More Jobs at

More Information Technology Jobs

Find similar AI Infrastructure Engineer jobs: