Nuro

Senior Software Engineer, ML Infrastructure

Nuro$193K — $291K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 4+ years experience in ML Infrastructure, Backend Platform Engineering, or Distributed Systems
  • Deep knowledge of Infrastructure-as-Code tools like Terraform, Pulumi, or Crossplane
  • Hands-on experience with workload orchestrators for compute-intensive tasks
  • Proficient in distributed processing tools like Apache Spark or Apache Beam
  • Experience with feature management and caching solutions such as Feast or Redis
  • Strong grasp of distributed systems and high-performance computing challenges

Responsibilities

  • Scale automated Infrastructure-as-Code pipelines to handle thousands of GPU/CPU nodes
  • Design and optimize workload orchestration for high hardware utilization
  • Create robust ETL pipelines for petabyte-scale data
  • Implement feature caching solutions to enhance access to pre-computed features
  • Contribute to a unified ML platform that simplifies cloud infrastructure for users

Benefits

  • Comprehensive health, dental, and vision insurance
  • Eligible for an annual performance bonus
  • Participation in equity programs
  • Access to a competitive benefits package
  • Opportunities for professional development and continuous learning
Full Job Description
About the Role

Nuro is seeking a Software Engineer with expertise in large-scale infrastructure, workload orchestration, and data processing to join our ML Infrastructure team. In this role, you will focus on building and evolving the core platform that provides researchers and engineers with seamless access to compute and data resources. You will be responsible for executing the technical strategy for automated resource provisioning, high-performance workload scheduling, and efficient feature management to accelerate the Nuro Driver™ development lifecycle.

About the Work

You will build the foundation that powers Nuro's model development from experimentation to production. Key responsibilities include:
  • Resource Provisioning & IaC: Scaling automated infrastructure-as-code (IaC) pipelines to manage thousands of GPU/CPU nodes across diverse environments.
  • Intelligent Scheduling: Designing and optimizing workload orchestration to maximize hardware utilization, minimize job wait times, and handle massive-scale distributed training.
  • Data & ETL: Designing robust pipelines for the extraction and transformation of petabyte-scale sensor and telemetry data into ML-ready formats.
  • Feature Management: Implementing robust feature caching and storage solutions to reduce redundant computations and ensure low-latency access to pre-computed features.
  • Platform Abstraction: Contributing to a unified ML platform that abstracts complex cloud infrastructure for end-users.

About You
  • Experience: 4+ years of professional experience in ML Infrastructure, Backend Platform Engineering, or Distributed Systems.
  • Resource Provisioning: Deep familiarity with modern Infrastructure-as-Code and provisioning tools such as Terraform, Pulumi, or Crossplane.
  • Workload Scheduling: Hands-on experience building or managing large-scale orchestrators for compute-heavy workloads (e.g., Kubernetes, KubeRay, Ray, Slurm, or Volcano).
  • Distributed Data Processing: Proficiency in at least one distributed processing framework, such as Apache Spark or Apache Beam, for large-scale data extraction and transformation.
  • Feature Management: Experience implementing or maintaining feature stores and caching layers (e.g., Feast, Hopsworks, or Redis-based custom caching).
  • Systems Design: A strong understanding of distributed systems, networking, and storage bottlenecks in the context of high-performance computing.

Bonus Points
  • Active contributor to open-source projects in the MLOps or Cloud-Native ecosystem (e.g., CNCF, Ray, or Kubeflow communities).
  • Experience with high-performance storage systems (e.g., Lustre, Ceph, or specialized NVMe caching) for ML data loading.
  • Knowledge of cost-optimization strategies for large-scale GPU clusters in public clouds (AWS, GCP, or Azure).

At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $193,930 and $291,150 for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.

About Nuro

Nuro is a technology company that develops autonomous delivery vehicles. The company was founded in 2016 by two former Google engineers, Dave Ferguson and Jiajun Zhu. Nuro's vehicles are designed to transport goods, rather than people, and are being used by companies such as Kroger and Domino's Pizza for deliveries. The company has raised over $1 billion in funding and is valued at over $5 billion.
Learn more about Nuro
Size
1,000 employees
Industry
Founded
2016

Similar Jobs

More Jobs at Nuro

More Information Technology Jobs

Find similar Senior Software Engineer, ML Infrastructure jobs: