Staff Machine Learning Engineer, Offline Infrastructure

Unity Technologies • $209K — $283K *

Mountain View, CA 94040In-Person

Information Technology

Less than 5 years of experience

1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5-7 years of experience building large-scale ML pipelines
Proficiency with distributed computing frameworks like Ray, Spark, Flink
Expertise in training data generation and dataset preparation
Significant experience with production-grade data pipeline operations
Strong Python programming skills for distributed workload management
Familiarity with modern data infrastructure including data lakes and orchestration systems
Ability to lead technical decisions and influence without formal authority

Responsibilities

Design and operate large-scale data pipelines for ML training datasets
Develop infrastructure for distributed training workflows using Pytorch and Ray
Integrate ML pipelines with workflow orchestration systems like Flyte or Airflow
Enhance reproducibility and observability in ML pipelines through validation
Optimize performance and resource use in distributed compute systems
Collaborate with ML engineers for large-scale model experimentation
Lead architectural improvements for scalable and cost-effective ML pipelines

Benefits

Comprehensive health, life, and disability insurance
Commute subsidy
Employee stock ownership
Competitive retirement/pension plans
Generous vacation and personal days
Support for new parents with leave and family-care programs
Office snacks and meals
Mental Health and Wellbeing programs
Employee Resource Groups
Global Employee Assistance Program
Training and development programs
Volunteering and donation matching program

Full Job Description

The opportunity
Unity Vector builds an offline ML platform that powers insight, experimentation, attribution, and AI-driven decision-making across the company.

Our systems operate at scale across batch and streaming data, supporting analytics, product intelligence, machine learning pipelines, and business operations. As data volume and complexity grow, our platform also supports large-scale model training, feature generation, and experimentation workflows that power production ML systems.

To support this growth, we need strong technical ownership to ensure our ML pipelines remain reliable, scalable, and architecturally sound.

We are seeking a staff ML engineer to design and evolve the large-scale offline platform. This role focuses on building reliable infrastructure for generating training datasets, orchestrating ML workflows, and enabling efficient, distributed model training at scale. You will work closely with ML engineers and platform teams to ensure our pipelines can efficiently handle growing data volumes and increasingly complex training workloads.

You will play a key role in shaping how model datasets are prepared as well as model training, validated, and delivered to distributed training systems, while ensuring the reliability, scalability, and performance of our offline ML platform.

What you'll be doing

Design and operate large-scale data pipelines that generate training datasets used for machine learning training and experimentation
Develop infrastructure that supports distributed training workflows using technologies such as Pytorch, Ray Data, and Ray Train, etc.
Integrate ML pipelines with workflow orchestration systems (e.g., Flyte, Airflow, or similar) to enable reliable multi-stage training workflows
Improve reproducibility and observability of ML pipelines through dataset validation, monitoring, and automated testing
Optimize performance and resource utilization across distributed compute systems used for data processing and model training
Partner closely with ML engineers to enable efficient large-scale experimentation and model iteration
Lead architectural improvements to ensure our offline ML pipelines remain scalable, reliable, and cost-efficient

What we're looking for

Strong experience building large-scale ML pipelines
Experience working with distributed computing frameworks such as Ray, Spark, Flink and familiarity in the Ray ecosystem (Ray Data, Ray Train) for distributed data processing and model training
Experience building infrastructure for training data generation, dataset preparation, or ML feature pipelines
Deep experience designing and operating production-grade data pipelines
Strong programming skills in Python and experience working with large-scale distributed workloads
Experience with modern data infrastructure (data lakes, warehouses, orchestration systems, streaming platforms)
Strong systems thinking, with the ability to reason about performance, scalability, reliability, and cost tradeoffs in distributed systems
Proven ability to lead technical direction and influence architectural decisions across teams without formal authority

Additional information

Relocation support is not available for this position

Benefits
At Unity, we want our team members to thrive. We offer a wide range of benefits designed to support well-being and work-life balance.

Please note: Benefits eligibility, specific offerings, and coverage vary based on the country and employment status.

While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program

*Note: Certain locations require a good faith disclosure of the base salary range for the role. The actual salary for the successful candidate may differ based on location, experience, and other job-related factors.

Gross pay salary

$209,700-$283,800 USD

About Unity Technologies

Unity Technologies is a software company that provides a platform for creating and operating interactive, real-time 3D content. The company's platform is used by game developers, architects, automotive designers, filmmakers, and other creators to build and distribute interactive experiences. Unity Technologies was founded in 2004 and is headquartered in San Francisco, California. The company has offices in North America, Europe, and Asia.

Learn more about Unity Technologies

Size

4,000 employees

Industry

Information Technology

Founded

2004

* Ladders Estimates

Similar Jobs

Staff Machine Learning Engineer
$189K — $389K *
Pinterest
Remote
Today
Staff Machine Learning Engineer
$189K — $389K *
Pinterest
San Francisco, CA 94112 (San Francisco County)
Today
Staff Software Engineer - Machine Learning
$134K — $235K *
General Motors
Remote
Reposted Today
Staff Applied Scientist, Marketplace (Canada-Only)
$232K — $300K *
Thumbtack, Inc.
Remote
Today
Staff ML Engineer, Fine Tuning - Slack
$197K — $313K *
Salesforce
San Francisco, CA 94112 (San Francisco County)
2 days ago
Staff Machine Learning Engineer, Fulfillment Planning
$137K — $299K *
DoorDash
San Francisco, CA 94112 (San Francisco County)
5 days ago

Get Ready For Your
Next Interview

More Jobs at Unity Technologies

Senior Machine Learning Infrastructure Engineer
$183K — $248K *
Bellevue, WA 98006 (King County)
Reposted Today
Information Technology
In-Person
Director, Content Marketing
$198K — $297K *
New York, NY 10025 (New York County)
Reposted Today
Media
In-Person
Senior Machine Learning Infrastructure Engineer
$183K — $248K *
Mountain View, CA 94040 (Santa Clara County)
Reposted Yesterday
Enterprise Technology
In-Person
Director, Content Marketing
$198K — $297K *
Remote
Reposted Yesterday
Consumer Technology
Remote in New York, NY
Senior Machine Learning Infrastructure Engineer
$183K — $248K *
Bellevue, WA 98006 (King County)
Reposted 2 days ago
Enterprise Technology
In-Person

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Customer Support
Confidential Company
Austin, TX 78701 (Travis County)
2 weeks ago
Sr Assoc, Cyber Sec ThreatMgmt - Detection Engineer
$88K — $151K *
Northern Trust
Naperville, IL 60540 (Dupage County)
Today
Global Director – Vulnerability Management & Security Configuration
$164K — $288K *
Northern Trust
Chicago, IL 60629 (Cook County)
Today

Find similar Staff Machine Learning Engineer, Offline Infrastructure jobs:

Nationwide Mountain View, CA

Staff Machine Learning Engineer, Offline Infrastructure

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Staff Machine Learning Engineer, Offline Infrastructure jobs:

Get Ready For Your
Next Interview