Machine Learning Engineer 5 - Decisioning & Optimization

Netflix • $466K — $500K+*

New York, NY 10025In-Person

Consumer Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

7+ years of software engineering experience
3+ years focused on ML infrastructure or model serving in an ads context
Experience with real-time model serving systems with sub-20ms latency
Proficiency in Java, Python, or Scala and understanding of performance optimization
Hands-on knowledge of ML serving frameworks and deployment constraints
Strong model monitoring skills including drift detection and latency profiling
Ability to bridge ML research and production engineering effectively

Responsibilities

Build and operate end-to-end ML model serving infrastructure for ad decisioning
Scale inference paths to support concurrent models at high QPS
Design and optimize feature serving paths with strict latency requirements
Productionize scoring and ranking models for ad selection
Monitor model performance in production to ensure reliability
Collaborate closely with Data Science & Platform teams
Create simulation infrastructure for validating model changes

Benefits

Comprehensive Health Plans
Mental Health support
401(k) Retirement Plan with employer match
Stock Option Program
Flexible paid leave policies
Family-forming benefits
Health Savings and Flexible Spending Accounts

Full Job Description

We launched a new ad-supported tier in November 2022 and are building an in-house world-class ad tech ecosystem to offer our members more choices in consuming their content. Our new tier allows us to attract new members at a lower price point while also creating a compelling path for advertisers to reach deeply engaged audiences.

Our Team

The Decisioning & Optimization engineering team owns the systems that determine which ad wins every impression, at what price, and how campaign budgets deliver across all inventory surfaces. Our work spans three platform areas:

ML infrastructure for model serving: real-time inference at 1M+ QPS, multi-model parallel evaluation, feature hydration, model lifecycle from canary deployment through production monitoring
Auction, ranking, and scoring: multi-stage candidate selection, scoring, bid valuation, dynamic pricing, and podding
Budget, pacing, and bidding: control systems for delivery optimization, budget planning, and bid computation

We are scaling from a handful of production models to 10+ while maintaining sub-20ms P99 inference budgets. We are looking for an ML engineer who can build and operate the serving infrastructure these models run on, and who understands the ads decisioning context well enough to make the right engineering tradeoffs.

What You'll Do

Build and operate end-to-end ML model serving infrastructure for real-time ad decisioning: model publishing, packaging, validation, deployment into the serving stack with zero-downtime hot-swap
Scale the inference path to support dozens of concurrent models on every ad request at 1M+ QPS with strict latency budgets, including batching strategies, CPU/GPU allocation, model versioning, and fallback tiers
Design and optimize the feature serving path: feature hydration from Chronon, Signal Service, and real-time streams with sub-10ms P99 fetch latency and online/offline consistency
Productionize scoring and ranking models for multi-stage ad selection (retrieval, early ranking, full scoring) and integrate model outputs into auction
Build model performance monitoring in production: inference latency, prediction distribution shifts, feature drift detection, score calibration, and regression detection before revenue impact
Partner closely with Data Science & Platform teams
Build simulation infrastructure to replay production traffic against candidate models offline, enabling validation of marketplace changes before live rollout
Drive operational excellence for ML systems: reliability, observability, capacity planning, incident response, and scaling for live events with 35M+ concurrent viewers

Skills & Experience We're Seeking

7+ years of software engineering experience; 3+ years focused on ML infrastructure, model serving, or ML platform work in an ads or real-time decisioning context
Built and operated real-time model serving systems at high QPS with sub-20ms latency: online inference, feature stores, model registries, model hot-swap, canary and shadow rollout
Proficiency in Java, Python, or Scala with a solid understanding of multi-threading, memory management, and performance optimization for latency-critical paths
Hands-on with ML serving frameworks: serialization, runtime optimization, and deployment constraints
Experience with feature engineering pipelines for real-time systems: online/offline consistency, hydration strategies, caching, and freshness tradeoffs
Strong understanding of model monitoring in production: drift detection, prediction distribution analysis, calibration, and latency profiling
Comfortable working at the boundary between ML research and production engineering: can take a model artifact and turn it into a production-ready service that meets SLA
Demonstrated ability to operate in an environment that requires both big-tech scale and startup speed

Nice to Haves

Ads domain experience: ranking models, bid scoring, reserve pricing, yield optimization, dynamic allocation across guaranteed and non-guaranteed inventory
Experience with auction mechanics: multi-stage ranking, bid shading, bid prediction, marketplace competition dynamics
Built or improved budget pacing and delivery control systems
Built simulation or counterfactual testing platforms for marketplace or auction systems
Experience with A/B testing infrastructure for model rollouts: online experiments, holdout groups, interference-aware evaluation in marketplace settings
Familiar with CTV constraints: server-side ad insertion, live event ad serving at scale, burst traffic patterns
JVM ecosystem

Generally, our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top of market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $466,000.00 - $750,000.00.

Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more details about our Benefits here.

About Netflix

Netflix, Inc. is an American media company founded on August 29, 1997 by Reed Hastings and Marc Randolph in Scotts Valley, California, and currently based in Los Gatos, California, with production offices and stages at the Los Angeles-based Hollywood studios (formerly old Warner Brothers studios) and the Albuquerque Studios (formerly ABQ studios). It operates an eponymous over-the-top subscription video on-demand service, which showcases acquired and original programming as well as third-party content licensed from other production companies and distributors. Netflix is also the first streaming media company to be a member of the Motion Picture Association.

Learn more about Netflix

Size

11,300 employees

Market Cap

$127.6 billion

Industry

Hospitality & Recreation

Net Income

$2.7 billion

Founded

1997

5 Year Trend

+27.5%

Revenue

$24.9 billion

NASDAQ

NFLX

* Ladders Estimates

Similar Jobs

Staff Backend Engineer
$75K — $500K *
MLabs
New York, NY 10025 (New York County)
Reposted 3 days ago
Senior Staff React Native Engineer ($500k - 1.5m salary)
$500K+*
Baton Corporation Ltd
New York, NY 10025 (New York County)
Reposted 1 week ago
AI Engineer 6 - AI Foundation & Tooling, Ads Platform
$500K+*
Netflix
Remote
2 weeks ago
Staff Backend Engineer
$75K — $500K *
MLabs
New York, NY 10025 (New York County)
2 weeks ago
Senior Staff Software Engineer
$485K — $500K+*
Anthropic
New York City, NY 10025 (New York County)
3 weeks ago
Software Engineer 5 - Member API
$388K — $500K+*
Netflix
Remote
4 weeks ago

Get Ready For Your
Next Interview

More Jobs at Netflix

Technical Program Manager 6 - Games Data Science & Engineering
$420K — $500K+*
Remote
Today
Consumer Technology
Remote in United States
Lead Compositor
$100K — $150K *
Vancouver, BC V5K 5J9
Today
Media
In-Person
Crowd Artist
$105K — $136K *
Vancouver, BC V5K 5J9
Today
Media
In-Person
Distributed Systems Engineer 6 - Ad Eventing
$499K — $500K+*
New York, NY 10025 (New York County)
Today
Information Technology
In-Person
Project Manager, Stage Engineering
$125K — $150K *
Los Angeles, CA 90011 (Los Angeles County)
Today
Media
In-Person

More Consumer Technology Jobs

CPU Microarchitect/RTL Engineer - Execution, Load/Store
$130K — $180K *
Apple
Beaverton, OR 97007 (Washington County)
Reposted Today
Lead Hardware System Architect
$200K — $250K *
Bolt Graphics
Sunnyvale, CA 94087 (Santa Clara County)
Today
Forward Deployed Linguist
$90K — $130K *
Rime Labs
San Francisco, CA 94112 (San Francisco County)
Today
Engineering Program Manager, Demo Production Engineering
$130K — $180K *
Apple
Sunnyvale, CA 94087 (Santa Clara County)
Today
Machine Learning Engineer - News, Books, and Stocks Team
$120K — $150K *
Apple
Austin, TX 78745 (Travis County)
Today

Find similar Machine Learning Engineer 5 - Decisioning & Optimization jobs:

Nationwide New York, NY

Machine Learning Engineer 5 - Decisioning & Optimization

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Machine Learning Engineer 5 - Decisioning & Optimization jobs:

Get Ready For Your
Next Interview