ML Engineer, Generative Video

Mirage

$120K — $180K *
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • BS/MS/PhD in CS, ML, or related field
  • 2+ years of professional industry experience
  • Strong experience in deep learning systems and infrastructure
  • Expertise in PyTorch, CUDA, Triton, and distributed training (FSDP, etc.)
  • Experience scaling and optimizing large models under low-latency inference constraints
  • Strong debugging and performance profiling skills
  • Ability to move quickly from prototype to production

Responsibilities

  • Train and optimize large-scale video and multimodal models
  • Improve efficiency across training and inference (memory, latency, cost)
  • Implement techniques like distillation, quantization, and pruning for faster generation
  • Build and maintain distributed training systems
  • Optimize GPU utilization, parallelism, and throughput
  • Develop tooling for experimentation, evaluation, and debugging
  • Translate research models into robust, production-ready systems
  • Monitor and improve model performance in real-world usage

Benefits

  • Comprehensive medical, dental, and vision plans
  • 401K with employer match
  • Commuter Benefits
  • Catered lunch multiple days per week
  • Dinner stipend every night for working late
  • Grubhub subscription
  • Health & Wellness Perks
  • Multiple team offsites per year and monthly team events
  • Generous PTO policy
Full Job Description
Please note that all of our roles will require you to be in-person at our NYC HQ (located in Union Square)

About the RoleMirage is seeking an ML Engineer to build and scale the systems powering our video generation models. You'll work on novel modeling approaches, training objectives, scaling strategies, and inference optimization and efficiency to bring cutting-edge models into production.

This role sits at the intersection of research and systems engineering, focusing on making advanced models faster, more efficient, and capable of ultra-low latency, real-time generation.

Responsibilities
  • Train and optimize large-scale video and multimodal models
  • Improve efficiency across training and inference (memory, latency, cost)
  • Implement techniques such as distillation, quantization, and pruning to aggressively accelerate diffusion and autoregressive generation
  • Build and maintain distributed training systems
  • Optimize GPU utilization, parallelism, and throughput
  • Develop tooling for experimentation, evaluation, and debugging
  • Translate research models into robust, production-ready systems
  • Monitor and improve model performance in real-world usage

What makes you a great fit
  • BS/MS/PhD in CS, ML, or related field
  • 2+ years of professional industry experience
  • Strong experience in deep learning systems and infrastructure
  • Expertise in PyTorch, CUDA, Triton, and distributed training (FSDP, etc.)
  • Experience scaling and optimizing large models under low-latency inference constraints
  • Strong debugging and performance profiling skills
  • Ability to move quickly from prototype to production
Benefits:
  • Comprehensive medical, dental, and vision plans
  • 401K with employer match
  • Commuter Benefits
  • Catered lunch multiple days per week
  • Dinner stipend every night if you're working late and want a bite!
  • Grubhub subscription
  • Health & Wellness Perks
  • Multiple team offsites per year with team events every month
  • Generous PTO policy

Please note benefits apply to full time employees only.

Similar Jobs

More Jobs at Mirage

More Consumer Technology Jobs

Find similar ML Engineer, Generative Video jobs: