Mozilla

Senior Machine Learning Engineer, AI Platform

$128K-$171K CAD
US-Anywhere
Remote in Canada
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree with 4-6 years of relevant industry experience, or Master's degree with significant hands-on experience in production ML systems, or equivalent work experience.
  • Strong proficiency in Python for developing machine learning systems and backend services.
  • Proven experience in deploying ML workloads using cloud infrastructure.
  • Knowledge of model serving architectures and performance tradeoffs.
  • Hands-on experience with GPU-based workloads in production environments.
  • Familiarity with CI/CD pipeline design for ML deployment.
  • Ability to drive technical initiatives independently and communicate effectively.

Responsibilities

  • Design and build core AI platform components for training and deploying ML models.
  • Own end-to-end model serving and inference workflows, ensuring reliability and scalability.
  • Lead optimization efforts for inference systems regarding throughput, latency, and cost.
  • Manage GPU-based workloads, including performance tuning and resource optimization.
  • Improve components of the model lifecycle such as packaging and deployment automation.
  • Implement observability practices for ML services and pipelines.
  • Collaborate across teams to design scalable AI platform capabilities.

Benefits

  • Generous performance-based bonus plans for eligible employees.
  • Comprehensive medical, dental, and vision coverage.
  • Retirement contributions with 100% immediate vesting, whether or not the employee contributes.
  • Quarterly wellness days for company-wide breaks.
  • Country-specific holidays plus an extra day off for your birthday.
  • One-time stipend for home office setup.
  • Annual budget for professional development opportunities.
  • Quarterly well-being stipend for holistic employee health.
  • Considerable paid parental leave provision.
  • Employee referral bonus program available.

Full Job Description

About this team and role:

The AI Platform team is responsible for building the foundational infrastructure that powers intelligent experiences across Mozilla products. This includes model training pipelines, high-throughput inference services, GPU orchestration, and secure, privacy-respecting AI systems that operate reliably at global scale.

We're looking for a Machine Learning Engineer with a strong platform mindset to help design, build, and operate Mozilla's AI platform. In this role, you'll work at the intersection of machine learning, distributed systems, and production infrastructure, ensuring that models can be trained, deployed, and served efficiently, securely, and at scale. You will collaborate closely with product, infrastructure, and security teams to enable fast iteration while meeting strict performance and privacy requirements.

What You'll Do:
  • Design, build, and operate core AI platform components used to train, deploy, and serve machine learning models in production environments.
  • Own model serving and inference workflows end-to-end, driving improvements in reliability, scalability, performance, and operational excellence.
  • Lead efforts to optimize inference systems for throughput, latency, and cost efficiency across CPU and GPU workloads.
  • Design and manage GPU-based inference and training workloads, including performance tuning, capacity planning, and resource utilization optimization.
  • Own and improve critical parts of the model lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation.
  • Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience of ML services and pipelines.
  • Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities that enable AI-powered features.
  • Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers through code reviews and knowledge sharing.
  • Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews.

What You'll Bring:
  • Bachelor's degree with 4-6 years of relevant industry experience, or Master's degree with significant hands-on experience building and operating production ML systems, or equivalent work experience.
  • Strong experience developing in Python for machine learning systems, backend services, or distributed data processing.
  • Proven experience deploying and operating ML workloads in cloud environments, including production-grade infrastructure.
  • Solid understanding of model serving architectures, inference pipelines, and performance tradeoffs (latency, throughput, cost, scaling strategies).
  • Hands-on experience working with GPU-based workloads and accelerated computing in production settings.
  • Experience designing CI/CD pipelines and development workflows that support reliable ML system deployment.
  • Ability to independently scope and drive technical initiatives while balancing product and operational priorities.
  • Strong problem-solving skills and the ability to debug performance and reliability issues in distributed systems.
  • Clear and effective communication skills, with experience collaborating across engineering, product, and infrastructure teams.

Bonus Skills:
  • Experience implementing inference optimization strategies such as batching, quantization, compilation, model conversion, or hardware-specific tuning.
  • Familiarity with containerization and orchestration systems (e.g., Docker, Kubernetes) in production environments.
  • Experience designing observability systems for distributed services, including metrics strategy and performance profiling.
  • Exposure to privacy-preserving ML techniques, security best practices, or responsible AI system design.
  • Contributions to open-source ML infrastructure projects or leadership in building reusable internal ML tooling.

What you'll get:
  • Generous performance-based bonus plans for all eligible employees; we share in our success as one team
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting (regardless of whether you contribute)
  • Quarterly all-company wellness days where everyone takes a pause together
  • Country-specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
  • Other benefits (life/AD&D, disability, EAP, etc.; varies by country)

Hiring Ranges:

Canada Tier 1 Locations

$128,000-$171,000 CAD

Canada Tier 2 Locations

$116,000-$155,000 CAD

About Mozilla

Mozilla is a global community of technologists, thinkers, and builders working together to keep the internet open and accessible to all. The company is best known for its flagship product, the Firefox web browser, which is used by millions of people around the world. In addition to its browser, Mozilla also develops a range of other products and services, including a mobile operating system, a password manager, and a virtual private network (VPN) service.
Size: 1,000 employees
Founded: 1998
