Mozilla

Senior Software Engineer, Cloud Development

Mozilla
$104K-$139K *
US-Anywhere
Remote in Canada
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Master's degree in a relevant field with 4-6 years of hands-on experience in production systems.
  • Strong proficiency in modern Python for developing maintainable code and tools.
  • Advanced experience in database management; familiarity with Postgres is a plus.
  • Proven ability to deploy and operate workloads in GCP and GKE.
  • Experience with Kubernetes and Helm for deployment across environments.
  • Familiarity with Terraform for infrastructure provisioning and management.
  • Demonstrated experience in designing scalable APIs with performance considerations.

Responsibilities

  • Design, build, and operate core services and APIs for production workloads.
  • Drive service reliability improvements, enhancing availability and performance standards.
  • Optimize backend services for better throughput and cost efficiency across infrastructure.
  • Manage Kubernetes workloads including deployment pipelines and resource optimization.
  • Enhance service lifecycle management through automation and testing practices.
  • Implement observability measures to ensure operational resilience of services.
  • Collaborate with cross-functional teams to develop scalable platform capabilities.

Benefits

  • Generous performance-based bonus plans for all eligible employees.
  • Rich medical, dental, and vision insurance coverage.
  • Generous retirement contributions with immediate vesting.
  • Quarterly wellness days to ensure company-wide breaks.
  • Specific holidays and a day off for birthdays.
  • Home office stipend to support remote work needs.
  • Annual budget for professional development.
  • Quarterly well-being stipend for health and wellness.
  • Considerable paid parental leave offered to employees.
  • Employee referral bonus program to encourage team growth.
Full Job Description
About the Team & Role
The AI Platform team is responsible for building the foundational infrastructure that powers intelligent experiences across Mozilla products. This includes model training pipelines, high-throughput inference services, GPU orchestration, and secure, privacy-respecting AI systems that operate reliably at global scale.

We're looking for a Senior Software Engineer with a strong platform mindset to help design, build, and operate Mozilla's AI platform. In this role, you'll work at the intersection of machine learning, distributed systems, and production infrastructure, ensuring that models can be trained, deployed, and served efficiently, securely, and at scale. You will collaborate closely with product, infrastructure, and security teams to enable fast iteration while meeting strict performance and privacy requirements.

What You'll Do
  • Design, build, and operate core platform services and APIs used to deploy and serve production workloads at scale.
  • Own service reliability end-to-end, driving improvements in availability, scalability, performance, and operational excellence.
  • Lead efforts to optimize backend services for throughput, latency, and cost efficiency across distributed infrastructure.
  • Design and manage Kubernetes-based workloads, including GitOps deployment pipelines, environment configuration, and resource utilization optimization.
  • Own and improve critical parts of the service lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation.
  • Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience of backend services and pipelines.
  • Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities that enable new product features.
  • Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers through code reviews and knowledge sharing.
  • Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews.

What You'll Bring
  • Bachelor's degree with 4-6 years of relevant industry experience, or Master's degree with significant hands-on experience building and operating production systems, or equivalent work experience.
  • Strong, modern Python skills, with experience writing clean, maintainable code and working with a fast toolchain (dependency management, linting, formatting, type checks, pre-commit), building both libraries and CLIs that output structured data.
  • Advanced experience with database deployment and management; familiarity with Postgres is a bonus.
  • Proven experience deploying and operating workloads in cloud environments, including production-grade infrastructure on GCP and GKE (artifact registries, managed caches, networking and internal load balancing, VPC, DNS, and separation of nonprod and prod).
  • Hands-on experience with Kubernetes and Helm, writing charts that deploy across environments with per-environment configuration and progressive feature rollout.
  • Experience with Terraform for provisioning infrastructure across environments, including schema validation and PR-level plan review.
  • Experience designing and running scalable APIs that hold up under load, including health and readiness checks, auth, and clean startup and shutdown.
  • Experience with Grafana or similar tools for metrics, dashboards, and reading application and infrastructure health together during rollouts.
  • Strong problem-solving skills and the ability to debug performance and reliability issues in distributed systems.
  • Clear and effective communication skills, with experience collaborating across engineering, product, and infrastructure teams.
  • On-call experience, including participating in incident response and post-incident reviews.

Bonus Skills
  • Experience with Ray or Ray Serve for GPU-backed model serving, including setting resource requests and replica counts aligned with available hardware.
  • Experience building stateless ML services such as embedding or similarity models, including multi-model loading, runtime device selection, batch APIs, and handling model-cache and cold-start tradeoffs.
  • Experience running a multi-provider LLM gateway, including routing between providers, migrating models, and mixing self-hosted with third-party serving.
  • Familiarity with containerization and orchestration systems in production environments beyond core Kubernetes/Helm usage.
  • Exposure to privacy-preserving ML techniques, security best practices, or responsible AI system design.
  • Contributions to open-source infrastructure projects or leadership in building reusable internal tooling.

What You'll Get
  • Generous performance-based bonus plans for all eligible employees; we share in our success as one team
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting (regardless of whether you contribute)
  • Quarterly all-company wellness days where everyone takes a pause together
  • Country-specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
  • Other benefits (life/AD&D, disability, EAP, etc. - varies by country)

Hiring Ranges:

Canada Tier 1 Locations: $104,000-$139,000 CAD

Canada Tier 2 Locations: $95,000-$126,000 CAD

About Mozilla

Mozilla is a global community of technologists, thinkers, and builders working together to keep the internet open and accessible to all. The company is best known for its flagship product, the Firefox web browser, which is used by millions of people around the world. In addition to its browser, Mozilla also develops a range of other products and services, including a mobile operating system, a password manager, and a virtual private network (VPN) service.
Size: 1,000 employees
Founded: 1998