Senior Staff Production Engineer

Zscaler • $140K — $200K *

San Jose, CA 95123In-Person

Enterprise Technology

8 - 10 years of experience

2 weeks ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

8+ years managing reliability and scalability of large-scale production services
Deep programming expertise in Python, Go, or C/C++
Strong background in networking protocols and Linux/FreeBSD systems
Experience with high-stakes incident management and 24/7 on-call rotation
Proficient in ITIL frameworks and systematic problem management

Responsibilities

Design and implement scalable infrastructure across AWS, Azure, GCP, and bare-metal environments
Drive automation culture by coding to eliminate manual tasks
Implement observability tools and define SLIs/SLOs
Act as lead Incident Commander, develop incident response playbooks
Collaborate with engineering teams for operability reviews

Benefits

Comprehensive health plans
Vacation and sick time
Parental leave options
Retirement options
Education reimbursement
In-office perks

Full Job Description

Role

We are looking for a Sr. Staff Production Engineer to join our team. This role is available as a hybrid opportunity 3 days a week in San Jose, CA or as a remote position, reporting to Production Engineering in the Cloud Infrastructure & Operations department. Join Zscaler to be a force multiplier for the reliability of a global platform protecting over 15 million users.

In this role, you will provide the technical vision and hands-on execution to drive an "automation-first" culture across the company. By maturing our observability and architectural standards, you will directly reduce our Mean Time to Mitigate (MTTM) and shape the scalability of our globally distributed, multi-cloud infrastructure.

What you'll do (Role Expectations)

Design and implement highly available, scalable infrastructure across AWS, Azure, GCP, and bare-metal environments
Drive an "automation-first" culture by writing code (Python/Go) to eliminate manual toil and build self-healing systems
Implement and maintain sophisticated observability (Prometheus, Grafana, OpenTelemetry), define SLIs/SLOs, and establish error budgets
Act as a lead Incident Commander (TDO on-call), develop response playbooks, and conduct deep-dive post-incident analyses
Partner with Engineering and partner teams to conduct operability reviews

Who You Are (Success Profile)

You act like an owner with a bias for action and integrity.
You are a pragmatic builder obsessed with creating, iterating, and shipping.
You champion simplicity by distilling complex problems into clear, actionable plans.
You are data-driven, valuing evidence over assumptions.
You think at scale, building solutions and processes built to last a high-growth global organization.

What We're Looking for (Minimum Qualifications)

8+ years of experience managing reliability, scalability, and availability for large-scale production services
Deep expertise in programming (e.g., Python, Go, or C/C++)
Strong background in networking protocols, Linux/FreeBSD systems, and distributed architecture
Experience in high-stakes incident management and participation in a 24/7 on-call rotation
Proficiency in leveraging ITIL frameworks and incident data to drive service maturity through systematic problem management and technical operability reviews

What Will Make You Stand Out (Preferred Qualifications)

Extensive experience with public cloud (AWS, Azure, GCP) and Infrastructure-as-Code (Ansible, Terraform)
Experience with chaos engineering and disaster recovery planning at scale
Expertise in global routing (BGP) and traffic tunneling (GRE, IPSec) with a deep understanding of L7 proxy architectures (HAProxy), DNS at scale, and OS networking stack internals

#LI-Hybrid

#LI-CM3

Zscaler's salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training.

The base salary range listed for this full-time position excludes commission/ bonus/ equity (if applicable) + benefits.

Base Pay Range

$140,000-$200,000 USD

Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including:

Various health plans
Time off plans for vacation and sick time
Parental leave options
Retirement options
Education reimbursement
In-office perks, and more!

Learn more about Zscaler's Future of Work strategy, hybrid working model, and benefits here.

About Zscaler

Zscaler is a cloud-based information security company that provides Internet security, web security, firewalls, sandboxing, SSL inspection, antivirus, vulnerability management and granular control of user activity in cloud computing, mobile and Internet of things environments. The company is headquartered in San Jose, California, and has offices in Australia, India, Japan, Singapore, the United Kingdom, and the United States.

Learn more about Zscaler

Size

3,153 employees

Market Cap

$15.5 billion

Industry

Information Technology

Net Income

-$191.4 million

Founded

2008

5 Year Trend

+54.1%

Revenue

$536 million

NASDAQ

* Ladders Estimates

Similar Jobs

Staff Operations Engineer
$128K — $171K *
Mozilla
Remote
3 days ago
Staff Production Engineer (Cloud Platform & Reliability - Machine Identity Security) - hybrid
$130K — $180K *
Palo Alto Networks
Santa Clara, CA 95051 (Santa Clara County)
4 days ago
Sr Staff DevOps Engineer
$197K — $278K *
42dot, Inc
Sunnyvale, CA 94087 (Santa Clara County)
2 weeks ago
Staff Software Engineer, Backend (Continuous Integration)
$200K — $275K *
Affirm
Remote
2 weeks ago
Staff Site Reliability Engineer (SRE) | Dev Ops Engineer
$169K — $224K *
Grail
Menlo Park, CA 94025 (San Mateo County)
1 month ago

Get Ready For Your
Next Interview

More Jobs at Zscaler

Principal Product Specialist (Eastern Time)
$164K — $235K *
Remote
Reposted Yesterday
Information Technology
Remote in United States
Treasury Manager
$145K — $182K *
San Jose, CA 95123 (Santa Clara County)
2 days ago
Finance & Insurance
In-Person
Principal Product Specialist (Eastern Time)
$164K — $235K *
Remote
Reposted 3 days ago
Enterprise Technology
Remote in United States
Specialist Account Executive, ZT Branch - Majors
$133K — $190K *
Remote
3 days ago
Enterprise Technology
Remote in United States
Principal Product Manager-Agentic SecOps
$171K — $245K *
San Jose, CA 95123 (Santa Clara County)
3 days ago
Enterprise Technology
In-Person

More Enterprise Technology Jobs

Senior Audit Manager - Cybersecurity - AI and Cloud
$94K — $176K *
Bank of Montreal
Toronto, ON M3C 0E3
Today
EverPro - Engineering Manager (Remote)
$150K — $165K *
EverCommerce
Remote
Today
EverCommerce - Staff Software Engineer
$150K — $170K *
EverCommerce
Remote
Today
Director, Global Lead, PI and Digital Twin
$168K — $281K *
Aveva
Lake Forest, CA 92630 (Orange County)
Today
Managing Director, Digital Transformation
$150K — $220K *
Baker Tilly
New York, NY 10025 (New York County)
Today

Find similar Senior Staff Production Engineer jobs:

Nationwide San Jose, CA

Senior Staff Production Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Staff Production Engineer jobs:

Get Ready For Your
Next Interview