THE ROLEAs a Senior Software Engineer in Production Engineering, you will be the architect of speed and reliability for our flagship FlashBlade and FlashArray products. You'll lead the mission to transform CI/CD pipelines into high-performance engines, ensuring our developers can ship world-class code without bottlenecks. By partnering across the engineering organization, you will treat "Quality as Code," building the automated systems and invisible infrastructure that define how Everpure delivers innovation.
WHAT YOU'LL DO- Architect CI/CD Ecosystems: Design and operate scalable, multi-stage CI pipelines and release services (using Jenkins, Groovy, and Kubernetes) that enforce quality gates and branch policies automatically, eliminating manual overhead.
- Build Developer-First Tooling: Create and maintain robust services, APIs, and CLIs in Python or Go that simplify how hundreds of engineers onboard, configure, and debug their workflows.
- Engineered Reliability & Observability: Integrate deep telemetry (metrics, logs, traces) into our delivery systems to proactively detect flakiness and resolve incidents before they impact the developer experience.
- Drive Data-Backed Quality: Develop data pipelines and actionable dashboards to monitor KPIs like pass-rates, escape metrics, and deployment velocity, using these signals to influence engineering-wide release decisions.
- Technical Leadership: Serve as a technical "captain" during high-impact integration cycles, leading cross-functional projects from initial design through global rollout while mentoring peers on modern production engineering practices.
WHAT YOU BRING- System Engineering Mastery: You possess a deep background in distributed systems design and software development (Python, Go, or similar), with a proven ability to build automated infrastructure at scale for large engineering organizations.
- Pipeline Expertise: You have extensive experience managing complex CI/CD topologies, including pipeline-as-code, artifact promotion strategies, and container orchestration (Kubernetes/OpenShift) within Linux/Unix environments.
- Operational Excellence: You bring a "quality-first" mindset to operations, utilizing SQL or analytics platforms to drive decisions and maintaining a calm, decisive approach when leading incident mitigations.
- Strategic Collaboration: You are a skilled communicator who can influence technical strategy across teams without formal authority, translating complex infrastructure trade-offs into clear outcomes for stakeholders.
- Location: We are primarily an in-office environment and therefore, you will be expected to work from the Santa Clara, CA office in compliance with Everpure's policies, unless you are on PTO, or work travel, or other approved leave.
#LI-Onsite
Salary ranges are determined based on role, level and location. For positions open to candidates in multiple geographical locations, the base salary range is reflective of the labor market across the applicable locations.
This role may be eligible for incentive pay and/or equity.
There is no application deadline and we accept applications on an ongoing basis until the job is filled.
The annual base salary range is:
$180,000-$270,000 USD
WHAT YOU CAN EXPECT FROM US:- Innovation: We celebrate those who think critically, like a challenge, and aspire to be trailblazers.
- Growth: We give you the space and support to grow along with us and to contribute to something meaningful. We have been named Fortune's Best Workplaces in Technology™, Fortune's Best Workplaces in the Bay Area™, and certified as a Great Place to Work®!
- Team: We build each other up and set aside ego for the greater good.
And because we understand the value of bringing your full and best self to work, we offer a variety of perks to manage a healthy balance, including flexible time off, wellness resources, and company-sponsored team events. Check out purebenefits.com for more information.