Sr. Staff Engineer, DevOps Platform

Ayar Labs

$150K - $180K
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, Information Systems, or equivalent experience.
  • 5+ years of experience in platform/cloud infrastructure and/or data engineering.
  • Hands-on experience with core AWS services like IAM, VPC, S3, EC2, and managed databases.
  • Proficiency in Terraform/OpenTofu and knowledge of CloudFormation.
  • Experience with Kubernetes (EKS preferred) and CI/CD pipeline design.
  • Familiarity with monitoring tools such as Datadog, Prometheus, and Grafana.
  • Strong Python and SQL skills, with experience in cloud data platforms.

Responsibilities

  • Design, build, and operate cloud infrastructure using Infrastructure as Code.
  • Develop and manage Kubernetes clusters along with the necessary security and networking layers.
  • Build CI/CD pipelines and deployment automation for safe and frequent software releases.
  • Implement reliability and security measures including IAM, SLAs, and observability best practices.
  • Optimize costs and create self-service tools for engineering teams.
  • Enhance observability through metrics, logging, and incident response strategies.
  • Collaborate on Python-based data platforms and mentor engineers on best practices.

Benefits

  • Meaningful equity including new-hire stock options and an evergreen program.
  • 401(k) with immediate-vesting employer match.
  • Premium medical, dental, and vision coverage fully paid for employees.
  • Twelve weeks of paid parental leave.
  • Daily lunch provided.

Full Job Description
Sr. Staff Engineer, DevOps Platform

Location: San Jose, CA (on-site)

Reporting to the Head of the Data Pillar, the Sr. Staff DevOps Platform Engineer owns the internal platforms that power our data and engineering teams. In this role, you'll improve developer productivity, reliability, scalability, and operational excellence across our infrastructure, deployment systems, observability stack, and cloud-native services.

You will build the foundational systems - cloud infrastructure, CI/CD, observability, and our data and compute platform - that let engineers across silicon, photonics, firmware, and software move quickly and reliably. You should be comfortable owning systems end to end, from architecture through on-call.

What You'll Achieve
  • Cloud infrastructure: Design, build, and operate cloud infrastructure (mostly AWS) with everything defined as code using IaC tooling.
  • Build & Operate the Container Platform: Design and run Kubernetes clusters and the surrounding networking, security, and storage layers that workloads depend on.
  • Automate the Path to Production: Build and own CI/CD pipelines and deployment automation that let engineers ship safely and frequently, with GitOps-style workflows and repeatable, auditable releases.
  • Own Reliability & Security: Implement least-privilege IAM and guardrails, define and uphold SLAs, instrument end-to-end observability, and lead on-call and incident response - for both the platform and the data flowing through it.
  • Drive Efficiency & Self-Serve: Optimize cloud spend and deliver self-service paved-road tooling and environments that let our teams provision and ship safely with low overhead.
  • Observability & reliability: Improve observability through metrics, logs, tracing, dashboards, alerts, and incident response practices.
  • Data & compute platforms: Contribute to and collaborate on the development of our Python-based data and compute platforms, including data asset pipelines, HPC, ML, and large-scale simulation.
  • Leading & mentoring: Establish and champion best practices for reliability, security, cost efficiency, and infrastructure governance. Mentor engineers and raise the engineering bar through reviews, documentation, and shared standards.


Required Skills
  • Bachelor's Degree in Computer Science, Information Systems, or a related field, or equivalent practical experience.
  • 5+ years of experience across platform/cloud infrastructure and/or data engineering, with a track record of building and operating production cloud platforms.
  • Deep hands-on experience across core AWS services - IAM, VPC and networking, S3, EC2/ECS/EKS, Lambda, CloudWatch, and managed databases (RDS).
  • Strong proficiency with Terraform/OpenTofu - reusable modules, state management at scale, and sound code structure (CloudFormation a plus).
  • Production Kubernetes experience (EKS preferred - Helm, networking, RBAC) and experience designing CI/CD pipelines (GitLab); GitOps tooling such as ArgoCD or Flux a plus.
  • Hands-on with monitoring/observability tooling such as Datadog, CloudWatch, Prometheus, Grafana, and PagerDuty, plus experience defining SLAs and managing cost.
  • Python and SQL skills, as well as experience with cloud data platforms and databases.
  • A passion for building scalable, well-documented platform and data solutions that have a measurable impact; interest in applying AI tools and modern approaches to infrastructure and data problems.
  • Excellent written and oral communication skills - you can write a clear status update, design proposal, or post-mortem that people actually read - and work effectively in a team environment.
  • Strong decision-making, technical versatility, and problem-resolution skills. HashiCorp (Terraform Associate) or AWS certifications are a plus.


NOTE TO RECRUITERS: Principals only. We are not accepting resumes from recruiters for this position. Remuneration for recruiting activities is only applicable subject to a signed and executed agreement between the parties. Please don't send candidates to Ayar Labs, and do not contact our managers.

Compensation & Benefits
The base salary range for this role is $150,000 - $180,000, depending on experience, skills, and qualifications. In addition to base pay, Ayar Labs offers meaningful equity (new-hire stock options plus an evergreen program), a 401(k) with immediate-vesting employer match, premium medical/dental/vision coverage (100% paid for employees, 75% for dependents), twelve weeks of paid parental leave, daily lunch, and more.
