Senior Staff Engineer, Software

Vistance Networks, Inc.

$118K - $170K
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years in Site Reliability Engineering, DevOps, Cloud Infrastructure, or Production Engineering
  • Strong programming skills in Python
  • Experience with Linux systems administration and troubleshooting
  • Hands-on experience with Google Cloud Platform (GCP)
  • Experience with Kubernetes, containers, and cloud-native infrastructure
  • Knowledge of observability tools like Prometheus and Grafana
  • Familiarity with ClickHouse or large-scale telemetry platforms

Responsibilities

  • Operate and enhance high-availability cloud services and infrastructure
  • Troubleshoot production issues across multiple systems
  • Improve observability via metrics and monitoring tools
  • Define and refine SLIs, SLOs, and operational health metrics
  • Participate in incident response for critical events
  • Contribute to post-incident reviews for long-term improvements
  • Develop operational tooling and automation using Python

Benefits

  • Comprehensive medical, dental, and vision plans
  • Life and accidental death insurance
  • 401(k) plan participation
  • Incentive Plan eligibility
  • Eleven paid holidays annually
  • Two weeks of paid vacation
  • Additional leave options

Full Job Description

At Ruckus Networks, you will work on large-scale cloud networking platforms that support enterprise customers globally. You will help improve reliability, automation, observability, and customer experience while working with modern cloud and SRE technologies in a collaborative engineering environment.

How you'll help us connect the world:

Ruckus Networks is looking for a customer-focused Senior Site Reliability Engineer (SRE) to help improve reliability, scalability, operational excellence, and customer experience across our cloud platform ecosystem.

This role is ideal for engineers who enjoy solving production problems, building automation, and improving platform reliability at scale. You will work on distributed systems that power cloud networking services used by customers globally, in a fast-paced environment.

As part of the SRE organization, you will work closely with engineering, cloud operations, and support teams to improve platform stability, observability, automation, and operational readiness.

THIS IS A HYBRID ROLE THAT REQUIRES WORKING ON-SITE AT OUR SUNNYVALE, CA OFFICE 3 DAYS A WEEK. NO RELOCATION OR 3RD-PARTY AGENCIES, PLEASE.

Key Responsibilities:

Reliability Engineering & Operations

* Operate and improve highly available, scalable cloud services and infrastructure

* Troubleshoot production issues across applications, infrastructure, networking, databases, and cloud services

* Improve observability through metrics, logging, tracing, synthetic monitoring, and alerting

* Help define and improve SLIs, SLOs, and operational health metrics

* Participate in incident response and support Sev-1/customer-impacting events

* Contribute to post-incident reviews and long-term reliability improvements

* Improve operational processes, automation, and deployment safety

Automation & Engineering

* Build operational tooling and automation using Python

* Improve operational efficiency through automation and self-service tooling

* Support CI/CD improvements and deployment validation workflows

* Develop health checks, monitoring integrations, and operational diagnostics

Cloud & Infrastructure

* Support services running in Google Cloud Platform (GCP)

* Work with Kubernetes, containers, and cloud-native platforms

* Analyze scalability, performance, and resource utilization

* Collaborate with software engineering teams on operational readiness and reliability improvements

Observability & Monitoring

* Build dashboards, alerts, and telemetry pipelines

* Work with observability platforms such as Prometheus, Grafana, OpenTelemetry, and ELK

* Support monitoring and analytics platforms including ClickHouse

* Improve signal quality and reduce operational alert noise

* Develop synthetic monitoring focused on customer workflows

Collaboration

* Partner with Engineering, Product Management, Customer Support, and Cloud Operations teams

* Participate in architecture and operational readiness discussions

* Mentor junior engineers and contribute to SRE best practices

* Promote operational excellence, ownership, and customer focus

Required Qualifications:

* 5+ years of experience in Site Reliability Engineering, DevOps, Cloud Infrastructure, or Production Engineering

* Strong programming skills in Python

* Experience with Linux systems administration and troubleshooting

* Hands-on experience with Google Cloud Platform (GCP)

* Experience with Kubernetes, containers, and cloud-native infrastructure

* Experience troubleshooting distributed systems in production environments

* Experience with observability tools such as Prometheus, Grafana, OpenTelemetry, or ELK

* Familiarity with ClickHouse or large-scale telemetry platforms

* Understanding of networking fundamentals, APIs, databases, and cloud architectures

* Experience participating in production incident response and operational support

You excite us if you have:

* Experience supporting SaaS or cloud platforms at scale

* Familiarity with Kafka or event-driven architectures

* Experience building automation and monitoring solutions

* Familiarity with wireless networking or enterprise networking platforms

* Experience improving operational processes and reliability practices

Our salary ranges consider a wide variety of factors, including but not limited to benchmarking by independent third-party consultants, skill sets, years of experience, training, education, geography, and other business needs. The range can be higher for candidates with exceptional experience and a demonstrated history of successful performance. This position's expected total compensation (base salary and commission range) is $118,000.00-$170,000.00.

The candidate will be rewarded with a comprehensive benefits package, including medical, dental, and vision plans, life and accidental death insurance, a 401(k) plan, and participation in the Company's Incentive Plan. Candidates starting with the Company will be eligible for eleven paid holidays in a full calendar year, two weeks of paid vacation (prorated based on start date), as well as other leave options.
