RingCentral

Site Reliability Engineer

RingCentral: $94K - $135K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 6+ years of experience as a Site Reliability Engineer or in a similar role.
  • Strong problem-solving and troubleshooting abilities.
  • In-depth knowledge of Linux servers.
  • Familiarity with at least one programming language.
  • Experience with cloud platforms such as AWS, Azure, or GCP.
  • Knowledge of configuration management tools and practices.
  • Ability to work collaboratively in a diverse, global team environment.

Responsibilities

  • Collaborate with development and operations to integrate monitoring solutions.
  • Define and drive improvements for monitoring and self-healing services.
  • Design redundancy and load-balancing strategies for reliability.
  • Conduct risk assessments to identify and propose solutions for infrastructure failures.
  • Respond to incidents and outages promptly during on-call duties.
  • Monitor capacity requirements in a scaling environment.
  • Extend observability by working with various codebases.

Benefits

  • Comprehensive medical, dental, vision, disability, and life insurance.
  • Health Savings Account (HSA) and Flexible Spending Accounts (FSAs).
  • 401K match and Employee Stock Purchase Plan (ESPP).
  • Generous paid time off, sick leave, and parental leave.
  • Family-forming benefits, including IVF and adoption assistance.
  • Emergency backup care for dependents and pets.
  • Employee Assistance Program (EAP) offering counseling support.
  • Free legal services for advice and estate planning.
  • Bonus referral program for employees.
  • Student loan refinancing assistance.
  • Employee perks and discounts program.

Full Job Description
This is where you and your skills come in. We're currently looking for an experienced Site Reliability Engineer (SRE) to join the RingCentral Collaboration team. As an SRE, you will be responsible for maintaining and improving uptime and availability across several of our services. You will play a crucial role in ensuring the reliability, performance, and availability of our services by identifying potential issues and proactively resolving them. The ideal candidate has a background in various service observability platforms as well as experience with containerization using Kubernetes, message queuing systems like Kafka, and SQL/NoSQL databases. Programming experience is desired for the role.

Job Duties:
  • Collaborate with development and operations teams to integrate monitoring solutions into the software development lifecycle and operational processes.
  • Define, propose, and drive efforts to continually improve monitoring, troubleshooting, and self-healing for our services.
  • Design and implement redundancy, failover mechanisms, and load-balancing strategies to ensure system reliability.
  • Conduct risk assessments, identify potential points of failure in the infrastructure, and propose solutions to fix them.
  • Respond to incidents and outages while on call, and take action to mitigate them.
  • Stay on top of capacity requirements in a growing environment.
  • Actively work with various teams' codebases to extend observability and improve uptime.
  • Represent the team in global incident resolution and participate in the on-call rotation.


To succeed in this role you must have:
  • 6+ years of proven experience as an SRE or in a similar role.
  • Problem-solving and troubleshooting skills.
  • In-depth knowledge of Linux.
  • Knowledge of at least one of the programming languages (see Preferable Technology Stack).
  • Experience with cloud platforms.
  • Knowledge of one or more configuration management tools.
  • Ability to work in a diverse multicultural environment, communicating with globally distributed teams.
  • A team-player mindset, the ability to self-start, and a strong drive to dig deep and solve problems.
  • Fluency in spoken and written English.


Preferable Technology Stack:
  • OS: Linux (CentOS/RedHat/Oracle/Amazon Linux)
  • Programming languages: Python, JavaScript, Java
  • Scripting languages: Bash, Go
  • Cloud: AWS, Azure, GCP
  • Containerization: Kubernetes
  • Distributed Log: Kafka, ELK stack
  • Monitoring: Zabbix, Prometheus, Alertmanager, Grafana
  • DBs: VictoriaMetrics, MongoDB, PostgreSQL, MySQL
  • IaC: Ansible, Terraform
  • GitOps: ArgoCD
  • CI: GitLab CI, Jenkins
  • VCS: GitLab
  • HA: Nginx Proxy


Desired Qualifications:
  • B.S. in Computer Engineering, Computer Science, or equivalent experience with 4+ years of related experience
  • Proven experience with influencing the software engineering of cloud/SaaS services
  • Familiarity with AI, LLM, and various related technologies
  • Deep understanding of the DevOps lifecycle and its application within organizations
  • Deep understanding of SRE principles and fundamentals


What we offer:
  • Comprehensive medical, dental, vision, disability, and life insurance
  • Health Savings Account (HSA), Flexible Spending Accounts (FSAs), and commuter benefits
  • 401K match and ESPP
  • Paid time off and paid sick leave
  • Paid parental and pregnancy leave, and new parent gift boxes
  • Family-forming benefits (IVF, preservation, adoption, etc.)
  • Emergency backup care (child, adult, pets)
  • Employee Assistance Program (EAP) with counseling sessions available 24/7
  • Free legal services that provide legal advice, document creation, and estate planning
  • Employee bonus referral program
  • Student loan refinancing assistance
  • Employee perks and discounts program


RingCentral's Engineering team works on high-complexity projects that set the standard for performance and reliability at massive scale. What kind of scale? Millions of users today and hundreds of millions tomorrow. This is your chance to help imagine, develop, and deliver products that raise the technological bar and power human connections. If you're a talented, ambitious, creative thinker, RingCentral is the perfect place to join a world-class team and bring your ideas to life.

About RingCentral

RingCentral is a cloud-based communication and collaboration platform that provides businesses with a range of tools to manage their communications and enhance their productivity. The company offers a variety of services, including voice, video, messaging, and collaboration tools, all of which are accessible from a single platform. RingCentral's platform is designed to be flexible and scalable, making it suitable for businesses of all sizes and industries. The company was founded in 1999 and is headquartered in Belmont, California.

Size: 3,919 employees
Market Cap: $3.2 billion
Net Income: -$83 million
Founded: 2003
5 Year Trend: +33.2%
Revenue: $1.1 billion
NASDAQ
