Service Reliability Engineer

OpenEye

$80K — $110K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 1-5 years relevant experience in Service Reliability or a similar role
  • Proficient in AWS and CI/CD technologies, with infrastructure automation skills
  • Experience in scripting languages or development in C#, Java, or C++
  • Understanding of development practices and TCP/IP network protocols
  • Familiar with monitoring tools such as Coralogix, Datadog, Prometheus, or Grafana
  • Knowledge of Agile methodologies and project management tools like Jira
  • Strong critical thinking and problem-solving abilities

Responsibilities

  • Engineer and implement solutions to enhance system integrity and reliability
  • Monitor and improve system metrics, driving service improvements
  • Operate and improve monitoring and alerting systems
  • Participate in incident response and postmortems, developing tools to reduce recurrence
  • Collaborate with cross-functional teams on strategic reliability initiatives
  • Define and enhance service-level indicators for platform reliability
  • Identify performance trends and engineer proactive solutions

Benefits

  • Eligible for an annual discretionary bonus
  • Discounted company stock purchase option
  • Collaborative and creative work culture
  • Casual dress code
  • Medical, dental, vision & prescription benefits starting from day 1
  • Up to $5,000 annual company match for 401k contributions
  • Paid maternity and paternity leave
  • 15 days of paid vacation, increasing after 3 years
  • Flexible hybrid work schedule
  • Educational Assistance Program for degree support
Full Job Description
Job Summary

As a Service Reliability Engineer (SRE) at OpenEye, you will play a hands-on role in elevating the reliability, quality, and consistency of our software and release processes. Embedded within the DevOps team and partnering closely with the Service Reliability Manager, you'll engineer and implement initiatives that ensure stable releases and exceptional customer experience from validation, to deployment, through post-release performance.

As SRE you will be responsible for maintaining system reliability, automating operations, and monitoring key metrics and service-level indicators. You'll participate in incident response and root cause analysis, while collaborating with development and DevOps teams to enhance observability and tooling. Alongside shared responsibility for CI/CD health and service-level objectives, you'll work closely with QA, engineering, and product teams to champion quality throughout the entire software lifecycle. Your contributions will span both strategic and tactical domains, blending hands-on technical execution with a strong, customer-focused mindset.

Role and Responsibilities
  • Engineer and implement solutions that increase system integrity, scalability, security, and reliability.
  • Monitor, analyze, and improve system metrics, proactively driving service improvements and customer experience.
  • Implement, operate, and continuously improve monitoring, alerting, and observability systems.
  • Participate in incident response, postmortems, and release health processes; develop tools and dashboards to minimize recurrence.
  • Partner with the Service Reliability Manager and cross-functional teams to plan and deliver strategic reliability and quality initiatives.
  • Define, measure, and improve service-level indicators (SLIs) and objectives (SLOs) for platform reliability.
  • Identify trends and risks in system performance, stability, and customer satisfaction; engineer proactive solutions.
  • Champion operational excellence, reliability, and quality throughout the software development and deployment lifecycle.
  • Advocate for the customer experience, ensuring all operational and engineering efforts support end-user satisfaction and robust performance.
  • Collaborate with DevOps Engineers on CI/CD pipeline optimization, environment maintenance, automation, and troubleshooting.
  • Participate in capacity planning and performance tuning.

The Tech Stack
  • AWS, Coralogix, Typescript, MySQL, CrateDB, Git, Java, JavaScript

Qualifications
  • 1-5 years related experience and/or training
  • Technical proficiency in cloud environments (AWS preferred), CI/CD technologies, infrastructure automation, and monitoring tools
  • Experience in a scripting language and/or C# and/or Java and/or C++ development language
  • Solid understanding of development practices and TCP/IP network protocols
  • Experience with service monitoring tools (e.g. Coralogix, Datadog, Prometheus, or Grafana)
  • Familiarity with Agile methodologies and project management tools (e.g. Jira)
  • Ability to quickly learn new technologies and practices
  • Excellent critical thinking and problem-solving skills
  • Strong quality ethic and test-first attitude
  • Great communication and teamwork skills

Please note that sponsorship of new applicants for employment authorization, or any other immigration-related support, is not available for this position at this time.

The Perks!
  • The pay range for this opportunity is $80,000 - $110,000 annually. In addition, this position is eligible for an annual discretionary bonus.
  • Employees are eligible to purchase company stock at a discounted rate
  • Collaborative, fun, creative culture where idea sharing is encouraged
  • Casual dress (Jeans are welcome!)
  • Medical, dental, vision & prescription benefits starting day 1! Generous medical plan subsidy and health savings account option with company contribution helps keep your costs low.
  • Up to $5,000 annual company match for 401k
  • Company paid short-term/long-term disability, AD&D and life insurance
  • Paid maternity and paternity leave
  • 15 Days of Paid Vacation accrued per year (increases after year 3)
  • 7 Paid Sick/Wellness days per year
  • 9 Paid Holidays per year
  • This position is eligible for a flexible hybrid work schedule
  • Educational Assistance Program covering non-degree support, undergraduate and graduate degrees
  • Employee Equipment Program - Free Alarm.com system for your home!

The base salary range of this opportunity is listed below and is determined within a range based on factors including qualifications, location and experience. This allows opportunity for growth and development within the role. The base salary offered is part of a total compensation package.

Base Salary Range

$80,000-$100,000 USD

Similar Jobs

More Jobs at OpenEye

More Information Technology Jobs

Find similar Service Reliability Engineer jobs: