Relx Group

Senior Site Reliability Engineer

Relx Group$104K — $174K *
US-Anywhere
+ 8 other locationsRemote
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Extensive experience deploying, managing, and troubleshooting containerized applications.
  • Deep understanding of Kubernetes architecture, networking, security, and operational best practices.
  • Proven expertise with AWS services and architectural principles.
  • Extensive knowledge of AWS security, compliance, and best practices.
  • Advanced skills in writing modular, reusable IaC components.
  • Strong Python scripting skills for automation, tooling, and data processing.
  • Ability to develop custom solutions for monitoring, automation, and incident management.

Responsibilities

  • Design, deploy, and maintain highly available, scalable Kubernetes clusters on AWS EKS.
  • Manage and optimize cross-portfolio cloud infrastructure, leveraging AWS services.
  • Develop and maintain Infrastructure as Code (IaC) solutions for cloud and Kubernetes resources.
  • Write automation processes to streamline operational workflows and incident response.
  • Implement CI/CD pipelines for deployments, testing, and validation.
  • Support multi-regional critical infrastructure, ensuring high availability and rapid incident resolution.
  • Mentor junior team members and promote best practices in SRE, automation, and cloud architecture.

Benefits

  • Comprehensive country-specific benefits packages to support employee well-being and happiness.
Full Job Description
We are looking to immediately hire a highly skilled and proactive Senior SRE to join our dynamic team. You will combine software thinking and service operations to enable and run Elsevier's large-scale, 24x7, distributed and fault-tolerant systems within agreed reliability objectives, whilst enabling the fast flow of feature and service updates. The successful candidate will possess deep expertise in cloud-native architectures, along with strong automation skills.

About team; This diverse team of Engineers in assisting multiple product teams as we continue to innovate all of our products within our global Cloud AWS landscape.

Key Responsibilities:
  • Designing, deploying, and maintaining highly available, scalable Kubernetes clusters on AWS EKS as well as the supporting ecosystem.
  • Managing and optimizing cross-portfolio cloud infrastructure, leveraging AWS services and supported organizational tooling
  • Developing and maintaining Infrastructure as Code (IaC) solutions to automate provisioning and management of cloud and Kubernetes resources.
  • Writing automation processes to streamline operational workflows, incident response, and infrastructure management.
  • Implementing CI/CD pipelines to facilitate deployments, testing, and validation.
  • Supporting multi-regional critical infrastructure, ensuring high availability and rapid incident resolution. Monitoring system health, instrument system components, troubleshoot issues, and perform root cause analysis.
  • Managing and supporting a complex cross-portfolio environment, coordinating across teams to ensure consistency and reliability.
  • Maintaining comprehensive documentation and best practice guides for solutions, ensuring users have clear instructions and support to effectively implement and operate their systems.
  • Mentoring junior team members and promoting best practices in SRE, automation, and cloud architecture.


Technical Skills & Qualifications:
  • Extensive experience deploying, managing, and troubleshooting containerised applications.
  • Deep understanding of Kubernetes architecture, networking, security, storage, and operational best practices.
  • Proven expertise with AWS services and architectural principles.
  • Extensive knowledge of AWS security, compliance, and best practices.
  • Advanced skills in writing modular, reusable IaC components.
  • Strong Python scripting skills for automation, tooling, and data processing.
  • Ability to develop custom solutions for monitoring, automation, and incident management. Experience designing and maintaining CI/CD workflows using GitHub Actions.
  • Curren experience Automating deployment pipelines, testing, and validation processes.
  • Familiarity with monitoring tools such as NewRelic. Knowledge of security best practices, network policies, and enterprise-grade RBAC policies.


U.S. National Base Pay Range: $104,900 - $174,700. Geographic differentials may apply in some locations to better reflect local market rates.If performed in Maryland, the base pay range is $110,100 - $183,500.If performed in New Jersey, the base pay range is $118,349 - $189,051.This job is eligible for an annual incentive bonus.
We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location.

About Relx Group

RELX Group is a global provider of information-based analytics and decision tools for professional and business customers. The company operates in four market segments: scientific, technical and medical; risk and business analytics; legal; and exhibitions. RELX's products and services include electronic databases, online information services, workflow tools, and print and digital books. The company was founded in 1993 and is headquartered in London, England.
Learn more about Relx Group
Size
33,500 employees
Market Cap
$53.1 billion
Industry
Net Income
$1.2 billion
Founded
2018
5 Year Trend
+1%
Revenue
$7.1 billion
NASDAQ

Similar Jobs

More Jobs at Relx Group

More Information Technology Jobs

Find similar Senior Site Reliability Engineer jobs: