Relx Group

Senior Site Reliability Engineer

Relx Group, $104K - $174K
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience deploying and managing containerized applications.
  • In-depth knowledge of Kubernetes architecture and operational practices.
  • Proven AWS services expertise and architectural understanding.
  • Advanced skills in Infrastructure as Code (IaC) development.
  • Strong Python scripting skills for automation tasks.
  • Experience in designing CI/CD workflows using GitHub Actions.
  • Familiarity with monitoring tools like NewRelic and security best practices.

Responsibilities

  • Design and maintain scalable Kubernetes clusters on AWS EKS.
  • Manage and optimize cloud infrastructure across multiple portfolios.
  • Develop and implement Infrastructure as Code solutions for resource management.
  • Automate operational workflows and incident response processes.
  • Implement CI/CD pipelines for deployment and testing.
  • Support multi-regional infrastructure with a focus on high availability and rapid resolution.
  • Document processes and best practices, providing clear guidance to users.
  • Mentor junior team members in SRE practices and cloud architecture.

Benefits

  • Comprehensive country-specific benefits offered.
  • Opportunities for professional growth and mentorship.
  • Flexible working arrangements to support work-life balance.
  • Access to advanced tools and technologies in a collaborative environment.
Full Job Description
About the role: We are looking to immediately hire a highly skilled and proactive Senior SRE to join our dynamic team. You will combine software thinking and service operations to enable and run Elsevier's large-scale, 24x7, distributed and fault-tolerant systems within agreed reliability objectives, whilst enabling the fast flow of feature and service updates. The successful candidate will possess deep expertise in cloud-native architectures, along with strong automation skills.

About the team: This diverse team of engineers assists multiple product teams as we continue to innovate across all of our products within our global AWS cloud landscape.

Key Responsibilities:
  • Designing, deploying, and maintaining highly available, scalable Kubernetes clusters on AWS EKS as well as the supporting ecosystem.
  • Managing and optimizing cross-portfolio cloud infrastructure, leveraging AWS services and supported organizational tooling.
  • Developing and maintaining Infrastructure as Code (IaC) solutions to automate provisioning and management of cloud and Kubernetes resources.
  • Writing automation processes to streamline operational workflows, incident response, and infrastructure management.
  • Implementing CI/CD pipelines to facilitate deployments, testing, and validation.
  • Supporting multi-regional critical infrastructure, ensuring high availability and rapid incident resolution. Monitoring system health, instrumenting system components, troubleshooting issues, and performing root cause analysis.
  • Managing and supporting a complex cross-portfolio environment, coordinating across teams to ensure consistency and reliability.
  • Maintaining comprehensive documentation and best practice guides for solutions, ensuring users have clear instructions and support to effectively implement and operate their systems.
  • Mentoring junior team members and promoting best practices in SRE, automation, and cloud architecture.


Technical Skills & Qualifications:
  • Extensive experience deploying, managing, and troubleshooting containerised applications.
  • Deep understanding of Kubernetes architecture, networking, security, storage, and operational best practices.
  • Proven expertise with AWS services and architectural principles.
  • Extensive knowledge of AWS security, compliance, and best practices.
  • Advanced skills in writing modular, reusable IaC components.
  • Strong Python scripting skills for automation, tooling, and data processing.
  • Ability to develop custom solutions for monitoring, automation, and incident management.
  • Experience designing and maintaining CI/CD workflows using GitHub Actions.
  • Current experience automating deployment pipelines, testing, and validation processes.
  • Familiarity with monitoring tools such as NewRelic. Knowledge of security best practices, network policies, and enterprise-grade RBAC policies.


U.S. National Base Pay Range: $104,900 - $174,700. Geographic differentials may apply in some locations to better reflect local market rates. If performed in Maryland, the base pay range is $110,100 - $183,500. If performed in New Jersey, the base pay range is $118,349 - $189,051. This job is eligible for an annual incentive bonus.
We know your well-being and happiness are key to a long and successful career. We are delighted to offer country-specific benefits.

About Relx Group

RELX Group is a global provider of information-based analytics and decision tools for professional and business customers. The company operates in four market segments: scientific, technical and medical; risk and business analytics; legal; and exhibitions. RELX's products and services include electronic databases, online information services, workflow tools, and print and digital books. The company was founded in 1993 and is headquartered in London, England.
Size: 33,500 employees
Market Cap: $53.1 billion
Net Income: $1.2 billion
Founded: 2018
5 Year Trend: +1%
Revenue: $7.1 billion
