System Reliability Engineer

Compunnel

$70K — $95K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 2-5 years of relevant experience in systems reliability engineering.
  • Strong troubleshooting skills with a focus on root cause analysis.
  • Excellent communication and interpersonal skills for effective collaboration.
  • Proficient in Linux system administration and other system tasks.
  • Basic knowledge of scripting languages like Python or PERL for automation.
  • Hands-on experience with monitoring tools like AppDynamics, Grafana, Splunk, Dynatrace.
  • Familiarity with modern architectures, including cloud services and distributed systems.

Responsibilities

  • Troubleshoot hardware, software, application, and network issues.
  • Collaborate with development teams to design and maintain systems.
  • Drive automation initiatives for deployment and service management.
  • Identify and mitigate systems reliability risks proactively.
  • Participate in follow-the-sun support with a weekend on-call rotation.
  • Represent the RPE team in design reviews and operational readiness assessments.

Benefits

  • Onsite presence required three times a week.
  • Work in a fast-paced, dynamic environment with a collaborative culture.
  • Gain exposure to modern technologies and systems architecture.
  • Opportunity to improve and expand SRE capabilities.
Full Job Description
JOB SUMMARY
The Reliability and Production Engineering (RPE) team is seeking skilled professionals with 2-5 years of experience for a Systems Reliability Engineer role in Montreal. This position requires a passion for production support and real-time problem-solving within a dynamic, fast-paced environment that emphasizes face-to-face communication with technology and business partners. The role is central to growing SRE capabilities and involves improving system service availability, observability, scalability, performance, and resilience by applying software engineering principles and modern technology. The ideal candidate will possess strong technical skills, sound interpersonal abilities, and a proactive approach to system improvement, with a requirement for onsite presence three times per week for day one onboarding.

Key Responsibilities
• Troubleshoot issues across hardware, software, application, and network.
• Collaborate with engineering/development teams to design, build, and maintain systems.
• Identify and drive opportunities for platform automation, including deployment, management, and visibility of services.
• Proactively identify and address systems reliability risks.
• Work alongside global and regional team members on a follow-the-sun basis, including a weekend on-call rotation.
• Represent the RPE organization in design reviews and operational readiness exercises.

Required Qualifications
• 2-5 years of relevant experience.
• Demonstrated ability to troubleshoot problems and debug to identify root cause.
• Excellent communication and interpersonal skills with a professional ownership of issues.
• Strong technical, analytical, and problem-solving skills.
• Ability to present technology problems clearly to both technical and non-technical audiences.
• Good working knowledge of Linux system administration.
• Ability to handle other system administration tasks.
• Basic use of a scripting language (Python, PERL).
• Hands-on experience with enterprise tools such as AppDynamics, Grafana, Splunk, Dynatrace.
• Awareness of, and ability to reason about modern software and systems architectures, including load-balancing, databases, queueing, caching, distributed systems failure modes, micro services, Cloud, etc.
• Onsite presence required 3x/week in Montreal for day 1 onboarding.

Preferred Qualifications
• Experience with Ansible, GitHub, or any automation/configuration/release management tools.
• Automation-related experience using scripting languages such as python, bash, perl, ruby.
• Practical experience supporting large scale systems.

Similar Jobs

More Jobs at Compunnel

More Information Technology Jobs

Find similar System Reliability Engineer jobs: