DevOps Engineer - BTP Site Reliability Engineering team

SAP • $97K — $166K *

Montreal, QC H1A 0A1In-Person

Technical Services

Less than 5 years of experience

Reposted Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

3+ years of experience in Site Reliability Engineering (SRE)
Proficient in Kubernetes and container technologies
Scripting skills with CI/CD tools (e.g., ArgoCD, GitHub Actions)
Strong analytical skills for problem-solving in high-pressure situations
Excellent communication skills in English, both spoken and written

Responsibilities

Act as the technical expert during live site incidents, troubleshooting issues deeply
Drive root cause analysis to prevent future incidents
Perform troubleshooting and log analysis based on service agreements
Develop software solutions to enhance service reliability and stability
Enhance monitoring infrastructure by implementing recovery tools and metrics
Collaborate with development teams to integrate postmortem findings
Maintain technical documentation and advocate for SRE best practices

Benefits

Hybrid work model: 3 days in-office, 2 days remote
Opportunity to work on critical cloud services
Participation in on-call rotation with special compensation
Engagement in continuous learning of new technologies
Exposure to innovative SAP technologies and open-source tools

Full Job Description

This is a hybrid role based out of SAP Montreal office, working in-office with the team 3 days per week.
Candidates must be legally entitled to work in Canada at the time of application. This position is not eligible for employer-sponsored work authorization (e.g., LMIA or other immigration support).

What you'll do

We are looking for an engineer to join an already established SRE team for the SAP Business Technology Platform.

As a Site Reliability Engineer, you will have the opportunity to operate and support business critical Cloud services. As part of your daily job, you will proactively monitor the service behavior and identify areas for improvement. You will participate in the development of tools for monitoring and troubleshooting cloud services built on latest open source and SAP technologies, following SRE principles.

Responsibilities:

Act as technical expert during Live site incidents (downtimes of supported services in scope), investigate and solve incidents on a deep technical level.
Drive root cause analysis and follow-up improvements to prevent issues from reoccurring.
Perform in-depth troubleshooting and log analysis to identify and solve complex issues in accordance with internal and external SLAs.
Build software-based solutions to address improvements in service reliability and stability.
Enhance infrastructure and platform monitoring by gathering system metrics (4 Golden Signals) and implementing tools for recovery.
Integrate and collaborate closely with development teams and work with them on outputs from Postmortems and product improvements.
Learn new technologies and keep up to date with latest development increments.
Create and maintain technical documentation.
Define, advocate, apply SRE best practices.
Participate in the on-call rotation (follow the sun approach) to react to major incidents. On-call has a special compensation package.

What you bring

Experience with Kubernetes and good understanding of container technologies.
3+ years experience in SRE.
Understanding of modern cloud architectures (experience with Cloud Platforms such as AWS, Azure, GCP are a plus).
Scripting skills, CI/CD (ArgoCD, Concourse, Github Actions and are a plus) - enthusiasm for automation - make the computers do the work for
you.
Working efficiently in emergency situations. Affinity to quickly analyze and solve problems in a global team setup.
Excellent team player, passionate about his/her work, self-motivated and driven.
Excellent communication skills - precise, based on facts.
Fluency in English.
Preferred Additional Skills and Competencies:
- Coding experience with GO, Python, Bash
- CKA/CKAD/CKS certifications
- Experience with Unix/Linux operating system
- Experience with modern monitoring, logging, and alerting tools (Grafana, Prometheus, Kibana, Loki, Splunk On-Call, Dynatrace)
- Security best practices for application development and operations in a public Cloud Environment
- Contribution to open-source projects

Meet the team

The Reliability Engineering organization provides multitude of products and services related to operations and continuity of business delivery.

The Site Reliability Engineering teams make the SAP Business Technology Platform run better by providing 24x7 deep technical coverage for Incident Management (Outages and other incidents with major customer impact) applying SRE principles. We share a Live Site First culture and care for the business continuity of our customers running mission critical applications in the Cloud.

#LI-GL1

Requisition ID: 450691 | Work Area: Software-Development Operations | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time | Additional Locations: #LI-Hybrid

Requisition ID: 450691

Posted Date: May 6, 2026

Work Area: Software-Development Operations

Career Status: Professional

Employment Type: Regular Full Time

Expected Travel: 0 - 10%

Location:

* Ladders Estimates

Similar Jobs

Support Engineer III
$70K — $109K *
Minuteman Security and Life Safety
Remote
Today
Eng Sr - Sys
$100K — $130K *
BAE Systems
Nashua, NH 03060 (Hillsborough County)
Reposted Today
Lead Satellite Engineer
$100K — $150K *
Calian Group Ltd.
Montreal, QC H1A 0A1
Today
IT/OT Engineer III
$100K — $120K *
Stellix
Boxborough, MA 01719 (Middlesex County)
Today
Senior Systems Engineer (MBSE/DOORS/CAMEO) preferred
$118K — $131K *
General Dynamics
Pittsfield, MA 01201 (Berkshire County)
Today
Senior Systems Engineer - Unmanned Underwater Vehicles
$109K — $121K *
General Dynamics
Quincy, MA 02169 (Norfolk County)
Today

Get Ready For Your
Next Interview

More Jobs at SAP

Solution Advisor Senior Specialist FP&A and Financial Reporting
$137K — $294K *
Burlington, MA 01803 (Middlesex County)
Today
Finance & Insurance
In-Person
DevOps Engineer - BTP Site Reliability Engineering team
$97K — $166K *
Montreal, QC H1A 0A1
Reposted Today
Technical Services
In-Person
Solution Sales Expert - Finance and Spend (Finance) - Public Sector Regulated Industries
$256K — $435K *
Plano, TX 75025 (Collin County)
Today
Finance & Insurance
In-Person
SAP Concur: Sales Development Specialist
$78K — $180K *
Saint Louis Park, MN 55436 (Hennepin County)
Yesterday
Business Services
In-Person
Senior Account Executive - Federal Civilian
$186K — $397K *
Washington, DC 20011 (District Of Columbia County)
Yesterday
Enterprise Technology
In-Person

More Technical Services Jobs

Sr. Technical Infrastructure Program Manager, Robotics Technical Services Retrofits
$148K — $201K *
Amazon
Reading, MA 01867 (Middlesex County)
Today
Sales Manager (technical sales, drive technology exp.)
$70K — $95K *
Talent Search PRO
Ponchatoula, LA 70454 (Tangipahoa County)
Reposted Today
Sr. Technical Program Manager, Advertising Services, Amazon Ads
$163K — $221K *
Amazon
New York, NY 10025 (New York County)
Today
Operations Engineer
$75K — $95K *
NCD Agency LLC
Dallas, TX 75217 (Dallas County)
Today
Workday Consultant
$94K — $132K *
Tata Consultancy Services
Sunnyvale, CA 94087 (Santa Clara County)
Today

Find similar DevOps Engineer - BTP Site Reliability Engineering team jobs:

Nationwide Montreal, QC

DevOps Engineer - BTP Site Reliability Engineering team

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar DevOps Engineer - BTP Site Reliability Engineering team jobs:

Get Ready For Your
Next Interview