Site Reliability Engineer III (SRE III)

Emburse • $100K — $130K *

Toronto, ON M3C 0E3Hybrid

Information Technology

5 - 7 years of experience

Reposted 1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Bachelor's degree in Computer Science or a STEM field required.
Minimum 6 years of experience in reliability-focused engineering or operations role.
Preferred certifications include Certified Kubernetes Administrator (CKA) and/or AWS Certification.
Strong proficiency in Linux-based distributed environments with hands-on experience.
Deep experience with cloud platforms, preferably AWS or Azure, and Infrastructure-as-Code (Terraform).

Responsibilities

Proactively enhance customer experience by identifying and implementing preventative measures.
Ensure services maintain 24/7 availability, scalability, and resilience.
Monitor and troubleshoot to enhance site performance and uptime visibility.
Design and automate cloud infrastructure and platform services for reliability.
Implement Infrastructure-as-Code (IaC) for managing large-scale distributed systems.
Collaborate with engineering teams on project planning and operational goal alignment.
Lead cross-functional troubleshooting for complex operational issues.

Benefits

Mentorship opportunities for professional growth and skill enhancement.
Collaboration with distributed teams, promoting a diverse work environment.
Contribution to innovative projects with a focus on reliability and automation.
Engagement in continuous improvement initiatives and postmortem reviews.

Full Job Description

What you will do :

Service Reliability & Performance

Proactively identify, evaluate, and implement preventative measures to reduce customer impact.
Ensure all services are designed and operated with 24/7 availability, scalability, and resilience in mind.
Monitor, troubleshoot, and provide visibility to improve site latency, performance, and uptime.

Engineering Excellence & Automation

Design, develop, and automate reliable cloud infrastructure and platform services.
Apply Infrastructure-as-Code (IaC) principles to manage large-scale distributed systems.
Write and maintain scripts, tools, and automation frameworks to support operational efficiency.
Partner with engineering leadership to develop solutions enabling developer productivity and remove cross functional dependencies.

Collaboration & Process Development

Collaborate with Platform Engineering teams on project definitions, requirements, backlog grooming, and planning processes.
Align operational goals with product and engineering roadmaps to ensure reliability requirements are met early in the lifecycle.
Define non-functional requirements (NFRs) and influence standards for scalability, observability, and fault tolerance.
Lead cross-functional troubleshooting of complex issues spanning applications, infrastructure, databases, and networks.

Leadership & Mentorship

Serve as a technical mentor to SRE I and II engineers, guiding them in best practices for reliability, automation, and incident management.
Lead root cause analysis and postmortem reviews, driving continuous improvement initiatives.
Support offshore and distributed teams, promoting effective collaboration and communication.
Participate in design and architecture reviews, offering technical recommendations and documentation for key stakeholders

What we are looking for :

Education:

Required: Bachelor's degree in Computer Science or a STEM field

Experience:

Minimum 6 years of experience in an engineering or operations role with a focus on reliability, scalability, and automation.

Certifications:

Preferred: Certified Kubernetes Administrator (CKA) and/or AWS Certification

Additional Eligibility Qualifications

Required Skills:

Strong proficiency in Linux-based distributed environments (up to 70% hands-on work).
Deep experience with cloud platforms (AWS or Azure) and Infrastructure-as-Code (Terraform).
Excellent scripting skills (Python, Bash, Powershell); object-oriented programming experience is a plus.
Demonstrated ability to develop and maintain internal tools and automation solutions.
Excellent written and verbal communication skills in English.
Strong project management and organizational abilities with a bias for action.
Experience collaborating with offshore or globally distributed teams.
Expertise in containerization and orchestration technologies (Docker, Kubernetes).
Experience with Kubernetes scaling tooling (Karpenter, KEDA).
Strong understanding of DevOps principles and modern CI/CD pipelines.
Experience with observability stacks (Prometheus, Grafana, OpenTelemetry).
Familiarity with self-healing systems, and site reliability best practices.
Background in SaaS environments or large-scale distributed applications.
Analytical thinker with a focus on root-cause problem solving.
Self-starter with a strong ownership mentality and accountability.
Mentor and collaborator who uplifts teams and promotes learning culture.
Committed to operational excellence and continuous improvement.

About Emburse

Emburse is a global leader in expense management and AP automation solutions, with more than 4,500 customers and 10 offices worldwide. The company?s expense management platform automates the purchase-to-reimbursement process, helping businesses make better decisions, streamline operations, and improve employee productivity. Emburse?s solutions are used by companies of all sizes, from small businesses to Fortune 500 corporations, and across a range of industries, including healthcare, retail, and technology. The company was founded in 2011 and is headquartered in San Francisco, California.

Learn more about Emburse

Size

500 employees

Industry

Finance & Insurance

Founded

2014

* Ladders Estimates

Similar Jobs

Application Engineer - Power Platform Developer
$90K — $130K *
ASM Research
Remote
Reposted Yesterday
Senior Site Reliability Engineer
$120K — $150K *
Stack AV
Remote
2 days ago
Senior Site Reliability Engineer
$110K — $140K *
Stack AV
Pittsburgh, PA 15237 (Allegheny County)
2 days ago
Senior Site Reliability Engineer
$100K — $130K *
Royal Bank of Canada
Mississauga, ON L4T 0A1
1 week ago
DevOps / Site Reliability Engineer (SRE) - API Platform
$90K — $130K *
HTC Global Services
Dearborn, MI 48126 (Wayne County)
1 week ago
Senior Site Reliability Engineer (Remote Build)
$54K — $150K *
Remote
Remote
1 week ago

Get Ready For Your
Next Interview

More Jobs at Emburse

Director, Renewal Operations
$120K — $160K *
Dallas, TX 75217 (Dallas County)
1 week ago
Enterprise Technology
Hybrid
Senior Software Engineer - C#
$110K — $140K *
Dallas, TX 75217 (Dallas County)
Reposted 1 week ago
Information Technology
Hybrid
Manager, Financial Planning and Analysis (B2B, GTM, SaaS)
$100K — $130K *
Addison, TX 75001 (Dallas County)
3 weeks ago
Finance & Insurance
Hybrid
Director, Marketing (B2B)
$120K — $150K *
Addison, TX 75001 (Dallas County)
3 weeks ago
Business Services
Hybrid
Senior Marketing Manager
$90K — $130K *
Addison, TX 75001 (Dallas County)
3 weeks ago
Enterprise Technology
Hybrid

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
1 week ago
Senior Penetration Tester - Web & Hardware/IoT
$120K — $150K *
JP Morgan Chase & Co.
Brooklyn, NY 11245 (Kings County)
Today
Lead Site Reliability Engineer
$120K — $150K *
JP Morgan Chase & Co.
Plano, TX 75024 (Collin County)
Today
Network Lead Infrastructure Engineer- Risk remediation
$110K — $140K *
JP Morgan Chase & Co.
Plano, TX 75024 (Collin County)
Today
Software Engineer III - AWS Cloud Engineer
$100K — $130K *
JP Morgan Chase & Co.
Columbus, OH 43240 (Delaware County)
Reposted Today

Find similar Site Reliability Engineer III (SRE III) jobs:

Nationwide Toronto, ON

Site Reliability Engineer III (SRE III)

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Site Reliability Engineer III (SRE III) jobs:

Get Ready For Your
Next Interview