Job DescriptionWhat is the opportunity? Reporting to senior technology leadership, you'll lead end-to-end DevOps transformation initiatives, champion automation and reliability, and ensure our infrastructure and delivery pipelines meet the highest standards of performance, security, and operational resilience. In this strategic role, you'll be responsible for designing, building, and maintaining highly available, secure, and scalable platform solutions that power mission-critical applications across the organization.
What will you do? - Lead the design, implementation, and management of containerized application platforms using Docker and Kubernetes, ensuring scalable, resilient, and self-healing infrastructure across development, staging, and production environments.
- Administer and optimize web server and application server environments including Apache HTTP Server, Nginx, Apache Tomcat, and JBoss/WildFly, ensuring high availability, performance tuning, and secure configurations.
- Build and maintain centralized logging and observability platforms using the ELK Stack (Elasticsearch, Logstash, Kibana), enabling real-time log aggregation, search, and visualization for operational insights.
- Manage SSL/TLS certificate lifecycle including provisioning, renewal, rotation, and revocation across all environments, ensuring compliance with enterprise security policies and zero certificate-related downtime.
- Collaborate with application development teams working with Node.js, Java, and middleware platforms such as OpenText TeamSite to ensure seamless deployment, integration, and platform support.
- Implement and maintain SSO, SAML, and OIDC-based authentication and authorization solutions, working closely with identity and access management teams to secure application access.
- Lead vulnerability management efforts across the platform stack, coordinating remediation of security findings, applying patches, and ensuring compliance with enterprise security frameworks and audit requirements.
- Serve as a senior escalation point for production incidents, leading troubleshooting efforts across the full technology stack (network, OS, middleware, application, database) to ensure rapid resolution and minimal business impact.
What do you need to succeed? Must Have: - Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field with 7+ years of hands-on experience in DevOps, platform engineering, site reliability engineering (SRE), or infrastructure operations.
- Expert-level proficiency with container orchestration platforms - Docker for containerization and Kubernetes for orchestration - including cluster management, Helm charts, service mesh, and production-grade deployment strategies (blue-green, canary, rolling updates).
- Solid hands-on experience administering web and application servers - Apache HTTP Server, Nginx, Apache Tomcat, and JBoss/WildFly - including virtual host configuration, reverse proxy setup, load balancing, performance tuning, and security hardening.
- Working knowledge of SSO, SAML 2.0, and OpenID Connect (OIDC) authentication protocols, with experience integrating identity providers and securing enterprise applications.
- Experience with vulnerability management processes including scanning, triage, remediation tracking, and compliance reporting across infrastructure and application layers.
Nice to have: - Familiarity with middleware platforms such as OpenText TeamSite and application frameworks like Node.js and Java in enterprise environments.
- Demonstrated ability to troubleshoot complex production issues across the full stack (network, OS, middleware, application, database) under pressure with minimal business disruption.
- Proficiency with the ELK Stack (Elasticsearch, Logstash, Kibana) for centralized log management, including pipeline design, index lifecycle management, and building operational dashboards.
- Hands-on experience with RHEL and Windows Server administration, including OS hardening, patch management, performance troubleshooting, and security with thorough understanding of SSL/TLS certificate management.
What's in it for you? - A comprehensive Total Rewards Program including bonuses and flexible benefits, competitive compensation, commissions, and stock where applicable.
- Leaders who support your development through coaching and managing opportunities.
- Ability to make a difference and have a lasting impact.
- Work in a dynamic, collaborative, progressive, and high-performing team.
- Opportunities to do challenging work.
- Opportunities to take on progressively greater responsibilities.
- Flexible work/life balance options
#LI-POST
Job SkillsInformation Technology (IT) Infrastructure, Programming Languages, Software Change Request Management, Software Development Life Cycle (SDLC), Software Engineering, Software Integration Engineering, Software Product Design, Software Product Technical Knowledge, Software Release Management, System Testing Tools
Additional Job DetailsAddress:RBC CENTRE, 155 WELLINGTON ST W:TORONTO
City:Toronto
Country:Canada
Work hours/week:37.5
Employment Type:Full time
Platform:CAPITAL MARKETS
Job Type:Regular
Pay Type:Salaried
Posted Date:2026-06-05
Application Deadline:2026-07-10
Note: Applications will be accepted until 11:59 PM on the day prior to the application deadline date above