CBC/Radio-Canada
• $80K — $110K *Qualifications
Responsibilities
Benefits
Position Title:
Senior Analyst, Cloud Ops (English Services) (Telework/Hybrid)Status of Employment:
PermanentPosition Language Requirement:
English, FrenchLanguage Skills:
English (Reading - C - Advanced), English (Speaking - C - Advanced), English (Writing - B - Intermediate), French (Reading - C - Advanced), French (Speaking - C - Advanced), French (Writing - B - Intermediate)Unposting Date:
2026-07-09 11:59 PMThis role is a hybrid work arrangement. Work schedule to be discussed with the Hiring Manager according to the guidelines defined by the department.
Why is this role important?
Reporting to the Senior Manager, Web and Infrastructure, we're looking for a Senior Analyst to join our growing team. It’s an opportunity to help shape the way the corporation works internally and make a contribution towards our “Space for us all” strategic plan. As a part of the CloudOps team, you will play a significant role in supporting the team to install, monitor and manage critical systems that span multiple platforms across the country.
How you will make an impact:
We spend our days solving problems on an unbelievable scale. Media files are highly nuanced and incredibly complicated; moving our broadcasting content from on-prem servers to the cloud is a complex technical feat. And that’s just the beginning. In this role, you will continuously be challenged to apply your judgment, knowledge, experience and analytical skills to:
Enhance application functionality. You will support the planning, coordination, implementation and management of system configuration for new installations or upgrades.
Support. You will be responsible for supporting the initiation, planning and coordinating the effective client support for the system.
Maintain. You will troubleshoot problems and coordinate activities to support the administration of systems.
You will assist in the development and implementation of standards for running new computer system processes and procedures.
Think in terms of platforms and products, not one-off solutions.
Favor automation over manual processes and reproducibility over ad hoc fixes.
Comfortable working in fast-evolving cloud-native ecosystems and learning new tools when they solve real problems.
Collaborate openly, share knowledge, and take pride in team outcomes rather than individual ownership.
Develop systems that combine operational simplicity with long-term scalability.
Communicate clearly and work effectively with engineers, product teams, and stakeholders.
Overseeing infrastructure usage & load during major events such as the Federal Elections, Olympics etc.
What you bring:
Strong hands-on experience with IaC using Terraform and Ansible with working knowledge of CloudFormation and AWS SAM.
Proven experience designing and operating Kubernetes platforms and implementing GitOps workflows based on core principles and best practices.
Hands-on experience with GitOps tools such as Argo CD, Helm, and GitLab CI, as well as Kubernetes-native controllers like AWS Controllers for Kubernetes (ACK) to manage cloud resources declaratively.
Hands-on experience implementing Kubernetes-native scaling strategies, including event-driven and on-demand scaling using tools such as KEDA and Karpenter.
Solid understanding of cloud-native architectures, containers, and serverless services on AWS.
Experience building and operating observability systems for logs, metrics, tracing, and alerting in distributed systems.
Practical knowledge of GNU/Linux systems, networking fundamentals including DNS, TLS, HTTP, CDN and load balancing.
Experience managing and scaling relational databases such as MySQL and PostgreSQL including RDS.
Proficiency in one or more programming languages (Go, Python) for automation and tooling.
Hands-on experience working from the command line in production environments.
Exposure to FinOps practices and cost-aware infrastructure design.
Experience migrating workloads to different architectures, including x86 to ARM Graviton-based Ec2 instances, and transitioning workloads into containers or virtual machines.
Experience supporting production systems in high-availability environments.
Familiarity with SRE principles and reliability metrics such as SLOs, SLIs, error budgets, and DORA metrics.
Participate in a 24/7 on-call rotation and drive continuous improvement of operational playbooks and postmortem practices.
The flexibility. You are able and willing to work outside of regular working hours. You can travel between Montreal and Toronto when required.
We are looking for a candidate with the following technical background and expertise:
You have post-secondary education in Computer Science or Engineering with 1-5 years working with high availability systems. You have hands-on expertise with.
Support cloud-native infrastructure and platforms primarily on AWS, with exposure to GCP where relevant.
Build and maintain Kubernetes based platforms on EKS and GKE, including deployment workflows using Terraform, Helm, and Argo CD.
Exposure to CI/CD pipelines and GitOps practices to enable fast, reproducible, and auditable delivery.
Implement and operate observability platforms covering metrics, logs, traces, and alerting, including self-hosted LGTM stacks and OpenTelemetry.
Contribute to initiatives that improve scalability, reliability, elasticity, and cost efficiency across platforms.
Build automation and internal tooling to reduce operational overhead and improve developer productivity.
Support and evolve production systems, including deployment, networking, monitoring, alerting, and incident response.
Participate in incident response and post-incident reviews, ensuring learnings are translated into system and process improvements.
Contribute to infrastructure security best practices across all environments.
Support cost optimization and FinOps practices without compromising reliability or performance.
Document infrastructure standards, patterns, and operational practices.
Collaborate closely with software engineers, product teams, and other stakeholders to deliver shared outcomes.
Contribute positively to a team culture that values shared ownership, learning, and pride in collective achievements.
English Fluent (Writing, Speaking and Reading)
French work knowledge (Writing, Speaking and Reading) is an asset.
This position will not be filled until consideration has been made by the CBC/Radio-Canada / APS Workforce Adjustment Committee.
Candidates may be subject to skills and knowledge testing.
We thank all applicants for their interest, but only candidates selected for an interview will be contacted.
Primary Location:
Broadcast Centre 205 Wellington St. W., Toronto, Ontario, M5V 3G7Number of Openings:
1Work Schedule:
Full timeSimilar Jobs

More Jobs at CBC/Radio-Canada
More Information Technology Jobs