Full Job Description
Skills and Responsibilities:
• Deep End-to-End Systems Expertise Strong knowledge of complex, multi-tier environments spanning on-prem and cloud-native systems supporting large-scale transaction flows.
• Advanced Observability APM Experience Hands-on expertise with Dynatrace (or similar tools), including instrumentation, monitoring, and troubleshooting distributed applications.
• Full-Stack Troubleshooting Capability Proven ability to diagnose and resolve issues across application, infrastructure, network, and platform layers in E2E environments.
• SRE Leadership Roadmap Execution Drives and executes SRE roadmap initiatives (e.g., SRE WCCS), including capability assessments, gap analysis, and strategic improvements.
• Dynatrace SME Skillset (Day 1 Ready) Expertise in DQL, Grail traces, Gen3 dashboards, ACTIVE Gate plugins, SRG workflows, and Business Events.
• Deep Observability Fundamentals (MELT)Strong command of metrics, events, logs, and traces with ability to correlate signals for root cause analysis and performance optimization. Cloud Observability (AWS Focus)
• Experience with AWS observability stack (CloudWatch, Application Signals, Lambda, API Gateway, tracing, and logging).
• Engineering Automation Skills Strong programming in Python and Node.js experience with serverless (AWS Lambda, Azure Functions), ECS, and backend integrations. Platform Engineering SRE Practices
• Experience implementing SRE principles (Google SRE), building platform capabilities like self-service pipelines, policy-as-code, and Engineering SRE strumentation frameworks. Complex Enterprise Financial Systems.
• Experience Background in large-scale, highly integrated environments (e.g., financial services), with ability to design observability for systems with limited visibility (e.g., IBM DataPower) and monitor AI-driven systems."
Salary Range - CA$ 100,000 - CA$ 120,000 Per Year
TCS does not use artificial intelligence tools for candidate screening or evaluation. This post is for a current vacancy. The hiring process includes an initial screening, followed by a technical evaluation and managerial discussion.