Job Title: SENIOR CONSULTANT - TECH & IMPL L1
City: Chandler
State/Province: Arizona
Posting Start Date: 6/29/26
Job Description:
͏
Role Name: SRE Lead - Messaging Services
Primary Skills: Site Reliability Engineering / Production Engineering, IBM MQ, Kafka
Secondary Skills: Observability (Splunk/Dynatrace), Linux/Unix, Python/Shell Automation, Incident & Problem Management, Messaging Platform
POSITION_SUMMARY:
We are seeking an experienced Site Reliability Engineer (SRE) Lead - Messaging Services to drive platform reliability, observability, and operational excellence across IBM MQ and Kafka environments.
This role combines:
• Production engineering and reliability leadership for messaging platforms
• Platform security, resilience engineering, and vulnerability remediation
• Ownership of large-scale, distributed messaging runtimes
͏
Key responsibilities include:
• Leading reliability engineering for high-scale messaging platforms supporting tens of thousands of runtimes and high-volume message throughput
• Driving EOL remediation, patching, and stabilization across MQ queue managers and Kafka clusters
• Implementing SRE best practices:
o SLIs / SLOs focused on message delivery, latency, and availability
o Incident management, escalation, and postmortem culture
• Enhancing observability and monitoring for messaging flows, queue depths, lag, and throughput
• Designing proactive fault detection and auto-remediation strategies (e.g., DLQ handling, backlog mitigation, failover recovery)
• Building resilient messaging platforms capable of supporting real-time, event-driven workloads
• Supporting global production messaging environments with on-call rotation and escalation ownership
• Partnering with engineering, application, and security teams to ensure reliability, scalability, and secure message transport
͏
REQUIRED_SKILL:
• Strong experience in Site Reliability Engineering / Production Engineering
• Hands-on expertise with:
o IBM MQ (queue managers, clustering, channels, DLQ management)
o Kafka / Confluent platform (topics, brokers, partitions, consumer groups)
o Large-scale distributed messaging systems and runtime management
• Deep understanding of:
o System reliability, scalability, and high availability design
o Messaging reliability patterns (guaranteed delivery, retry handling, replay, ordering)
o Incident management, root cause analysis, and problem management
͏
DESIRED_SKILL
• Experience implementing SRE frameworks (SLIs, SLOs, error budgets) specifically for messaging workloads
• Familiarity with:
o Kubernetes / containerized messaging platforms
• Experience with:
o Kafka ecosystem components (Schema Registry, Connect, Streams)
o IBM MQ advanced features (Native HA, clustering)
• Exposure to:
o AI-driven operations (AIOps), anomaly detection, or automated remediation
o Large-scale messaging modernization or migration programs
• Messaging or middleware certifications (IBM MQ, Kafka, or equivalent)
• Experience in regulated environments (e.g., financial services)
Mandatory Skills: SRE Operations.
Experience: 5-8 Years.
The expected compensation for this role ranges from $60,000 to $135,000 .
Final compensation will depend on various factors, including your geographical location, minimum wage obligations, skills, and relevant experience. Based on the position, the role is also eligible for Wipro's standard benefits including a full range of medical and dental benefits options, disability insurance, paid time off (inclusive of sick leave), other paid and unpaid leave options.