Job Description
The Impact You Will Have in This Role
As a Senior Application Support Engineer (SRE), you will play a critical role in improving the reliability, scalability, and performance of DTCC's mission-critical applications. You will go beyond traditional application support by applying Site Reliability Engineering (SRE) principles to drive system stability, reduce operational risk, and enhance overall resilience. This role sits at the intersection of engineering, infrastructure, and operations, where you will influence application design, strengthen observability, and proactively prevent incidents before they occur.
You will partner closely with application development, infrastructure, network, and security teams to improve operational readiness, monitoring, and system reliability, while helping promote a strong SRE culture across the organization.
Your Primary Responsibilities:
SRE & Reliability Engineering
- Apply SRE principles and practices to improve system reliability, scalability, and performance
- Evaluate system behavior under failure scenarios and contribute to failure mode analysis and resilience design
- Define and implement strategies for fault tolerance, recovery, and disaster readiness
Monitoring, Observability & Automation
- Partner with development teams to implement monitoring, alerting, and observability solutions
- Define actionable alerts and establish SLIs / SLOs to measure system health
- Drive automation of operational processes to reduce manual effort and improve recovery times
Incident & Problem Management
- Participate in major incident resolution, validating diagnosis and driving root cause analysis (RCA)
- Lead or contribute to post-incident reviews, identifying long-term fixes to prevent recurrence
- Improve overall system stability by addressing recurring issues and operational gaps
Collaboration & SDLC Integration
- Work closely with development teams to embed SRE practices into the software development lifecycle
- Participate in design reviews, sprint planning, and standups to advocate for reliability, scalability, and observability
- Ensure non-functional requirements (NFRs) such as availability and performance are considered early
Qualifications
- Bachelor's degree preferred, or equivalent practical experience
- 6-8 years of experience in application support, SRE, or a similar role
Talent Needed for Success
- Strong understanding of SRE principles, reliability engineering, and production support best practices
- Working knowledge of Unix/Linux, Windows, Mainframe, and SQL/PLSQL
- Experience working in application or production support environments with complex systems
- Proven ability in root cause analysis, incident management, and problem resolution
- Hands-on experience with monitoring and observability tools (e.g., Splunk, Dynatrace)
- Familiarity with cloud platforms (AWS preferred) and distributed systems
- Experience with DevOps tools and automation practices
- Strong problem-solving mindset and passion for improving system reliability
- Exposure to scripting languages such as Python, Shell, or similar
- Familiarity with tools such as AutoSys, ServiceNow, or JIRA
- Strong communication and collaboration skills across cross-functional teams
Leadership Expectations
- Champion Inclusion: Foster an environment of trust, belonging, and collaboration
- Communicate Clearly: Deliver messages with clarity to both technical and non-technical stakeholders
- Build Relationships: Partner across engineering, infrastructure, and business teams to deliver results
- Own Outcomes: Take accountability for reliability, performance, and operational excellence
- Drive Growth: Mentor others and contribute to building team capability
- Lead Change: Challenge existing processes and drive continuous improvement
The salary range is indicative for roles at the same level within DTCC across all US locations. Actual salary is determined based on the role, location, individual experience, skills, and other considerations.
About the Team
The team serves as a dedicated technology resource for advancing DTCC's business opportunities and providing industry thought leadership on leveraging new technology. The goal of this new department is to partner internally with IT and our business and regulatory divisions, and externally with clients, regulators, and fintech vendors, to help build new platforms and business models that advance DTCC's mission to support the financial markets.