Sr IT Manager

Capital One Financial   •  

Richmond, VA

Industry: Accounting, Finance & Insurance


8 - 10 years

Posted 175 days ago

This job is no longer available.

Sr IT Manager

We started Capital One with a simple principle – attract great talent and give them the opportunity to be great.   That strategy has made us the nation’s leading digital bank with more than 65-million customers and 40,000 associates worldwide.

We may look like a bank, but we think and operate like a tech company. A tech company that writes its own code, develops its own software, and builds its own products. A tech company that experiments and innovates with the latest technologies, engineers breakthrough customer experiences, and brings simplicity and humanity to banking.

We are looking for experienced Senior IT Managers with operational and/or engineering background with a passion for providing superior system availability and customer experience.  We are looking for candidates who can drive reliability and performance across massive scale by mastering the full depth of the stack.  As a Senior IT Manager, you will have the opportunity to tackle complex problems of scale which are unique to tech companies while using your expertise in delivery and support of critical services.


- Increase operational efficiencies to pro-actively reduce and mitigate production incidents

- Provide Call Leadership to mitigate critical incidents

- Lead a team of experienced support engineers to meet or exceed expectations on incident SLAs

- Ability to understand full technology stack of systems in assigned domain

- Lead a high performing support engineers to provide a 24x7 support for systems with an ever-watchful eye on their availability, latency, performance, and capacity

- Collaborating with other tech leads and support teams to ensure integrated end-to-end availability, reliability, and performance

- Define support strategies for systems in the Cloud (AWS)

- Influencing resiliency and scalability in production environments in Amazon Web Services? and other cloud platforms

- Identify and drive resolution on monitoring and alerting gaps

- Lead a team to design, write and deliver technical and process automations to improve the availability, scalability, latency, and efficiency of Capital One’s services

- Solve problems relating to critical services and build automation to prevent problem recurrence; with the goal of automating response to all non-exceptional service conditions

- Engage in service capacity planning and demand forecasting, software performance analysis and system tuning

- Identifying and remediating risk to critical and non-critical system KPIs

- Familiarity with application architectures and networking

- Familiarity with automation of routine maintenance tasks and common issues

- Understanding of Unix/Linux systems from kernel to shell and beyond, taking in system libraries, file systems, and client-server protocols along the way

- Networking: knowledge and understanding of network theory, such as different protocols (TCP/IP, UDP, ICMP, etc), MAC addresses, IP packets, DNS, OSI layers, and load balancing)

- At least 1 year of experience in two or more of the following:

  • AWS cloud services configuration & administration
  • Chef, Ansible, Puppet or UDeploy
  • Jenkins, Artifactory, or Travis experience
  • Restful web/API services support and deployment
  • Splunk, Datadog, New Relic, and App Dynamics monitoring / alerting

- Current technical certification(s) in any of the above technologies

Basic Qualifications:

- Bachelor’s Degree or Military experience

- At least 7 years of experience in managing production support teams involving systems in the Cloud

Preferred Qualifications:

- Master’s Degree in Computer Science

- 3 years of experience in AWS

Job ID R49071