JP Morgan Chase & Co.

Site Reliability Engineer III

JP Morgan Chase & Co., $120K - $150K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years of software engineering experience
  • 3+ years of Site Reliability Engineering experience in Data Warehousing and cloud environments
  • Proficiency in Python/Java for data movement and automation
  • Hands-on experience with AI/ML infrastructure components
  • Experience in observability tools like Grafana and Datadog
  • Familiarity with CI/CD tools like Jenkins or GitLab
  • Ability to work collaboratively in team environments

Responsibilities

  • Collaborate with engineering teams to define and enforce Non-Functional Requirements (NFRs)
  • Conduct Failure Mode and Effects Analysis (FMEA) to identify potential failure points
  • Define and manage Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
  • Implement and maintain infrastructure-as-code and CI/CD pipelines
  • Drive observability maturity through structured logging and intelligent alerting
  • Lead incident response for production issues and conduct blameless postmortems
  • Support SRE practices for AI/ML platforms and reduce operational toil through automation

Benefits

  • Mentorship and support for growth
  • Opportunities to work on award-winning banking tools and services
  • Collaboration with diverse teams in a risk-taking environment
  • Focus on innovative technologies and next-gen banking solutions
  • Chance to contribute to cutting-edge mobile applications and digital experiences
Full Job Description

As a Site Reliability Engineer III at JPMorgan Chase within the Consumer & Community Banking Data and Analytics team, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.

Job responsibilities

  • Guides and assists others in building designs at the appropriate level and gaining consensus from peers where appropriate
  • Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
  • Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, and scalability solutions in their applications
  • Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
  • Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
  • Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
  • Supports the adoption of site reliability engineering best practices within your team
  • Uses enterprise-authorized AI capabilities within the work environment to accelerate incident triage, troubleshooting, and post-incident analysis, validating outputs and handling operational data according to sensitivity and security requirements
  • Applies enterprise-authorized AI capabilities within the work environment to identify patterns in operational signals that indicate reliability risk or recurring toil, prioritizing reuse-first improvements tied to SLO outcomes

Required qualifications, capabilities, and skills

  • Formal training or certification on site reliability engineering concepts and 3+ years of applied experience
  • Exposure to or hands-on experience in supporting SRE practices for AI/ML platforms and products, with familiarity in infrastructure components such as Databricks, Vector Databases, Model Serving endpoints, and ML training/deployment pipelines.
  • Understanding of how to apply SRE fundamentals (including monitoring, incident response, capacity awareness, and toil identification) to AI/ML and data-intensive workloads, with the ability to define and track relevant SLOs/SLIs (e.g., model latency, inference availability, data freshness).
  • Familiarity with Agentic AI concepts such as AI Agents, Skills, Context Management, and Retrieval-Augmented Generation (RAG), with the ability to leverage these tools to support SRE functions like incident triage, alert enrichment, runbook automation, and root cause analysis.
  • Experience in observability including white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
  • Experience with platforms and applications hosted on public/private/hybrid cloud environments, including container orchestration technologies such as Kubernetes, ECS, and Docker.
  • Experience with continuous integration and continuous delivery tools such as Jenkins, GitLab, or Terraform.
  • Familiarity with troubleshooting common networking technologies and issues.
  • Working knowledge of using enterprise-authorized AI capabilities within the work environment to support SRE workflows with strong validation habits and awareness of data sensitivity
  • Ability to validate AI-assisted operational recommendations before applying changes, escalating when uncertain and following data sensitivity requirements

Preferred qualifications, capabilities, and skills

  • Experience supporting reliability practices for AI/ML platforms, including model serving endpoints and ML pipelines.
  • Experience with Databricks, vector databases, or large-scale feature or embedding pipelines in production environments.
  • Experience applying automation techniques to reduce toil, including self-healing workflows and runbook automation.
  • Experience with Kubernetes and containerized workloads in production.
  • Proficient in site reliability culture and principles with familiarity in implementing site reliability practices within an application or platform.
  • Formal training or certification in SRE concepts is preferred.
  • Familiarity with distributed tracing practices for complex, multi-service systems.
  • Experience running chaos engineering or game day resilience exercises.
  • Familiarity with agentic AI concepts (for example, retrieval-augmented generation) to assist incident triage and operational workflows

About JP Morgan Chase & Co.

JP Morgan Chase & Co. stands at the forefront of the global financial services industry. They offer an expansive array of products and services to a diverse clientele, including individuals, corporations, governments, and institutions. Ever since the merger of J.P. Morgan & Co. and Chase Manhattan Corporation in 2000, this industry-leading entity has become renowned for its comprehensive portfolio encompassing consumer and community banking, corporate and investment banking, commercial banking, as well as asset and wealth management. Headquartered in the vibrant city of New York, JP Morgan Chase & Co. boasts a formidable presence across over 100 countries worldwide.

Unveiling Employment Opportunities at JP Morgan Chase & Co.

Vacancies and Hiring Initiatives

JP Morgan Chase & Co. is continuously on the lookout for talented individuals eager to contribute to its legacy of excellence. The company's recruitment efforts are geared towards identifying candidates with the right blend of skills and qualifications to drive forward its various business segments. Whether you are a seasoned professional or a recent graduate, JP Morgan Chase offers a plethora of job openings across multiple disciplines.

High-Demand Positions

Among the myriad of roles, certain positions stand out for their attractive compensation packages and career advancement prospects. Notably, high-paying jobs at JP Morgan Chase & Co. include Relationship Manager, Branch Manager, and Software Engineer. These roles are critical to the firm's operations and offer lucrative opportunities for those with the requisite expertise.

Navigating the Job Market at JP Morgan Chase & Co.

Leveraging Job Portals and Job Alerts

For job seekers aiming to tap into the opportunities at JP Morgan Chase, staying updated through job portals and subscribing to job alerts is crucial. These tools can provide timely information about job openings, job fairs, and recruitment events, enabling candidates to apply promptly and prepare adequately for interviews.

Preparing Your Job Application

Your job application, comprising your resume and cover letter, is your ticket to securing an interview at JP Morgan Chase. Highlight your qualifications, skills, and experiences that align with the job listing, ensuring you stand out in the competitive job market.

Acing the Interview

Preparation is key to succeeding in your interview with JP Morgan Chase. Familiarize yourself with the company's business segments, values, and recent achievements. Demonstrating how your background and aspirations match the company's goals can significantly increase your chances of employment.

A World of Job Opportunities in the Financial Services Industry

JP Morgan Chase & Co. offers a world of job opportunities for those seeking to make their mark in the financial services industry. With competitive salaries, comprehensive benefits, and endless possibilities for growth, positions at JP Morgan Chase are highly coveted. By staying informed through job sites, tailoring your applications, and preparing thoroughly for interviews, you can enhance your prospects of joining the esteemed ranks of JP Morgan Chase employees. Explore the job board, seize the job opportunities, and embark on a rewarding career journey with one of the world's leading financial institutions.
Learn more about JP Morgan Chase & Co.
Size: 661 employees
Market Cap: $384.5 billion
Net Income: $29.1 billion
Founded: 1823
5 Year Trend: +0.7%
Revenue: $261.5 million
