Federal Reserve Bank

Senior Site Reliability Engineer

Salary: $140K - $210K
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years in Site Reliability Engineering, DevOps, or a related field
  • Strong knowledge of AWS services and cloud infrastructure
  • Proficient in programming/scripting languages such as Python, Java, or Go
  • Experience with CI/CD and IaC pipeline automation
  • Familiarity with fault injection and observability tools

Responsibilities

  • Operate and manage the production environment for the FedNow program
  • Architect and implement monitoring solutions for capacity and scaling
  • Develop automation for CI/CD pipelines
  • Ensure systems' resiliency and conduct disaster recovery testing
  • Drive continuous improvement initiatives within ITIL processes

Benefits

  • Work from an office in a collaborative environment
  • Opportunity to influence a transformative financial initiative
  • Develop skills in advanced technologies and practices
  • Engagement with a diverse team of professionals
  • Support for ongoing career development and education
Full Job Description
Company
Federal Reserve Bank of Boston

The Federal Reserve has developed a new interbank 24x7x365 real-time gross settlement (RTGS) service with integrated clearing functionality, called the FedNow Service. This service enables financial institutions to provide their customers with the ability to send and receive payments any time, any day, and have full access to those funds within seconds. This position is a unique opportunity to be part of this mission-critical Federal Reserve initiative that is transforming the payments landscape in the United States.

The position is primarily on-site; candidates must reside within commuting distance of one of our offices.

Candidates may come from infrastructure/DevOps backgrounds or software engineering backgrounds (e.g., Java, Python, Go) with a strong interest in operating and improving the reliability of distributed production systems.

Responsibilities
As a Senior Engineer on the SRE / Production Operations team for FedNow, you will operate the production environment for the program.

You will architect, implement, and use monitoring and tooling for capacity planning, utilization reporting, and scaling.

The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions.

You will design and develop CI/CD and IaC pipeline automation.

You will support resiliency, disaster recovery (DR), and business continuity planning (BCP), including testing.

The SRE / Production Operations team is part of the Technical Operations (TechOps) department and has the overall responsibility for the design, management and execution of operations required to support the ongoing technical and delivery needs of the FedNow Program, as well as the transition to production support and operations.

The team owns ongoing ITIL processes and implements and drives continuous improvement initiatives.

The role applies both software engineering and system engineering practices to operate and improve large-scale distributed systems. You will work closely with Engineers and Architects of the FedNow program in order to maintain seamless automation across the entire platform.

You will proactively identify suspected gaps in system architecture and design experiments to expose them.

The ideal candidate is someone who loves building and maintaining reliable and scalable systems and CI/CD tooling, and automating cloud-based, highly available, high-performing applications.

Key Skills

Strong communication and collaboration skills

Extensive knowledge and understanding of working in AWS environments and services (EC2, EBS, EKS, RDS, Aurora, S3, Route 53, ELB, IAM, etc.)

HashiCorp Terraform, Consul, and Vault, as well as Ansible

Experience developing automation or operational tooling using scripting or programming languages such as Python, Java, Go, or similar languages.

Experience working with cloud infrastructure platforms or distributed system environments

Experience working in Linux environments and with shell scripting

Experience supporting infrastructure for large multi-service applications

Experience working with continuous deployment in microservices architectures

Experience working with containers (Docker, ECR, and EKS)

Observability tools: CloudWatch, OpenSearch, Dynatrace, Grafana, Prometheus

Familiarity with fault injection tooling (e.g., AWS Fault Injection Simulator, Gremlin, ChaosToolkit, Chaos Monkey)

Automation mindset to enable consistency and dependability in common actions

The salary range for this position is $140,000 - $210,900. The position and job description posted are for a Senior Site Reliability Engineer; however, candidates will be placed at an appropriate level within the Site Reliability Engineer job family based on the extent of their experience.

Full Time / Part Time
Full time

Regular / Temporary
Regular

Job Exempt (Yes / No)
Yes

Job Category
Information Technology Family Group

Work Shift
First (United States of America)

About Federal Reserve Bank

Founded: 1913
