Synchrony

AVP, Reliability Engineer - OnePay

Synchrony
$100K - $170K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree with 5+ years in application development or reliability engineering, or 8+ years of experience without a degree.
  • Proven troubleshooting expertise in distributed systems within cloud environments.
  • Solid understanding of distributed systems and cloud concepts like containerization and data replication.
  • Experience with operational metrics for incident management and support.
  • Hands-on scripting/automation skills in languages like Python or Bash.

Responsibilities

  • Investigate and analyze production defects with cross-functional teams.
  • Ensure high availability and scalability of OnePay applications.
  • Enhance observability through dashboards and monitoring tools.
  • Design and implement service metrics to track performance and SLAs.
  • Develop automation using AIOps to aid incident response.
  • Continuously monitor and report on application health and performance.
  • Support CI/CD processes and troubleshoot pipeline issues.

Benefits

  • Flexible work options, allowing remote work near local hubs.
  • Engagement in career development activities such as training and team meetings.
  • Incentive opportunities like annual bonuses based on performance.

Full Job Description

Role Summary/Purpose:

The AVP, Reliability Engineer - OnePay plays a pivotal technical role within Synchrony Financial to ensure high availability, stability, security, and performance of applications supporting OnePay integrations. In order to provide operational excellence in a highly regulated environment, this role provides technical expertise and rigor to identify and remediate failures or looming issues that could negatively impact customer and partner experiences or prevent adherence to SLAs. The ideal candidate excels at problem analysis, troubleshooting methods, and situational awareness within the context of distributed systems.

This is a hands-on technologist role requiring exposure to SRE and DevOps technology stacks and strong understanding of application support processes, including monitoring and addressing incidents/alerts across engineering applications and ensuring effective coordination and handoffs with vendors, partners, and internal Synchrony teams. The role also develops automation and leverages AIOps approaches to detect gaps, monitor trends, reduce operational toil, and expedite response and remediation.

Essential Responsibilities:

  • Drive investigations with cross-functional teams to understand failures, analyze production defects, troubleshoot systems, identify root cause, and implement fixes to prevent recurrence.

  • Ensure the dependability, availability, and scalability of OnePay-integrated applications and services by partnering with application, platform, and infrastructure teams.

  • Enhance observability, including establishing and maintaining dashboards and monitoring capabilities (e.g., Splunk, New Relic, and similar tools), improving alert quality, and strengthening operational readiness.

  • Design and implement monitoring, alerting, and metrics to track and report adherence to service SLAs/SLOs, performance, and operational efficiency.

  • Develop automation and leverage AIOps to detect reliability gaps, monitor trends, reduce noise, and expedite incident response and restoration activities.

  • Continuously monitor the health and performance of engineering applications, production servers, and key service indicators; provide monitoring/reporting and recommendations as needed.

  • Support release and operational processes, including troubleshooting CI/CD pipeline issues (e.g., Jenkins pipelines) and coordinating releases with partner teams.

  • Participate in Agile sprints with cross-functional teams involving multiple technologies, personnel, and processes; contribute reliability requirements and improvements that support continuous delivery.

  • Support a root cause analysis discipline and continuous improvement practices that reduce downtime and increase resiliency.

  • Coordinate effectively with vendor partner teams and Synchrony teams to ensure seamless support handoffs and timely issue resolution.

  • Communicate the status of technical stacks, incidents, risks, and reliability initiatives to stakeholders and leadership, including partner-facing stakeholders as appropriate.

  • Work closely with an experienced staff comprising both Synchrony resources and third-party contractors.

  • Participate in an on-call rotation to respond to critical production issues.

  • Perform other duties and/or special projects as assigned.

Qualifications/Requirements:

  • Bachelor's degree and a minimum of 5 years of relevant experience in application development, reliability engineering, systems engineering, and/or production application support (or equivalent practical experience) OR, in lieu of a degree, a High School Diploma/GED and a minimum of 8 years of relevant experience.

  • Demonstrated experience troubleshooting and supporting distributed systems in cloud environments.

  • Good understanding of the nature of distributed systems and cloud providers.

  • Solid understanding of cloud concepts such as containerization, message queues, load balancing, data replication, and high availability patterns.

  • Understanding of IT application support processes, including incident management, problem resolution, and operational/support metrics used for decision-making.

  • Knowledgeable in UNIX Operating System fundamentals.

  • Familiar with network programming concepts and protocols.

  • Proficiency in DevOps concepts and Site Reliability Engineering (SRE) principles, including automation, monitoring, and reliability best practices.

  • Hands-on experience with scripting/automation in at least one language such as Python, Bash, JavaScript, PowerShell, Go, or similar.

  • Familiar with one or more configuration automation tools such as Terraform, Ansible, Puppet, Chef, etc.

  • Strong communication skills (verbal and written) and excellent interpersonal skills with ability to interact with multiple audiences, including clients/partners, developers, managers, and senior executives.

  • Customer-focused mindset; self-driven, detail-oriented; strong organizational and time management skills; ability to operate with limited supervision.

  • Well-developed analytical and problem-solving skills.

  • Continuously seeks opportunities to enhance products/services through process improvements.

Desired Characteristics:

  • Strong alignment to DevOps tools and SRE best practices; demonstrated ability to reduce operational toil through automation.

  • Experience with cloud providers such as AWS, Azure, and/or GCP; exposure to deployment processes such as AWS/PCF where applicable.

  • Familiar with toolsets such as Jira, PagerDuty, OpsGenie, Kibana, Grafana, Splunk, and application performance monitoring tools such as New Relic.

  • Experience supporting or coordinating CI/CD pipelines (e.g., Jenkins/CloudBees) and release processes.

  • Knowledge of an application or systems language such as Java, Golang, Rust, or C++.

  • ITIL Foundation and/or SRE/DevOps certifications are a plus.

  • Experience driving reliability improvements through resiliency patterns, performance tuning, and operational readiness practices in partner-integrated environments.

Grade/Level: 10

The salary range for this position is 100,000.00 - 170,000.00 USD Annual and is eligible for an annual bonus based on individual and company performance.

Actual compensation offered within the posted salary range will be based upon work experience, skill level or knowledge.

Salaries are adjusted according to market in CA, NY Metro and Seattle.

Our Way of Working:

We're proud to offer you flexibility. At Synchrony, our way of working allows you to have the option to work from home near one of our Hubs or come into one of our offices. You will be required to commute to your nearest Hub (either virtual or physical) for in-person engagement activities such as regular business or team meetings, training and culture events.

*Field Sales and some Commercial team roles may have varied location requirements based upon partner obligations or preferences.

Job Family Group:

Information Technology

About Synchrony

Synchrony (NYSE: SYF) is a leading consumer financing company at the heart of American commerce and opportunity. From health to home, auto to retail, our Synchrony products have been serving the needs of people and businesses for nearly 100 years. We provide responsible access to credit and banking products to support healthier financial lives for tens of millions of people, enabling them to access the things that matter to them. Additionally, through our innovative products and experiences, we support the growth and operations of some of the country's most respected brands, as well as more than 400,000 small and midsize businesses and health and wellness providers that Americans rely on. Synchrony is proud to be ranked as the country's #2 Best Company to Work For® by Fortune magazine and Great Place to Work®.
Size
18,000 employees
Market Cap
$14.4 billion
Industry
Net Income
$1.3 billion
Founded
1993
5 Year Trend
+0.7%
NASDAQ
