Monitoring Engineer

Adyen • $90K — $130K *

Chicago, IL 60629In-Person

Information Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of experience in incident and problem management
Proficient in incident client communication and platform monitoring operations
Strong skills in analyzing incident trends and driving preventative actions
Experienced with monitoring tools like Prometheus and Grafana
Familiar with observability platforms such as Datadog and Splunk
Excellent analytical abilities and problem-solving skills
Ability to communicate complex technical concepts clearly to diverse audiences

Responsibilities

Monitor platform and merchant performance as part of an on-call support team
Coordinate incident resolution, ensuring a customer-centric approach during high-impact incidents
Communicate real-time incident updates to merchants and escalate critical issues appropriately
Analyze incident trends to identify and advocate for long-term fixes
Collaborate with engineering, product, and operations teams to enhance monitoring strategies
Investigate alerts and provide constructive feedback for effective logging
Lead initiatives aimed at automating monitoring processes for improved efficiency

Benefits

Opportunities for professional development and skill enhancement
Inclusive company culture that values diverse perspectives
Supportive team environment that fosters collaboration
Encouragement to apply even if you don't meet 100% of qualifications
Structured hiring process designed for a smooth candidate experience

Full Job Description

Platform Monitoring Engineer / Incident Manager

A team within Engineering under the Platform Excellence pillar exhibits an unwavering attention to detail and a deep understanding of the platform wide monitoring implications to all merchants.

In this role, you will be on-call monitoring platform performance, coordinating and commanding incidents, communicating with our customers, working on monitoring frameworks, providing feedback to product engineering teams to improve the reliability of the platform. You will initiate and lead initiatives across our platform offerings prioritizing merchant impact to proactively detect any issues, inform merchants quickly, and increase the reliability of our platform.

What you'll do

On-call: The team operates in a follow the sun model, where you will be participating in the EMEA shift (shifts: 9.00AM - 6.00PM PDT, 11.00-8.00PM CDT). Observe platform and merchant performance and detect any issues proactively to mitigate risks in partnership with Engineering teams.
Incident Management: Coordinate the mitigation, recovery, and resolution of high-impact incidents, ensuring a rapid and effective response across teams. Represent the customer perspective during incidents, maintaining a strong customer-centric approach.
Communication: Be an expert in communicating with merchants real time during an incident and present the most accurate and updated information to keep them informed. Escalate critical incidents when needed and provide structured communication to senior management.
Problem Management: Go beyond reactive incident response by analyzing incident trends to identify recurring issues and systemic weaknesses. Partner with engineering and product teams to advocate for long-term fixes over repeated short-term patches.
Working together with Operations, Product, and Engineering teams to integrate, grow, and continuously improve our monitoring strategy and increase our reliability.
Investigate alerts and provide feedback to engineering teams to build effective logging and alerts across the platform architecture.
Mitigate merchant impact risk by actioning on alerts in partnership with Engineering teams, and contribute to the monitoring playbook by documenting your learnings.
Improve operations by leading/project managing initiatives and, or tools-development of automation for effective monitoring. .
Focus on ruthlessly prioritizing, automating, and scaling every aspect of our detection capabilities.

Who you are

You have at least 5 years of experience with incident management, problem management, incident client communication, and platform monitoring operations.
You have experience with problem management practices - identifying trends across incidents, conducting root cause investigations and driving preventative action.
You have solid communication skills and the ability to develop strong working relationships throughout the organization, able to translate technical situations clearly and concisely to a diverse audience via data-visualizing dashboards and written documents.
You're willing to participate in the on-call rotation and work in a fast-paced, dynamic environment.
You have experience with monitoring and logging tools like Prometheus, Grafana, ELK Stack, etc.
You have experience with observability platforms like Datadog, Dynatrace, Splunk.
You have excellent analytical and problem-solving skills, with the ability to analyze complex systems and spot the root cause of issues.
You thrive in an environment where collaboration is crucial and where a global approach is key for are you successful implementation of processes and projects
You have a passion for defining and standardizing processes to drive strategic improvement and able to translate complex technical concepts with ease for all non technical audiences
You have a natural ability for handling complex situations and multiple responsibilities simultaneously
You're a strong team player and thrive in a dynamic environment.
Work schedule: The shifts are from 9.00AM - 6.00PM PDT or 11.00AM - 8.00PM CDT with a 6-day workweek at least twice a month (Sunday-Friday or Monday-Saturday).

Our Diversity, Equity and Inclusion commitments

Our unique approach is a product of our diverse perspectives. This diversity of backgrounds and cultures is essential in helping us maintain our momentum. Our business and technical challenges are unique, and we need as many different voices as possible to join us in solving them - voices like yours. No matter who you are or where you're from, we welcome you to be your true self at Adyen.

Studies show that women and members of underrepresented communities apply for jobs only if they meet 100% of the qualifications. Does this sound like you? If so, Adyen encourages you to reconsider and apply. We look forward to your application!

What's next?

Ensuring a smooth and enjoyable candidate experience is critical for us. We aim to get back to you regarding your application within 5 business days. Our interview process tends to take about 4 weeks to complete, but may fluctuate depending on the role. Learn more about our hiring process here. Don't be afraid to let us know if you need more flexibility.

San Francisco

This role is based out of our [CITY NAME] office. We are an office-first company and value in-person collaboration; we do not offer remote-only roles.

About Adyen

Adyen is a Dutch payment company that allows businesses to accept e-commerce, mobile, and point-of-sale payments. The company was founded in 2006 and has since grown to become one of the largest payment processors in the world. Adyen's platform is designed to be flexible and scalable, allowing businesses of all sizes to accept payments in a variety of currencies and payment methods. The company's clients include some of the world's largest companies, such as Uber, Spotify, and Microsoft.

Learn more about Adyen

Size

2,180 employees

Industry

Finance & Insurance

Founded

2006

NASDAQ

ADYYF

* Ladders Estimates

Similar Jobs

Senior Systems Engineer, Technical Support
$90K — $172K *
Calix
Remote
Reposted Today
Technical Support Engineer - Java/.NET
$72K — $90K *
New Relic
Detroit, MI 48228 (Wayne County)
Today
Senior TAM / Technical Account Manager
$100K — $130K *
Astra
Remote
Today
Cloud Development Engineer I
$70K — $100K *
Sentinel
Downers Grove, IL 60516 (Dupage County)
Yesterday
Software Technical Account Manager II
$90K — $120K *
Boston Scientific
Remote
Yesterday
Senior Technical Support Engineer
$100K — $130K *
PreVeil, Inc.
Remote
Yesterday

Get Ready For Your
Next Interview

More Jobs at Adyen

Monitoring Engineer
$90K — $130K *
Chicago, IL 60629 (Cook County)
Today
Information Technology
In-Person
Senior CI/CD Engineer
$180K — $243K *
Chicago, IL 60629 (Cook County)
Reposted Today
Information Technology
In-Person
Project Operations Manager
$100K — $130K *
Chicago, IL 60629 (Cook County)
Yesterday
Finance & Insurance
In-Person
Project Operations Manager
$145K — $190K *
New York, NY 10025 (New York County)
Yesterday
Finance & Insurance
In-Person
Manager, Rewards (Compensation & Benefits) - Americas
$250K — $320K *
San Francisco, CA 94112 (San Francisco County)
1 week ago
Business Services
In-Person

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Principal Engineer
$171K — $375K *
San Jose, CA 95123 (Santa Clara County)
Reposted Today
Staff Network Engineer
$124K — $271K *
Remote
Reposted Today
Principal Information Security Engineer
$96K — $207K *
Fifth Third Bancorp
Remote
Reposted Today
Senior Manager, Data Science
$122K — $228K *
Bank of Montreal
Chicago, IL 60629 (Cook County)
Reposted Today

Find similar Monitoring Engineer jobs:

Nationwide Chicago, IL

Monitoring Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Monitoring Engineer jobs:

Get Ready For Your
Next Interview