Job DescriptionAmerican Express Platform Services team is looking for innovators to help us build world-class applications, Cloud platforms and infrastructure supported by integrated CICD, Observability and security capabilities.
The Director Infrastructure Engineering - Head of Private Cloud (OpenShfit, IAC, Data Middleware Services) Operations - US is responsible for leading the strategy, execution, and continuous improvement of cloud operations across OpenShift, Redis, Kafka, Elasticsearch, Terraform and other platform services. This role ensures secure, reliable, scalable, and cost-effective cloud environments that support enterprise applications and digital transformation initiatives.
The ideal candidate combines strong technical depth in cloud infrastructure with operational excellence, financial governance (FinOps), DevOps, Site Reliability Engineering, automation leveraging GenAI/AgenticAI, and people leadership.
ResponsibilitiesKey Responsibilities: Cloud Operations Leadership
- Lead and manage Private Cloud operations for production and non-production environments.
- Establish and enforce operational standards, SLAs, and SLOs.
- Drive incident, problem, and change management processes.
- Ensure high availability, performance, and resilience of cloud platforms.
Cloud Infrastructure & Reliability
- Oversee infrastructure design, deployment, monitoring, and optimization.
- Implement Infrastructure as Code (IaC) using Terraform
- Drive SRE principles including reliability engineering and automation.
- Manage Disaster Recovery, and business continuity strategies.
Automation & DevOps Enablement
- Champion automation-first operational models.
- Leverage GenAI/AgenticAI to automate common platform operations including customer support
- Integrate CI/CD pipelines with cloud infrastructure.
- Reduce manual operational overhead through scripting and tooling.
- Enable platform engineering capabilities for internal teams.
Financial Governance
- Own cloud cost management, forecasting, and optimization.
- Implement tagging standards and chargeback/showback models.
- Drive cost-efficiency initiatives across workloads.
Vendor & Stakeholder Management
- Manage relationships with Service providers
- Collaborate with application teams, architecture, security, and enterprise IT.
- Support cloud migration and modernization programs.
Team Leadership & Development
- Build, mentor, and retain high-performing cloud operations teams.
- Define hiring strategy and succession planning.
- Establish performance metrics and career development plans.
- Foster a culture of accountability, innovation, and continuous improvement.
Qualifications- Bachelor's degree in Computer Science, Engineering, or related field (Master's preferred).
- 8+ years of experience in Platform Engineering & Operations, API Support or Site Reliability Engineering (SRE), with a proven track record of leading teams in managing large-scale cloud infrastructure with a focus on reliability and resilience.
- Deep hands-on experience with any Kubernetes platform(multi-cloud preferred).
- Strong experience with:
- Infrastructure as Code (Terraform, CloudFormation, ARM)
- Container platforms (OpenShift/Kubernetes)
- Monitoring tools (Prometheus, OTEL, LOKI)
- CI/CD pipelines (Jenkins, GitHub Actions)
- Strong understanding of cloud networking, security, and architecture.
- Experience managing large-scale, mission-critical production environments.
- Proven experience in financial management and cloud cost optimization.
- Relevant certifications preferred
- Experience with DevOps practices and methodologies, including CI/CD pipelines, configuration management, and infrastructure as code.
- Experience in leveraging GenAI and AgenticAI in automation and self-healing of platforms
- Experience with observability tools such as Prometheus, Splunk, ELK, Dynatrace.
- Strong analytical and problem-solving skills, with the ability to troubleshoot complex issues and drive resolution in a fast-paced environment.
- Excellent communication and leadership skills, with the ability to effectively collaborate with cross-functional teams and influence decision-making at all levels of the organization.
Employment eligibility to work with American Express in the United States is required as the company will not pursue visa sponsorship for these positions.
About the TeamWe back you with benefits that support your holistic well-being so you can be and deliver your best. This means caring for you and your loved ones' physical, financial, and mental health, as well as providing the flexibility you need to thrive personally and professionally:
- Competitive base salaries
- Bonus incentives
- 6% Company Match on retirement savings plan
- Free financial coaching and financial well-being support
- Comprehensive medical, dental, vision, life insurance, and disability benefits
- Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
- 20+ weeks paid parental leave for all parents, regardless of gender, offered for pregnancy, adoption or surrogacy
- Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
- Free and confidential counseling support through our Healthy Minds program
- Career development and training opportunities
For a full list of Team Amex benefits, visit our Colleague Benefits Site.
The below represents the expected salary range for this job requisition. Ultimately, in determining your pay, we'll consider your location, experience, and other job-related factors.