Department: Information Technology
Job Description: Join a growing Infrastructure team to help design, build, and maintain robust, scalable, and secure cloud environments that power our organization's operations. As a Senior Cloud Infrastructure Engineer, you will also lead governance and oversight of our Cloud Infrastructure Managed Services (CIMS) provider, ensuring that contracted services are delivered reliably, securely, and cost-effectively in alignment with the organization's cloud strategy. This role blends deep cloud engineering with operational excellence, incident leadership, and vendor performance management.
Work Arrangement:
- Employees who live within 30 miles of the TMG home office are expected to follow a hybrid or in-office schedule. The initial training period may require additional in-office days.
Accountabilities:
Cloud Engineering & Platform Operations
- Design and deliver cloud foundations, including account/subscription setup, networking, access controls, guardrails, and secure, scalable architecture patterns across AWS and Azure.
- Implement automation through Infrastructure as Code (Terraform or similar) and CI/CD pipelines to provision, update, and maintain environments.
- Run reliable services by monitoring system health, performance, logs, and security events, responding to incidents, and driving root-cause analysis.
- Support modernization & migrations, including containerization, serverless adoption, and transition to resilient multi-AZ/region patterns.
Vendor Governance & Managed Services Oversight
- Serve as the primary day-to-day technical interface with the Cloud Infrastructure Managed Services (CIMS) provider supporting our AWS and Azure environments.
- Oversee adherence to SOW/MSA obligations, including scope, SLAs, security tasks, DR drills, ticket concurrency limits, escalation matrix steps, cost reporting, and monthly governance deliverables.
- Review and validate supplier-delivered RCAs, ensuring corrective actions are completed and prevention steps are implemented.
- Review and approve vendor-initiated changes to ensure alignment with internal standards.
- Participate in bi-weekly governance meetings, contributing to KPI reviews, risk tracking, cost insights, and optimization recommendations.
Incident & Problem Management
- Lead technical response for cloud-related Severity 1 and 2 incidents, coordinating with the supplier and internal teams to restore service quickly.
- Ensure incidents meet response and resolution SLAs and escalate via the SOW's escalation matrix when needed.
- Drive problem management by identifying recurring patterns and implementing remediation with the supplier.
Security, Identity & Compliance
- Apply and enforce cloud security baselines, including MFA, PAM, RBAC, encryption, logging, monitoring, and identity governance.
- Validate periodic vulnerability assessments and cloud security scoring delivered by the supplier, and ensure remediation progresses.
- Partner with Security and Compliance to ensure cloud environments adhere to data-protection requirements.
Cost Optimization & FinOps Collaboration
- Review monthly cloud consumption and cost reports delivered by the supplier; validate the accuracy of consumption-based CIMS billing tiers.
- Identify and act on cost optimization opportunities, including rightsizing, scheduling, storage optimization, and cleanup activities.
- Enforce tagging and cost-allocation standards for consistent reporting and chargeback readiness.
Documentation, Standards & Knowledge Sharing
- Maintain up-to-date runbooks, diagrams, and SOPs; ensure supplier documentation meets internal expectations.
- Share best practices and mentor teammates in modern cloud practices, automation, and operational excellence.
Qualifications:
- 8+ years in infrastructure/operations/DevOps/SRE roles, with 5+ years in cloud infrastructure engineering.
- Hands-on experience running production workloads in AWS and/or Azure.
- Strong Infrastructure-as-Code skills (Terraform or equivalent) and experience with CI/CD automation.
- Solid understanding of cloud networking, IAM/RBAC, security, and cost management.
- Experience working with or overseeing a cloud managed services provider, including ticket management, SLA interpretation, and escalation processes.
- Strong troubleshooting and incident-response experience, especially in cloud environments.
- Excellent communication and collaboration skills.
- Experience with containers, orchestration, and GitOps workflows.
- Familiarity with compliance frameworks (SOC 2, PCI, HIPAA) and policy-as-code concepts.
- Cloud certifications (AWS or Azure) strongly preferred but not required.
Pay Range:
Anticipated Hiring Range:
- $100,000 - $150,000 annual base salary, depending on experience, qualifications, and geographic location
Benefits: We are proud to offer our full-time regular employees a robust benefits suite that includes:
- Competitive base salary plus incentive plans for eligible team members
- 401(k) retirement plan that includes a company match of up to 6% of your eligible salary
- Free basic life and AD&D, long-term disability, and short-term disability insurance
- Medical, dental and vision plans to meet your unique healthcare needs
- Wellness incentives
- Generous time off program that includes personal, holiday and volunteer paid time off
- Flexible work schedules and hybrid/remote options for eligible positions
- Educational assistance