The OpportunityWe are seeking a highly skilled Senior Cloud / DevOps Engineer with a strong background in AWS, automation, infrastructure as code, and networking to support and modernize our cloud environments. This role is hands-on and will partner closely with Cloud Operations, SREs, Networking, and Application teams to improve scalability, reliability, security, and operational efficiency across mission-critical systems.
The ideal candidate is comfortable operating at both the infrastructure and application layers, has strong troubleshooting skills, and can automate repeatable operational tasks while supporting high-availability production workloads.
Key ResponsibilitiesCloud & DevOps Engineering
- Design, build, and maintain AWS-based infrastructure supporting production and non-production environments
- Implement and maintain Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or equivalent
- Develop and support CI/CD pipelines for infrastructure and application deployments
- Partner with application teams to improve deployment reliability and performance
Automation & Reliability
- Create and maintain automation scripts and tooling (Python, Bash, PowerShell, etc.) to reduce manual operations
- Improve system reliability through self-healing mechanisms, monitoring, and alerting
- Support SRE-style practices including incident response, root cause analysis, and continuous improvement
Networking & Security
- Design and support cloud networking (VPCs, subnets, routing, VPNs, security groups, NACLs)
- Troubleshoot complex network, connectivity, and performance issues across hybrid environments
- Implement security best practices aligned with AWS Well-Architected Framework
Operations & Collaboration
- Participate in on-call rotations supporting critical production systems
- Provide operational support, troubleshooting, and resolution for cloud-related incidents
- Collaborate across CloudOps, Networking, DBAs, and Application teams
- Document architectures, runbooks, and operational procedures
What Success Looks Like in This Role
- Reduced manual operational work through automation
- Improved deployment reliability and production stability
- Faster recovery and clearer root cause analysis during incidents
- Strong partnership with CloudOps, Networking, and Application teams
Skills & Requirements
Required QualificationsTechnical Skills
- 5-8+ years experience in cloud, DevOps, SRE, or systems engineering roles
- Strong hands-on experience with AWS (EC2, VPC, IAM, ELB/ALB, RDS, S3, CloudWatch, etc.)
- Proven experience with Infrastructure as Code (Terraform preferred)
- Strong scripting and coding experience (Python, Bash, PowerShell, or similar)
- Solid background in networking fundamentals (TCP/IP, DNS, VPNs, routing, firewalls)
- Experience with Linux-based systems in production environments
- Familiarity with monitoring/logging platforms (Datadog, CloudWatch, LogicMonitor, etc.)
DevOps Tooling (one or more)
- CI/CD tools (GitHub Actions, GitLab CI, Jenkins, Azure DevOps, etc.)
- Configuration management and automation tools
- Containerization and orchestration (Docker, ECS, EKS, Kubernetes - preferred but not mandatory)
Preferred Qualifications
- AWS certifications (Solutions Architect, DevOps Engineer, or equivalent)
- Experience supporting high-availability, regulated, or SaaS environments
- SRE experience (error budgets, SLIs/SLOs, post-incident reviews)
- Experience working in hybrid cloud or legacy-to-cloud migration environments
- Strong documentation, communication, and cross-team collaboration skills
Qualifications