Job DescriptionKavaliro is seeking an experienced Cloud Infrastructure & DevOps Leader for our local Jacksonville client to oversee the reliability, scalability, and security of our AWS-hosted SaaS platforms. This role will play a critical part in supporting rapid business growth by partnering with engineering teams on deployments, performance optimization, and infrastructure strategy. The ideal candidate combines strong hands-on technical expertise with leadership experience and a passion for automation, operational excellence, and continuous improvement.
This is an onsite position based in Jacksonville, FL, working Monday through Friday from 9:00 AM to 6:00 PM.
Responsibilities
- Collaborate with R&D and engineering teams to support system deployments, upgrades, capacity planning, and performance tuning initiatives.
- Manage the day-to-day operations of AWS infrastructure and core SaaS platforms, including monitoring, alerting, incident response, and production support.
- Establish and maintain high-availability standards to ensure the reliability and performance of mission-critical systems while continually enhancing service quality.
- Lead efforts to automate operational processes through CI/CD best practices, improving release efficiency and accelerating incident response.
- Promote Infrastructure as Code methodologies using Terraform and CloudFormation to drive consistency, scalability, and operational maturity.
- Implement and maintain monitoring and observability solutions that provide visibility into system health and proactively identify issues.
- Partner with security teams to enforce standards around system hardening, patch management, access controls, and audit readiness.
- Continuously refine operational processes and establish best practices that strengthen execution and improve organizational effectiveness.
- Provide technical leadership, mentorship, and guidance to team members while fostering a culture centered around accountability, innovation, and continuous improvement.
- Manage on-call processes and ensure timely responses to critical production events and customer-impacting incidents.
Qualifications
- Bachelor's degree in Computer Science or a related technical field.
- 3+ years of hands-on experience managing AWS environments, with at least one year of team leadership or project management responsibility.
- Experience supporting Kubernetes in production environments, including the ability to diagnose and resolve common issues independently.
- Strong working knowledge of Nginx, MySQL, Redis, and Kafka, including configuration, optimization, and troubleshooting.
- Proficiency with Shell scripting or Python for automation and operational tasks.
- Experience implementing Infrastructure as Code using Terraform and/or CloudFormation.
- Familiarity with AWS services including VPC, EC2, S3, IAM, and RDS.
- Experience with CI/CD tools and processes, including platforms such as Jenkins and GitHub Actions.
- Fluent Mandarin Chinese is required to effectively collaborate with overseas headquarters, along with professional English communication skills for the local work environment.
- Demonstrated leadership skills, strong ownership mentality, and the ability to thrive in a collaborative environment.
- Willingness to participate in and oversee on-call rotations to support production availability.
If you're passionate about building resilient cloud platforms, leading high-performing teams, and driving automation at scale, we'd love to hear from you.
Job RequirementsOn-Site