KEY RESPONSIBILITIES
- Design, build, and continuously evolve a unified cloud-native compute and network platform that provides consistent access, runtime, and traffic management capabilities for all company microservices.
- Build, optimize, and operate Kubernetes-based compute orchestration platforms and related ecosystem components, including but not limited to HPA, VPA, Cluster Autoscaler, and CoreDNS.
- Design and implement compute and network infrastructure capabilities on AWS, including VPC, Subnet, NAT, VPN, dedicated connectivity, and integration with EC2 and serverless platforms.
- Build and enhance traffic management capabilities for large-scale microservice environments, including service mesh, gateways, load balancers, CDN, and global acceleration.
- Lead the architecture, deployment, governance, performance optimization, and reliability assurance of a large-scale Istio service mesh in production environments to support service discovery, traffic routing, resilience, and secure service-to-service communication.
- Collaborate closely with application engineering, architecture, and platform teams to standardize infrastructure capabilities and improve scalability, reliability, and operational efficiency.
- Drive technical evolution and best practices in cloud-native compute and networking, and contribute to the long-term development of the company's infrastructure platform.
Requirements
REQUIRED QUALIFICATIONS
- Bachelor's degree or above in Computer Science, Software Engineering, or a related field.
- 7+ years of experience in cloud infrastructure, platform engineering, or related areas.
- Hands-on experience with AWS is required, with solid practical knowledge of cloud compute and networking services.
- Deep expertise in Kubernetes, including cluster architecture, workload orchestration, networking, autoscaling, and ecosystem components; Kubernetes certifications such as CKA or CKS, or equivalent proven expertise, are strongly preferred.
- Proven experience managing a large-scale Istio service mesh in production environments is required.
- Strong understanding of cloud-native traffic management and networking, including gateways, load balancers, service-to-service communication, CDN, and global traffic routing.
- Strong problem-solving skills with the ability to independently troubleshoot and resolve complex infrastructure and network issues in distributed systems.
- Strong communication and cross-functional collaboration skills.
PREFERRED QUALIFICATIONS
- Experience with other public cloud platforms such as Azure or GCP.
- Experience building compute and network platforms in multi-cloud or hybrid-cloud environments.
- Experience building shared infrastructure platforms or standardized platform capabilities for large-scale microservice architectures.
Benefits
Salary range: $150,000 - $200,000
- Free snacks and drinks
- Fully paid medical, dental, and vision insurance (partial coverage for dependents)
- Contributions to 401(k) funds
- Bi-annual reviews and annual pay increases
- Health and wellness benefits, including free gym membership
- Quarterly team-building events
No third-party agency inquiries, please. We are unable to offer visa sponsorship at this time.