Job DescriptionWhat You'll Do:
Own the end-to-end lifecycle (design, provisioning, upgrades, maintenance, and decommissioning) of core platform components, including:
- Cloud infrastructure primitives
- Kubernetes clusters and cluster services
- Networking, ingress, and service discovery
- Service Mesh and supporting data-plane components
Design platform components to be resilient by default, applying SRE principles such as:
- Fault isolation and graceful degradation
- Capacity planning and saturation control
- Reduced operational toil and clear failure modes
Lead the design and implementation of infrastructure bootstrap orchestration, including:
- Automated cluster and environment provisioning
- Deterministic, repeatable platform bring-up and teardown
- Dependency-aware orchestration across cloud, network, and Kubernetes layers
Drive Infrastructure-as-Code and GitOps-first practices to ensure:
- Platform components are reproducible and auditable
- Changes are automated, testable, and reversible
- Manual intervention is minimized or eliminated
- Identify automation gaps and lead initiatives that reduce human effort, onboarding time, and operational risk.
Apply and promote SRE operational excellence practices, including:
- Clear ownership and runbooks for platform components
- Participation in on-call rotation as a platform reliability escalation point
- Incident response, post-incident reviews, and problem management
- Improve day-2 operations by standardizing upgrade/rollback strategies and reducing MTTD/MTTR.
- Ensure platform operations align with security, compliance, and internal control requirements.
- Collaborate with engineering teams across the organization to influence platform adoption, reliability standards, and cloud-native best practices.
This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager.
QualificationsBasic Qualifications:
- 2+ years of relevant work experience and a Bachelors degree, OR 5+ years of relevant work experience
Preferred Qualifications:
- 3 or more years of work experience with a Bachelor's Degree or more than 2 years of work experience with an Advanced Degree (e.g. Masters, MBA, JD, MD)
- Experience in creating and updating documentation for infrastructure and operational procedures.
- Experience in providing first-level support for infrastructure and deployment issues.
- Experience in automating repetitive tasks and suggesting workflow improvements.
- Experience in learning and applying DevOps and SRE best practices.
- Experience in supporting implementation and management of containerization technologies.
- Language Skills:
- Proficiency in English at B2 level or above (Upper-Intermediate)
- Technical Skills:
- Strong hands-on experience with public cloud platforms (Azure mandatory, AWS preferred).
- Proven experience operating and administering Kubernetes at scale in production environments.
- Strong experience with container orchestration platforms and cloud architecture fundamentals (networking, IAM/security concepts, and reliability patterns).
- Experience with Infrastructure as Code (Terraform preferred) and automation-first workflows.
- Familiarity with GitOps practices and CI/CD pipelines.
- Strong troubleshooting skills for distributed systems, including root-cause analysis and reliability improvements.
- Experience with observability concepts and practices (monitoring, logging, alerting, tracing).
- Experience with Service Mesh technologies (Istio preferred, App Mesh or Linkerd).
- Experience working with critical or mission-critical systems.
- Strong background applying SRE principles (operational readiness, incident management, runbooks, toil reduction).
U.S. Applicants OnlyThe estimated salary range for this position is $110,700.00 to $ 171,800.00 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity.Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401(k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.
Work HoursVaries upon the needs of the department.
Travel RequirementsThis position requires travel 5-10% of the time.
Mental/Physical RequirementsThis position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers.