Kubernetes Platform Engineer (Remote)

Oxley Enterprises®, Inc.

$66K — $110K *
US-AnywhereRemote in Stafford, VA
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years managing AWS EKS clusters in production environments
  • Bachelor's Degree in computer science or related field
  • Expertise in Kubernetes operations and cluster management
  • Proficient in Terraform for Infrastructure as Code governance
  • Experience with Istio service mesh management
  • Strong skills in monitoring with Prometheus and Grafana
  • Current Federal Civilian Public Trust clearance

Responsibilities

  • Operate and maintain AWS EKS clusters for production, staging, and sandbox
  • Detect and remediate node failures promptly
  • Manage the lifecycle of EKS cluster add-ons
  • Implement and uphold Infrastructure as Code practices
  • Enforce policies for no manual changes in production
  • Participate in daily Change Control Board meetings
  • Contribute to monthly reporting on cluster health and compliance

Benefits

  • Medical, dental, vision, and prescription drug coverage for you and your family
  • Life insurance, short-term, and long-term disability provided by the company
  • Supplemental coverages such as Accident and Critical Illness insurance
  • 401k plan with multiple options for retirement planning
Full Job Description
The following states/districts are excluded from this job ad: AK, CA, CO, CT, DC, HI, LA, MA, MN, MO, NE, NV, NH, NJ, NM, NY, ND, OR, PR, RI, VT, WA, WY

Future Need - Actively Interviewing

Location: Remote in any United States jurisdiction not excluded from this job advertisement.

Keep the engine running on a complex Kubernetes environment! As a Kubernetes Platform Engineer, you will operate and maintain AWS EKS clusters supporting 300+ applications across production, staging, and sandbox environments.

Position Description: The Kubernetes Platform Engineer supports the day-to-day operation, maintenance, and continuous improvement of AWS EKS clusters including cluster lifecycle, node operations, add-on version compliance, namespace administration, Istio service mesh operations, and Infrastructure as Code (IaC)-driven configuration.

Minimum/General Experience: 3 years of experience in Kubernetes platform engineering and managing AWS EKS clusters in production environments

Minimum Education: Bachelor's Degree in computer science, information technology, systems engineering, or related field

Essential Skills/Qualifications:
  • Excellent experience managing AWS EKS clusters including managed node groups, Fargate profiles, cluster upgrades, add-on lifecycle management, and multi-cluster operations at enterprise scale
  • Excellent knowledge of Kubernetes cluster operations including namespace administration, resource quotas, limit ranges, pod disruption budgets, HPA/VPA, and cluster autoscaler configuration
  • Excellent ability to detect and remediate node failures within 30 minutes using cordon, drain, and replace procedures
  • Excellent experience with Istio service mesh operations including control plane management, mTLS enforcement, traffic management, virtual services, and sidecar injection policy on EKS
  • Excellent experience implementing and maintaining 100% IaC governance using Terraform for all cluster configuration
  • Excellent knowledge of Kubernetes add-on lifecycle including CoreDNS, kube-proxy, CNI, AWS Load Balancer Controller, and EBS/EFS CSI drivers
  • Above average experience with Kubernetes monitoring and observability including Prometheus, Grafana, Dynatrace, log aggregation, and distributed tracing
  • Working knowledge of EKS multi-region deployment patterns including cluster federation and cross-region service discovery
  • Experience supporting a federal agency
  • Excellent verbal and written communication skills

General Physical Requirements needed to perform the essential functions of this job may vary based on the location of the assignment.
  • Assignment Location - Remote
  • Sedentary Work - Exerting up to 10 pounds of force occasionally and/or a negligible amount of force frequently or constantly to lift, carry, push, pull or otherwise move objects.
  • Typing, communicating, repetitive motions.
  • Close visual acuity to prepare and analyze data, view computer monitors and read. May need to view presentation screens and other visual aids in a virtual setting.
  • Inside environmental conditions with protection from outside elements.

Security: Active Federal Civilian Public Trust clearance
  • U.S. Citizenship or Permanent Resident that has lived in the United States for at least 3 years

Federal Civilian Public Trust Consists of a review of up to but not limited to:
  • Covers 10 year period and in some instances lifetime events
  • OPM Security Investigations Index (SII)
  • DOD Defense Central Investigations Index (DCII)
  • National Agency Check (NAC) records
  • FBI name check
  • FBI fingerprint check
  • Credit report check
  • Written inquiries to previous employers and references listed on the application for employment
  • Potential interviews with the subject, spouse, neighbors, supervisor, coworkers
  • Law enforcement check
  • Court records check
  • Education check - Attendance and Degrees


Tasks/activities include, but are not limited to:
  • Operates and maintains all AWS EKS clusters across production, staging, and sandbox
  • Detects and remediates node failures within 30 minutes using cordon, drain, and replace
  • Manages EKS cluster add-on lifecycle ensuring 100% of add-ons (e.g., CoreDNS, kube-proxy, CNI, AWS Load Balancer Controller, EBS/EFS CSI) remain within one minor version of the EKS cluster version with no EOL/EOS components in production
  • Implements and maintains 100% IaC governance using Terraform for all cluster configuration and namespace management
  • Prohibits manual production changes except through approved break-glass procedures
  • Performs weekly drift detection
  • Manages Istio service mesh operations including control plane upgrades, mTLS policy enforcement, traffic routing, and sidecar injection governance
  • Configures and maintains autoscaling policies ensuring no production workload experiences resource saturation exceeding 80% for more than 5 minutes
  • Participates in daily Change Control Board (CCB) meetings for all Kubernetes cluster and namespace changes
  • Conducts post-implementation validation within 2 hours of each production change
  • Ensures all platform components are integrated with centralized logging and monitoring
  • Ensures no platform change is implemented without sufficient post-deployment production testing
  • Supports the Scheduling Event Bus delivery by providing Kubernetes infrastructure design, namespace provisioning, and operational support for event-driven workloads
  • Contributes EKS cluster health metrics, add-on compliance status, node utilization data, and incident reports to the Monthly Maintenance Report

Compensation & Benefits: The annual projected pay range for this position is $66,923 - $110,863 with consideration being given to various factors including but not limited to qualifications, experience, job responsibilities, and geographic location.

Oxley Enterprises, Inc. offers a full array of benefits including:
  • Medical, dental, vision and prescription drug coverage for you and your family.
  • Life Insurance, short-term disability and long-term disability paid for by the Company.
  • Supplemental coverages including Accident, Critical Illness, and Hospital.
  • Additional Life insurance coverage for you and your dependents.
  • 401k plan with various options to select based on your retirement goals.

Similar Jobs

More Jobs at Oxley Enterprises®, Inc.

More Information Technology Jobs

Find similar Kubernetes Platform Engineer (Remote) jobs: