Full Job Description
The Modeling & Simulation (M&S) Orchestration/Kubernetes Engineer supports testing of complex systems within distributed Hardware-in-the-Loop (HWIL) and cloud-based simulation environments for the Missile Defense Agency (MDA). This role focuses on containerization, orchestration, and DevOps practices to ensure scalable, secure, and reliable deployment of simulation and test applications in support of mission-critical software development and test events.
As a Modeling and Simulation Orchestration/Kubernetes Engineer, your duties will include, but are not limited to, the following:
• Design, build, and maintain scalable, resilient Kubernetes clusters for simulation and test environments
• Deploy and manage containerized applications using Docker and Kubernetes, ensuring high availability and performance
• Develop and automate CI/CD pipelines using tools such as Jenkins, GitLab CI, or Azure DevOps
• Implement Infrastructure as Code (IaC) using Terraform, Ansible, or similar tools to provision and manage cloud and on-prem resources
• Monitor cluster health, system performance, and application metrics using tools such as Prometheus, Grafana, and Splunk
• Troubleshoot infrastructure and application issues in real time during test events
• Collaborate with development teams to streamline containerization and promote DevOps best practices
• Implement and maintain security controls including network policies, RBAC, vulnerability scanning, and compliance enforcement
• Support distributed simulation events, including occasional off-hours test activities
Required Qualifications:
• U.S. Citizenship
• Bachelor's degree in Computer Science, Engineering, Information Systems, or related field (equivalent professional experience considered)
• 2+ years of related experience
• Active SECRET security clearance with ability to obtain and maintain TS/SCI
• Experience supporting Kubernetes-based environments
• Experience with Docker and container orchestration (Kubernetes, EKS, GKE, or OpenShift)
• Experience developing or maintaining CI/CD pipelines
• Experience with Infrastructure as Code (Terraform, Ansible, or similar)
• Strong scripting skills (Python, Bash, or similar)
• Knowledge of Linux/UNIX environments and command-line tools
• Experience with Git and collaborative development workflows
Preferred Qualifications:
• Certified Kubernetes Administrator (CKA) or similar certification
• Experience working in cloud environments (AWS, Azure, or GCP)
• Experience supporting distributed or high-performance computing environments
• Familiarity with GPU-enabled workloads and CUDA architecture
• Experience integrating ML/AI workloads into containerized environments
• Experience building or supporting real-time or streaming data pipelines
• Familiarity with monitoring and observability best practices
• Experience with Linux development environments
• Experience with Docker and advanced container security practices
• Familiarity with large-scale data processing or distributed systems frameworks
Schedule: M-F; 8-5
Work Location: Huntsville, AL
Travel: 0-10%
Relocation Assistance Available: No
Position Contingent Upon Award of Contract: No
Benefits:
Torch Technologies is proud to offer a stable and professional work environment, a competitive salary, and an excellent, comprehensive benefits package including: ESOP participation, 401(k) match, medical, dental, vision, life insurance, short-term disability, long-term disability, flexible spending accounts, Health Savings Accounts and Health Reimbursement Accounts, EAP, education assistance, paid time off, and holidays.