ABOUT THE TEAMThe Air Dominance & Strike team at Anduril develops aerial and multi-domain robotic systems. The team is responsible for taking products like Fury (unmanned fighter jet) and Barracuda (air-breathing cruise missile) from concept to product. The team also develops Lattice for Mission Autonomy, Anduril's premier software platform that enables masses of Fury, Barracuda, and other first and third party robots to collaborate across various missions. We work in close coordination with specialist teams like Perception, Motion Planning, Hardware, and Test Engineering to solve some of the hardest problems facing our customers. We are looking for software engineers to design and implement comprehensive monitoring, observability and alerting systems as well as build infrastructure automation to help manage large scale distributed systems.
REQUIRED QUALIFICATIONS- 5+ years of engineering experience with at least 3+ years focused on low level infrastructure problems, production operations, or infrastructure engineering
- Deep expertise with Kubernetes in production environments, including operational challenges at scale
- Strong programming skills in one or more languages such as Go, Python, Rust, or Java with ability to build production-grade tooling
- Hands-on experience with cloud platforms (AWS, Azure, or GCP) and infrastructure as code practices
- Demonstrated ability to debug complex distributed systems issues across multiple layers of the stack
- Track record of improving system reliability through architectural changes, not just operational band-aids
- Strong incident management and communication skills, with experience leading responses to critical outages
- Eligible to obtain and maintain an active U.S. Top Secret security clearance
PREFERRED QUALIFICATIONS- Experience with defense, aerospace, or other mission-critical systems where downtime has severe consequences
- Expertise in performance optimization and capacity planning for high-throughput, low-latency systems
- Knowledge of chaos engineering principles and experience implementing resilience testing frameworks
- Familiarity with CI/CD platforms and deployment automation (ArgoCD, FluxCD, Spinnaker, Jenkins)
- Understanding of networking fundamentals including load balancing, DNS, TLS/SSL, and network security
- Experience with configuration management and secrets management solutions (Vault, Sealed Secrets, SOPS)
- Strong written and verbal communication skills with ability to explain technical concepts to non-technical stakeholders
- Proven experience designing and implementing observability stacks (metrics, logging, tracing) using tools like Prometheus, Grafana, ELK/EFK, or equivalent
- Active Secret or higher security clearance
US Salary Range
$166,000-$220,000 USD
The salary range for this role is an estimate based on a wide range of compensation factors, inclusive of base salary only. Actual salary offer may vary based on (but not limited to) work experience, education and/or training, critical skills, and/or business considerations. Highly competitive equity grants are included in the majority of full time offers; and are considered part of Anduril's total compensation package. Additionally, Anduril offers top-tier benefits for full-time employees, including:
BenefitsAt Anduril, we invest in our people. Our comprehensive, competitive benefits package (available at little to no cost to employees) ensures you're supported in health, recovery, and whatever comes next. For more information, Explore Our Benefits.