Baremetal Infrastructure Engineer About the roleNominal's self-hosted and on-prem deployments are becoming increasingly critical to our customers' success. As adoption grows, we're balancing customer implementations, reliability improvements, FedRAMP readiness, and infrastructure modernization, all while supporting deployments in highly constrained environments.
We're looking for someone who can become our in-house expert on Linux systems, bare metal infrastructure, and deployment reliability. You'll help customers deploy and operate Nominal in their own environments while also driving the internal improvements that make those deployments more repeatable, observable, and scalable.
This role sits at the intersection of DevOps, Site Reliability Engineering, and customer-facing infrastructure. One day you might be debugging networking issues on a customer rack. The next, you might be building provisioning automation, improving Kubernetes upgrade paths, or implementing observability features that eliminate future support burdens.
You'll be a force multiplier for the team, reducing context switching, accelerating customer deployments, and helping us build a world-class self-hosted platform.
What You'll Do- Serve as a technical expert for customer-hosted and air-gapped deployments
- Travel onsite to support critical customer implementations when additional technical expertise is required
- Troubleshoot Linux, networking, Kubernetes, and infrastructure issues in production environments
- Partner directly with customer IT, infrastructure, and security teams to gather requirements and ensure successful deployments
- Build and improve deployment tooling, automation, and operational processes
- Improve reliability, observability, and maintainability across our self-hosted infrastructure
- Develop deployment documentation, runbooks, and troubleshooting guides
- Help shape Nominal's long-term strategy for self-hosted and regulated deployments
We're Looking ForMust Have:- Strong Linux systems expertise
- Experience operating and troubleshooting production infrastructure
- Background in DevOps, SRE, Platform Engineering, or Infrastructure Engineering
- Ability to independently debug complex systems across multiple layers of the stack
- Comfort working directly with customers and external stakeholders
- Willingness to travel for customer deployments
- Strong ownership mentality and ability to operate in ambiguous environments
Strong Pluses:- Active security clearance or ability and willingness to obtain and maintain one
- Experience with Kubernetes in production environments
- GitOps workflows and tooling (Flux, Helm, Kustomize, etc.)
- Infrastructure as Code experience
- Datacenter operations experience
- Hardware procurement, provisioning, or lifecycle management
- Networking expertise (routing, switching, troubleshooting, performance tuning)
- Linux kernel, networking, or performance optimization experience
- Security-focused infrastructure experience including TLS and certificate management
- Experience supporting air-gapped, classified, or highly regulated environments
Skills That Supercharge UsBare Metal & Provisioning- PXE
- IPMI
- BMC
- iLO / iDRAC
- MAAS
- Tinkerbell
- Metal provisioning workflows
Systems- Linux internals
- Kernel troubleshooting
- NUMA
- RDMA
- SR-IOV
- eBPF
Networking- BGP
- Routing
- Low-latency networking
- NIC offloading
- DPDK
Storage- Ceph
- NVMe
- Distributed storage systems
Infrastructure- Kubernetes
- GitOps
- Datacenter automation
- Rack provisioning
- Hardware orchestration
- On-prem infrastructure operations
Benefits/Perks- 100% coverage of medical, dental, and vision insurance
- Unlimited PTO and sick leave
- Free lunch, snacks, and coffee
- Professional Development Stipend
- In-office hardware lab with a $250 project stipend
- Annual company retreat
CompensationThe base pay range for this role is $120,000 - $230,000 per year.