About This Role
Gimlet Labs is seeking a Network Engineer to design, build, and scale the network infrastructure powering production-scale AI and distributed systems for frontier labs, hyperscalers, and other high-performance compute environments.
This is an opportunity to build the network foundation for systems serving real production traffic at massive scale, while also shaping the network architecture for the next generation of AI datacenters. You will help determine how future high-performance compute environments are designed, deployed, interconnected, and operated.
The ideal candidate has deep technical knowledge of modern data center networking and is comfortable operating across physical infrastructure, network architecture, deployment workflows, and operational troubleshooting. We are looking for someone who can operate independently, drive infrastructure improvements, and help build scalable networking foundations for high-performance compute and AI workloads.
What you will work on
- Design, deploy, and scale datacenter network infrastructure supporting AI workloads, distributed systems, and high-performance compute environments.
- Lead network provisioning, device configuration, connectivity validation, deployment testing, and production turn-up activities for new infrastructure builds and hardware expansions.
- Build and maintain scalable network topology designs, IP plans, deployment standards, operational documentation, and infrastructure readiness processes.
- Troubleshoot complex networking, routing, hardware, connectivity, and performance issues across physical infrastructure and distributed systems environments.
- Partner closely with infrastructure, systems, deployment, and operations teams to improve network reliability, deployment velocity, operational readiness, and infrastructure scalability.
- Drive automation and operational improvements across provisioning, configuration management, monitoring, deployment validation, and incident response workflows.
You may be a good fit if you
- Have experience designing, deploying, and operating production network infrastructure.
- Have strong networking fundamentals and can reason about routing, switching, connectivity, performance, and reliability issues.
- Are comfortable troubleshooting complex problems that span hardware, software, and distributed systems boundaries.
- Enjoy building systems and improving operational processes through automation.
- Thrive in highly collaborative environments and can work effectively across engineering and infrastructure teams.
- Take ownership of problems end-to-end and are comfortable operating in ambiguous, fast-moving environments.
Strong candidates may also have
- Experience with AI, HPC, GPU, or large-scale distributed infrastructure.
- Experience with Arista, Cisco, Juniper, or NVIDIA networking platforms.
- Experience with network automation using Python, Ansible, Terraform, or similar tooling.
- Familiarity with RDMA, RoCE, InfiniBand, or high-performance networking environments.