About the RoleWe are building and operating large-scale infrastructure platforms to support high-performance AI workloads across multiple data centers. Our environment spans compute, storage, networking, Kubernetes, and internal platform services.
We are developing a growing ecosystem of internal tooling and services-including messaging systems, automation platforms, and control-plane integrations-that tie our infrastructure together.
We are looking for a
DevOps Software Engineer to build and maintain the
software layer that connects our infrastructure systems. This role focuses on developing integrations, services, and tooling that enable automation, orchestration, and communication across platforms.
This is not a traditional application development role. You will work directly with infrastructure systems and DevOps engineers to build reliable, production-grade platform integrations.
What You'll DoPlatform Integrations & Tooling- Design and build integrations between internal platform services (e.g., messaging/pub-sub systems), infrastructure systems (compute, storage, networking), third-party vendor platforms
- Develop services and tools that enable automation workflows, system coordination and orchestration, event-driven infrastructure operations
Software Development- Write production-quality code in Go, Python, Rust (where applicable)
- Build APIs, services, and background workers that interact with infrastructure platforms, CI/CD systems, automation frameworks
- Ensure code is reliable, observable, maintainable
Infrastructure & Automation Integration- Integrate software with automation systems such as Ansible, Terraform, CI/CD pipelines (GitHub Actions, ArgoCD)
- Enable infrastructure workflows through APIs, event-driven systems, automation hooks
Cross-Team Collaboration- Work closely with DevOps engineers (infrastructure and automation), Development teams (application requirements)
- Translate infrastructure capabilities into usable APIs and services
- Help teams integrate their systems into platform workflows
Reliability & Observability- Build logging, metrics, and tracing into services
- Debug and resolve issues across distributed systems
- Ensure integrations are resilient and handle failure scenarios gracefully
Continuous Improvement- Identify gaps in platform integration and automation
- Build tooling that reduces manual work and improves system cohesion
- Contribute to standards for internal platform development
Who You AreRequired Qualifications- 5+ years of experience in software engineering, DevOps development, or platform engineering
- Strong programming experience in:
- Go and/or Python
- Rust is a strong plus
- Experience building:
- APIs
- services
- system integrations
- Strong understanding of:
- Distributed systems concepts
- Event-driven architectures
- Experience working with:
- Linux systems
- Infrastructure platforms
Preferred Qualifications- Experience integrating with:
- Infrastructure platforms (compute, storage, networking)
- Kubernetes environments
- Familiarity with:
- Message queues or pub/sub systems
- CI/CD systems (GitHub Actions, ArgoCD)
- Experience with automation frameworks such as:
- Experience working with third-party APIs and vendor platforms
- Exposure to infrastructure at scale or CSP environments
What We Offer- 100% paid Medical, Dental, and Vision insurance for Employees
- Company Health Savings Account Contributions
- 100% paid Short Term and Long Term Disability Insurance for Employees
- Life and Voluntary Supplemental Insurance Options
- Other Insurance Options, such as Pet & Legal Insurance
- Various Supplementary Health Benefits, such as discounted Virtual Healthcare Appointments and Serious Illness Support
- Flexible Spending Account
- Employee Assistance Program