Software Engineer - Developer Platform

SF Compute

$120K — $160K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of EKS/Kubernetes experience managing clusters
  • Proficient in infrastructure-as-code tools like Terraform or similar
  • Familiar with CI/CD systems such as GitHub Actions or ArgoCD
  • Strong understanding of what 'production-like' environments entail
  • Ability to design and scope pre-production environment requirements
  • Nice to have: experience with GPU workloads or bare metal networking

Responsibilities

  • Design and build a pre-production EKS cluster that mimics production performance
  • Own the infrastructure-as-code setup for the cluster
  • Integrate the cluster into CI/CD pipelines for validation before production deployment
  • Define promotion gates for changes to be eligible for production
  • Collaborate with cross-functional engineering teams to identify testable aspects
  • Evolve internal developer tooling for efficient building, testing, and shipping
  • Drive the migration from managed platforms to self-hosted solutions

Benefits

  • Generous equity grant
  • Visa sponsorship available
  • 401(k) retirement matching up to 4%
  • 100% coverage of medical, dental, and vision insurance premiums for employees and dependents
  • Unlimited paid time off plus 10+ observed holidays
  • Paid parental leave for biological, adoptive, and foster parents
  • Daily lunch covered for employees
  • Unlimited budget for office books
Full Job Description
About the Tooling Team

We are a small team focused on making SFCompute engineering faster, more observable, and more reliable. Our work spans data infrastructure, developer experience, pre-production environments, and AI tooling - but the common thread isn't any specific domain. It's that we find the problems nobody else owns and make them solved problems.

Everyone on this team wears many hats. You'll work across the stack, collaborate with all parts of engineering, and regularly take on problems that don't fit neatly into a job description. If you want a narrow scope and a clear ticket queue, this team isn't it. If you want to have a large, legible impact on a small team building serious infrastructure, read on.

The Role

We're looking for a platform engineer who cares about the full pre-production experience - not just staging clusters, but the entire ecosystem of tooling that makes development fast and safe. Right now the gap between dev and prod is a real frustration. You'll close it. That means building a realistic staging environment, but it also means owning internal developer tooling, improving deployment pipelines, and eventually getting us off managed platforms like Vercel where self-hosting makes sense.

What You'll Do
  • Design and build a pre-production EKS cluster that mirrors production fidelity without production cost
  • Own the infrastructure-as-code for the cluster (Terraform, Helm, or equivalent)
  • Integrate the cluster into CI/CD pipelines so changes are validated before they reach prod
  • Define promotion gates what has to pass in pre-prod before a change is eligible for production
  • Collaborate with platform and application engineers to understand what needs to be testable
  • Own and evolve internal developer tooling that improves how the team builds, tests, and ships
  • Drive migration off managed platforms (like Vercel) where self-hosting is the right call
  • Explore and implement A/B testing and feature flagging infrastructure to support safer, incremental rollouts
  • Monitor and maintain the pre-production environment over time


What We're Looking For
  • Hands-on EKS / Kubernetes experience you've provisioned and operated clusters, not just deployed workloads onto them
  • Experience with infrastructure-as-code tools (Terraform, CDK, or similar)
  • Familiarity with CI/CD systems (GitHub Actions, ArgoCD, or similar)
  • Strong operational instincts you know what "production-like" means and how to approximate it affordably
  • You can scope your own work. The pre-prod environment doesn't exist yet, so the first job is figuring out what it actually needs to be
  • Nice to have: experience with GPU workloads, bare metal networking, or marketplace-style platforms


Why This Role

We're shipping real workloads to bare-metal GPU clusters, and right now we validate too many infrastructure changes in production. That's the problem this role exists to solve. The cluster you build will be the default environment for every infrastructure change the team makes going forward. You'll own the design, the tooling, and the standards, with full backing from engineering leadership to do it right.

Benefits

Generous equity grant

Team members are offered a competitive salary along with equity in the company

Visa Sponsorships

Yes, we sponsor visas and work permits

Retirement matching

We match 401(k) plans up to 4%

Medical, dental & vision

We offer competitive medical, dental, vision insurance for employees and dependents and cover 100% of premiums

Time off

We offer unlimited paid time off as well as 10+ observed holidays

Parental leave

We offer biological, adoptive, and foster parents paid time off to spend quality time with family

Daily lunch

We cover lunch daily for employees

Unlimited office book budget

You can buy as many books for the office as you want

Similar Jobs

More Jobs at SF Compute

  • Human Resources Manager
    $120K — $150K *
    San Francisco, CA 94112 (San Francisco County)
    Business Services
    In-Person
  • Deal Architect
    $120K — $180K *
    San Francisco, CA 94112 (San Francisco County)
    Business Services
    In-Person
  • Senior Accounting Manager
    $130K — $160K *
    San Francisco, CA 94112 (San Francisco County)
    Finance & Insurance
    In-Person
  • Software Engineer - Developer Platform
    $120K — $160K *
    San Francisco, CA 94112 (San Francisco County)
    Information Technology
    In-Person

More Information Technology Jobs

Find similar Software Engineer - Developer Platform jobs: