Coinbase

Senior Site Reliability Engineer, Core AI Infrastructure

Coinbase$186K — $218K *
US-AnywhereRemote in United States
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of cloud infrastructure automation experience (AWS) and network environments.
  • Hands-on experience with infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, Salt).
  • Demonstrated ability deploying and managing containerized applications using Docker and Kubernetes.
  • Proficient in a programming/scripting language (Python, Bash, Ruby, or Go) and Git-based CI/CD workflows.
  • Experience leading incident response and implementing reliability improvements under strict SLAs.

Responsibilities

  • Own reliability, monitoring, and incident response for AI infrastructure services.
  • Build automation tools to enhance operational workflows and deployment speed.
  • Collaborate with Infrastructure and Security teams to integrate monitoring within CI/CD pipelines.
  • Enhance observability and documentation standards in IT engineering practices.
  • Develop full-stack applications for AI products and infrastructure using Go or Python.

Benefits

  • Medical, dental, and vision insurance.
  • 401(k) retirement plan.
  • Equity and bonus eligibility.
  • Flexible working conditions in a remote-first environment.
Full Job Description
You'll join a high-performing team of engineers driving AI transformation at Coinbase as a Senior Site Reliability Engineer on the IT Operations team. This team builds and scales the infrastructure powering Coinbase's AI products, with direct exposure to senior leadership in a fast-paced, incubator-style environment. You'll own the reliability and automation of critical AI infrastructure, ensuring our systems are resilient, observable, and secure at scale. **What you'll be doing (ie. job duties):** - Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros. - Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments. - Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines. - Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence. - Develop full-stack applications that power internal AI products and infrastructure with Go or Python. **What we look for in you (ie. job requirements):** - 5+ years of experience automating and supporting cloud infrastructure (AWS) and network environments, with hands-on use of infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, or Salt). - Proven experience deploying, managing, and troubleshooting containerized workloads using Docker and Kubernetes in production environments. - Proficiency in at least one scripting or programming language (Python, Bash, Ruby, or Go) and version control workflows using Git-based CI/CD pipelines. - Track record of leading incident response in environments with strict SLAs, including root cause analysis, blameless retros, and measurable reliability improvements. - Utilizes generative AI responsibly, maintaining human oversight to deliver business-ready outputs and drive measurable improvements in workflow efficiency, cost, and quality. **Nice to haves:** - Expertise with linux, bash, ruby, python and/or go - Expertise automating EC2 or containers deployment with terraform - Strong network security fundamentals - Experience managing and leveraging log aggregation - Experience working in a highly regulated environment - Experience in a fast-paced, high-growth company - Experience in a Remote-first IT environment Position ID: P76832 **Pay Transparency Notice:** Base salary varies by location (see range below). Total compensation may also include equity and bonus eligibility, and benefits (medical, dental, vision, 401(k)). Annual base salary range (excluding equity and bonus): $186,065-$218,900 USD Please be advised that each candidate may submit a maximum of four applications within any 30-day period. We encourage you to carefully evaluate how your skills and interests align with Coinbase's roles before applying.

About Coinbase

Coinbase is a digital currency exchange that allows users to buy, sell, and store cryptocurrencies like Bitcoin, Ethereum, and Litecoin. The company was founded in 2012 by Brian Armstrong and Fred Ehrsam and is headquartered in San Francisco, California. Coinbase has over 56 million verified users in over 100 countries and has facilitated over $335 billion in trades. Coinbase offers a variety of services, including a cryptocurrency wallet, a trading platform, and an API for developers. Coinbase is known for its user-friendly interface and high level of security. Coinbase has raised over $547 million in funding from investors like Andreessen Horowitz, Greylock Partners, and the New York Stock Exchange.
Learn more about Coinbase
Size
1,200 employees
Industry
Founded
2012

Similar Jobs

More Jobs at Coinbase

More Information Technology Jobs

Find similar Senior Site Reliability Engineer, Core AI Infrastructure jobs: