Senior Site Reliability Engineer (In-Office Required)

Nebius

$156K - $262K
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5-8 years in DevOps or SRE roles in production settings
  • Experience with designing and operating large-scale distributed systems
  • Strong proficiency in Kubernetes within a managed cloud environment
  • Solid knowledge of infrastructure as code, especially Terraform or similar tools
  • Familiarity with GitOps deployment workflows
  • Experience in building and maintaining observability stacks
  • A calm and methodical approach to handling production incidents

Responsibilities

  • Manage Kubernetes clusters across multiple environments and regions
  • Own infrastructure as code for all resources
  • Maintain and improve CI/CD pipelines
  • Optimize real-time data pipelines processing billions of events daily
  • Build monitoring, alerting, and observability systems
  • Debug production issues across services
  • Manage cloud costs and perform capacity planning
  • Collaborate closely with a small engineering team to own all infrastructure

Benefits

  • 100% company-paid medical, dental, and vision insurance for employees and their families
  • 401(k) plan with up to 4% company match and immediate vesting
  • 20 weeks paid parental leave for primary caregivers, 12 weeks for secondary caregivers
  • Monthly reimbursement of up to $85 for mobile and internet-related costs
  • Company-paid short-term disability, long-term disability, and life insurance coverage

Full Job Description
The Role: Senior Site Reliability Engineer
  • Managing Kubernetes clusters across multiple environments and regions
  • Owning infrastructure as code for all resources
  • Maintaining and improving CI/CD pipelines and GitOps-based deployments
  • Maintaining and optimizing real-time data pipelines that process billions of events per day across distributed queues and stream processors
  • Building out monitoring, alerting, and observability
  • Debugging production issues across services
  • Managing cloud costs and capacity planning
  • Working closely with a small engineering team - you'd own infra, not a slice of it

What we're looking for
  • 5-8 years in a DevOps or SRE role, working in production environments
  • Proven experience designing and operating large-scale, distributed systems, with a solid understanding of API design, reliability, and performance at scale
  • Strong Kubernetes experience in a managed cloud environment
  • Proficiency with infrastructure as code (Terraform or similar)
  • Experience with GitOps-based deployment workflows
  • Experience building or maintaining observability stacks (logging, metrics, alerting)
  • Experience handling production incidents calmly and methodically

Nice to have:
  • Multi-region deployments
  • Search infrastructure
  • Data pipeline experience (streaming, warehousing)
  • Proxy/networking infrastructure at scale

Why Tavily?
  • Full ownership - small team, you own the entire infrastructure, not a slice of it
  • Real scaling challenges - bursty scraping workloads, cache invalidation, multi-region, millions of daily requests
  • AI-native company - your infra directly powers AI agents used by leading companies in the space


Key employee benefits in the US:
  • Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
  • 401(k) plan: Up to 4% company match with immediate vesting.
  • Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
  • Remote work reimbursement: Up to $85/month for mobile and internet.
  • Disability & life insurance: Company-paid short-term, long-term and life insurance coverage.


Pay Transparency

We offer competitive compensation and benefits packages. Actual compensation will be determined based on job-related factors, including experience, skills, qualifications, the level at which the candidate is hired, and geographic location, consistent with applicable law.

Base Compensation Range

$156,000-$262,000 USD

Benefits & Perks:
  • Competitive compensation
  • Career growth and learning opportunities
  • Flexibility and ownership
  • Collaborative and innovative culture
  • Opportunity to work on impactful AI projects
  • International environment and talented teams
