Veeam Software

GOV Site Reliability Engineer

Veeam Software$109K — $183K *
US-AnywhereRemote in United States
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years in Software Engineering with minimum 1 year in SRE, Platform Engineering, or DevOps on cloud services.
  • Experience with cloud infrastructure on Azure or similar provider.
  • Familiarity with regulated environments like government or healthcare, understanding compliance's operational impact.
  • Ability to read and understand code independently to investigate system behavior.
  • Experience with monitoring tools (e.g., Prometheus, Grafana) and IaC tools (e.g., Terraform).
  • Strong programming skills in languages such as TypeScript/JS, Go, Java, or C#.
  • Solid understanding of distributed systems and networking fundamentals.

Responsibilities

  • Get familiar with VDC workloads and operational workflows by reviewing code and documentation.
  • Write and update runbooks, incident guides, and operational documentation.
  • Participate in incident triage, investigation, and postmortems for reliability.
  • Help define and maintain SLIs, SLOs, and error budgets for the team.
  • Implement monitoring and alerting to close observability gaps.
  • Improve operational efficiency through automation and tooling.
  • Collaborate with teams to enhance reliability and communicate system status effectively.

Benefits

  • Unlimited paid time off and 12 paid holidays including global self-care days.
  • Paid parental leave for all types of parents, with extended leave for birthing parents.
  • Comprehensive medical, dental, and vision coverage starting on the first day.
  • Access to mental health support and therapy through the Employee Assistance Program.
  • 401(k) retirement plan with company matching contributions.
  • Support for fertility, adoption, and surrogacy, plus paid volunteering time.
  • 24/7 virtual veterinary care at no cost through AirVet.
  • Legal services and identity protection options available.
  • Tax-advantaged spending accounts for various healthcare needs.
  • Opportunities for professional development through workshops and learning resources.
Full Job Description
Site Reliability Engineer - Government & Sovereign Cloud

Veeam is building a global SRE function to support the Veeam Data Cloud, our SaaS platform. This role is part of the team supporting our Government and Sovereign Cloud environment.

Success here requires a self-starter mindset - you'll need to be comfortable building your own context and tracking down information across a large, distributed engineering organization.

You'll work alongside senior engineers to execute on reliability work, close observability gaps, respond to incidents, and help maintain the operational foundation the team runs on.
What You Will Do
Discovery & Documentation
  • Get up to speed on VDC workloads, dependencies, and operational workflows by reading code, docs, and working with SMEs.
  • Write and maintain runbooks, incident guides, and operational documentation.
  • Support knowledge transfer and contribute to onboarding materials for the team.
Reliability & Incident Response
  • Participate in incident response including triage, investigation, mitigation, and postmortems.
  • Help implement and maintain SLIs, SLOs, and error budgets defined by the team.
  • Identify reliability issues during incidents or reviews and propose concrete improvements.
  • Support high availability and fault tolerance work on Azure, including Azure Government.
Observability
  • Close monitoring gaps by implementing instrumentation, alerting, and dashboards based on team standards.
  • Contribute to toil reduction through automation and tooling improvements.
  • Participate in on-call rotations.
Infrastructure & Delivery
  • Work with IaC, CI/CD pipelines, and deployment tooling in compliance-restricted environments.
  • Support testing, canary deployments, and release validation workflows.
  • Implement changes to infrastructure and configuration following established patterns and review processes.
Collaboration
  • Work with engineering, security, compliance, and operations teams to execute on reliability improvements.
  • Communicate clearly about system behavior, risk, and status - in writing and in meetings.
  • Raise blockers and gaps proactively; don't wait for problems to escalate.
What We Are Looking For
Required
  • 3+ years in Software Engineering, with at least 1 year in SRE, Platform Engineering, or DevOps working on cloud-hosted services.
  • Experience with cloud infrastructure on Azure or a comparable cloud provider.
  • Familiarity with regulated or compliance-oriented environments such as government (FedRAMP, CMMC), financial (PCI-DSS), or healthcare (HIPAA). You understand that compliance shapes what you can and can't do operationally.
  • Able to read and understand code well enough to investigate system behavior without always having someone walk you through it.
  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, OpenTelemetry, ELK stack).
  • Experience with IaC tools (Terraform, Terragrunt, or Pulumi) and container orchestration (Kubernetes).
  • Experience with CI/CD tooling such as GitHub Actions, Azure DevOps, GitLab CI, or ArgoCD.
  • Strong programming skills in one or more of: TypeScript/JS, Go, Java, C#, or similar.
  • Solid understanding of distributed systems fundamentals and networking basics.
  • Clear written and verbal communication skills.
Preferred
  • Experience in Government or Sovereign Cloud environments (e.g., Azure Government, AWS GovCloud).
  • Background in SaaS platforms or multi-tenant systems.
  • Familiarity with chaos engineering, resilience testing, or load testing.
  • Exposure to building or improving reliability practices on a team.
  • Familiar with AI-first development workflows using LLM-powered tools for automation, code generation, or documentation.
Why Join?
  • Work on a high-impact reliability practice for a growing GOV/Sovereign Cloud platform.
  • Learn from senior engineers while owning real work end-to-end.
  • Collaborate with strong teams across product, cloud engineering, security, and compliance.
  • Professional development resources including mentorship, training, and volunteer days.
  • Competitive compensation and benefits.


What you'll get
  • Unlimited paid time off, 12 paid holidays including 4 global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
  • Paid parental leave: 8 weeks for all parents, 16 weeks for birthing parents
  • Medical, dental, and vision coverage starting on your first day
  • Mental health support, therapy sessions, and digital wellness tools via our Employee Assistance Program
  • 401(k) retirement plan with company matching contributions
  • Fertility, adoption, and surrogacy support through Maven, plus paid volunteer time
  • AirVet: 24/7 virtual veterinary care at no cost
  • Legal services, identity protection, and supplemental health insurance options
  • Tax-advantaged spending accounts for healthcare, dependent care, and commuting
  • Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O'Reilly), mentoring, workshops, and learning events like our annual Global Day of Learning

Compensation Transparency

Veeam is committed to pay transparency and equitable compensation. For this role, the compensation range below reflects the expected total target compensation (TTC), inclusive of base pay and a competitive performance-based bonus. For roles with a commission plan, the compensation range represents On Target Earnings (OTE), which includes base salary plus variable commission. When determining compensation, Veeam takes into consideration factors such as experience, education, skills, and geographic zone. Offers are typically made below the midpoint of the range.

In addition to compensation, Veeam provides a comprehensive benefits package, including health coverage, retirement plans, and unlimited time off.

U.S. Geographic Zones & Compensation Ranges (TTC / OTE)

Zone 1: San Francisco Bay Area, New York City Boroughs

$151,500-$252,500 USD

Zone 2: Washington, California (excluding San Francisco Bay Area)

$138,900-$231,400 USD

Zone 3: Texas, Illinois, North Carolina, Colorado, Massachusetts, Pennsylvania, Virginia, Oregon, Nevada, Hawaii, New York (excluding NYC boroughs); Sales roles located in Georgia, Ohio, and Arizona

$126,300-$210,400 USD

Zone 4: All other US locations

$109,800-$183,000 USD

About Veeam Software

Veeam Software is a privately held information technology company that develops backup, disaster recovery and intelligent data management software for virtual, physical and multi-cloud infrastructures. The company's headquarters are in Baar, Switzerland, and it has offices in more than 30 countries. Veeam has more than 375,000 customers worldwide, including 82% of the Fortune 500 and 69% of the Global 2,000 enterprises. The company was founded in 2006 by Ratmir Timashev and Andrei Baronov.
Learn more about Veeam Software
Size
5,000 employees
Industry
Founded
2006

Similar Jobs

More Jobs at Veeam Software

More Information Technology Jobs

Find similar GOV Site Reliability Engineer jobs: