Sr. Site Reliability Engineer

System One Holdings, LLC

$140K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in SRE, DevOps, or Cloud Engineering, specifically with Azure or AWS in production settings.
  • Strong skills in Linux/Unix administration and network troubleshooting.
  • Hands-on expertise in Terraform and CI/CD pipeline design/operation.
  • Proficiency in scripting/programming languages like Python, Go, Bash, or PowerShell.
  • Production experience managing Docker containers; Kubernetes experience is a plus.
  • Excellent communication and documentation skills, comfortable in a small team setting.
  • Ability to collaborate with development and data teams for standard definition and change management.
  • Must reside in Greater Cincinnati Metro area due to hybrid onsite requirement.

Responsibilities

  • Lead efforts on modernizing and migrating hosting across hybrid infrastructure (Azure, AWS, and on-prem).
  • Streamline infrastructure provisioning using Infrastructure as Code like Terraform and Ansible.
  • Build and enhance CI/CD pipelines to facilitate rapid and reliable software delivery.
  • Implement and manage observability platforms, setting operational standards for monitoring.
  • Operate and improve containerized workloads, applying SRE practices such as SLOs and error budgets.
  • Lead incident response efforts, perform root-cause analysis, and reduce manual work with automation.
  • Support DevSecOps initiatives including secure practices, backup strategies, and disaster recovery plans.
  • Evaluate and pilot new tools/technologies to ensure an up-to-date, efficient, and scalable infrastructure.

Benefits

  • Hybrid work model with 3 days onsite per week.
  • Opportunity to lead high-impact infrastructure projects.
  • Focus on infrastructure modernization and hosting migration initiatives.
  • Collaborative small team environment that allows for end-to-end ownership of projects.
  • Access to training on emerging tools and technologies in the SRE space.
Full Job Description
Job Title: Senior Site Reliability Engineer (SRE)
Location: Cincinnati, OH (Hybrid - 3 days onsite/week)
Type: Direct Hire
Compensation: $140,000 Range (based on experience)

We're seeking an experienced Senior Site Reliability Engineer to join a small, high-impact infrastructure team. This role blends software engineering and systems automation to scale reliable cloud and hybrid environments. You will own critical projects from design through deployment, with a major focus in your first year on infrastructure modernization and a hosting migration initiative.

Responsibilities

  • Lead modernization efforts and hosting migrations across hybrid infrastructure (Azure, AWS, and on-prem).
  • Streamline provisioning using Infrastructure as Code (Terraform, Ansible, PowerShell DSC).
  • Build and enhance CI/CD pipelines (GitHub Actions, Jenkins) to enable fast, reliable delivery.
  • Implement and manage observability/monitoring platforms (e.g., Prometheus, Grafana, Datadog) and establish operational standards.
  • Operate and improve containerized workloads (Docker); apply SRE practices such as SLOs and error budgets.
  • Lead incident response, perform root-cause analysis, and reduce toil through automation, scripting, and runbooks.
  • Support DevSecOps initiatives, including secure practices, backup strategy, and disaster recovery readiness.
  • Evaluate and pilot emerging tools/technologies to keep the infrastructure stack modern, efficient, and scalable.

Required Qualifications

  • 5+ years of experience in SRE, DevOps, or Cloud Engineering with production Azure or AWS exposure.
  • Strong Linux/Unix administration and network troubleshooting skills.
  • Hands-on expertise with Terraform and designing/operating CI/CD pipelines.
  • Scripting/programming proficiency in Python, Go, Bash, or PowerShell.
  • Production experience with Docker (Kubernetes is a strong plus).
  • Strong communication and documentation skills; comfortable owning work end-to-end in a small-team environment.
  • Ability to collaborate with application development and data engineering teams to define standards and manage change.
  • Must reside in the Greater Cincinnati Metro area (hybrid onsite requirement).

Preferred Qualifications

  • Observability tooling experience (Prometheus, Grafana, ELK, or similar).
  • Windows Server / Active Directory administration.
  • Experience with legacy Unix (AIX/Solaris).
  • Database experience (Oracle/MS SQL) or BI platform exposure (Snowflake/Azure Fabric).
  • Relevant certifications (Azure, AWS, and/or CKA).


#LI-DH1

Ref: #861-Cincinnati-S1

Similar Jobs

More Jobs at System One Holdings, LLC

More Information Technology Jobs

Find similar Sr. Site Reliability Engineer jobs: