CentralSquare

Lead Site Reliability Engineer - Remote

CentralSquare
$120K - $150K *
US-Anywhere
Remote in United States
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5-8+ years of experience in cloud, DevOps, SRE, or systems engineering roles
  • Hands-on expertise with AWS services like EC2, VPC, IAM, and others
  • Proven experience with Infrastructure as Code, preferably Terraform
  • Strong scripting skills in Python, Bash, or PowerShell
  • Solid understanding of networking fundamentals including TCP/IP and VPNs
  • Experience with Linux systems in production environments
  • Familiarity with monitoring and logging platforms like Datadog or CloudWatch

Responsibilities

  • Design, build, and maintain AWS-based infrastructure for production and non-production
  • Implement and maintain Infrastructure as Code using Terraform or equivalent
  • Develop CI/CD pipelines for both infrastructure and applications
  • Partner with application teams to enhance deployment reliability
  • Create automation scripts to minimize manual operations
  • Improve system reliability through monitoring and self-healing mechanisms
  • Participate in on-call rotations and troubleshoot cloud incidents

Benefits

  • Collaboration with multiple teams including CloudOps, Networking, and Application teams
  • Opportunity to work on mission-critical systems
  • Hands-on approach with significant impact on the cloud environment
  • Focus on automation to reduce manual tasks
  • Involvement in high-availability production workloads

Full Job Description

The Opportunity

We are seeking a highly skilled Senior Cloud / DevOps Engineer with a strong background in AWS, automation, infrastructure as code, and networking to support and modernize our cloud environments. This role is hands-on and will partner closely with Cloud Operations, SREs, Networking, and Application teams to improve scalability, reliability, security, and operational efficiency across mission-critical systems.

The ideal candidate is comfortable operating at both the infrastructure and application layers, has strong troubleshooting skills, and can automate repeatable operational tasks while supporting high-availability production workloads.

Key Responsibilities

Cloud & DevOps Engineering
  • Design, build, and maintain AWS-based infrastructure supporting production and non-production environments
  • Implement and maintain Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or equivalent
  • Develop and support CI/CD pipelines for infrastructure and application deployments
  • Partner with application teams to improve deployment reliability and performance

Automation & Reliability
  • Create and maintain automation scripts and tooling (Python, Bash, PowerShell, etc.) to reduce manual operations
  • Improve system reliability through self-healing mechanisms, monitoring, and alerting
  • Support SRE-style practices including incident response, root cause analysis, and continuous improvement

Networking & Security
  • Design and support cloud networking (VPCs, subnets, routing, VPNs, security groups, NACLs)
  • Troubleshoot complex network, connectivity, and performance issues across hybrid environments
  • Implement security best practices aligned with AWS Well-Architected Framework

Operations & Collaboration
  • Participate in on-call rotations supporting critical production systems
  • Provide operational support, troubleshooting, and resolution for cloud-related incidents
  • Collaborate across CloudOps, Networking, DBAs, and Application teams
  • Document architectures, runbooks, and operational procedures

What Success Looks Like in This Role
  • Reduced manual operational work through automation
  • Improved deployment reliability and production stability
  • Faster recovery and clearer root cause analysis during incidents
  • Strong partnership with CloudOps, Networking, and Application teams

Skills & Requirements

Required Qualifications

Technical Skills
  • 5-8+ years of experience in cloud, DevOps, SRE, or systems engineering roles
  • Strong hands-on experience with AWS (EC2, VPC, IAM, ELB/ALB, RDS, S3, CloudWatch, etc.)
  • Proven experience with Infrastructure as Code (Terraform preferred)
  • Strong scripting and coding experience (Python, Bash, PowerShell, or similar)
  • Solid background in networking fundamentals (TCP/IP, DNS, VPNs, routing, firewalls)
  • Experience with Linux-based systems in production environments
  • Familiarity with monitoring/logging platforms (Datadog, CloudWatch, LogicMonitor, etc.)

DevOps Tooling (one or more)
  • CI/CD tools (GitHub Actions, GitLab CI, Jenkins, Azure DevOps, etc.)
  • Configuration management and automation tools
  • Containerization and orchestration (Docker, ECS, EKS, Kubernetes - preferred but not mandatory)

Preferred Qualifications
  • AWS certifications (Solutions Architect, DevOps Engineer, or equivalent)
  • Experience supporting high-availability, regulated, or SaaS environments
  • SRE experience (error budgets, SLIs/SLOs, post-incident reviews)
  • Experience working in hybrid cloud or legacy-to-cloud migration environments
  • Strong documentation, communication, and cross-team collaboration skills


About CentralSquare

CentralSquare is a software company that provides public safety and public administration software. Its products are used by over 7,500 organizations in North America, including police departments, fire departments, and local governments. The software supports real-time communication and collaboration between agencies and departments, which is intended to improve response times and overall efficiency. The company was founded in 2018 and is headquartered in Tampa, Florida.
Size
5,000 employees
Founded
1981
