deepwatch

Manager Site Reliability Engineering

deepwatch$178K — $213K *
Tampa, FL 33647In-Person
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years in SRE, DevOps, or Platform Engineering with leadership experience
  • Proven cloud experience (AWS, GCP) and container orchestration (Kubernetes, Docker)
  • Strong coding/scripting skills (Python, GO) and proficiency in IaC and GitOps
  • Deep knowledge of observability tools and defining reliability metrics
  • Experience with incident handling and post-incident evaluations
  • Proven mentorship skills for junior/mid-level SRE talent
  • Familiarity with regulatory or cybersecurity frameworks (FedRAMP, NIST)

Responsibilities

  • Lead and develop the SRE team, fostering a culture of excellence
  • Design and manage cloud and containerized infrastructure using IaC (Terraform)
  • Implement CI/CD pipelines ensuring security and compliance
  • Build and define scalable observability systems and key metrics
  • Manage incident response, analyzing root causes for improvement
  • Drive capacity planning, performance tuning, and cost efficiency
  • Collaborate effectively with InfoSec and compliance teams

Benefits

  • Comprehensive medical, dental, vision, and disability insurance
  • Flexible Time Off (FTO) with additional company holidays and sick leave
  • Unique professional development benefits with funding for growth initiatives
  • Wellness contests and educational programs offered monthly
  • 401(K) retirement program
Full Job Description
Manager, Site Reliability Engineering

Reports to: VP, Product Engineering

Lead the architecture, automation, and reliability of secure, scalable cloud infrastructure (AWS, GCP) and developer platforms within a cybersecurity context. Inspire DevOps excellence, deliver high availability, and drive operational resilience-all while mentoring a high-caliber SRE team. Lead and grow a small high caliber global SRE Team, managing US based engineers. Lead the architecture, automation, and reliability of secure, scalable cloud infrastructure (AWS, GCP) and developer platforms within a cybersecurity context. Inspire DevOps excellence, deliver high availability, and drive operational resilience-all while mentoring and managing a high-caliber SRE team.

What You'll Do:
  • Lead and grow the SRE team, setting direction, mentoring and managing engineers, and fostering excellence.
  • Design and manage cloud and containerized infrastructure with IaC (Terraform).
  • Implement robust CI/CD pipelines integrating security and compliance.
  • Build scalable observability systems, leading the definition of SLIs / SLOs and dashboards.
  • Manage incident response, root cause analysis, and postmortems; automate recovery via playbooks/runbooks.
  • Drive capacity planning, performance tuning, and cost efficiency.
  • Collaborate with InfoSec, DevSecOps, and Compliance teams-ensuring alignment with frameworks like FedRAMP, NIST, RMF.
  • Support program-level initiatives, communicating effectively with stakeholders.
  • Promote a culture of reliability, security, and developer efficiency.
  • Maintain an active 'player' role, dedicating approximately 75% of your time to hands-on engineering (design, coding, and architecture) and 25% to leadership, mentorship, and management.

What You'll Bring:
  • 8+ years in SRE, DevOps, or Platform Engineering; with technical leadership experience ready to step into management as a player/coach.
  • Proven cloud experience (AWS, GCP) and container orchestration (Kubernetes, Docker).
  • Strong coding/scripting (Python, GO) and proficiency in IaC and GitOps.
  • Deep knowledge of observability tools and defining reliability metrics.
  • Experienced in incident handling (PagerDuty, Datadog) and post-incident evaluations.
  • Demonstrated success in mentoring and developing junior/mid-level SRE talent, moving beyond delegation to hands-on technical coaching.
  • Familiarity with regulatory or cybersecurity frameworks (FedRAMP, NIST, STIGs, RMF).
  • Excellent cross-functional communication and stakeholder management.
  • Preferred: certifications such as AWS, CKA, or cyber security credentials (e.g., OSCP).

Statutory Pay Disclosure:

The anticipated salary range for this role is $178,00 - $213,000 + bonus + stock options + benefits. Actual compensation may vary from posted hiring range based upon geographic location, work experience, education, and/or skill level.

ITAR Compliance

This position will have access to customer data and as such is subject to International Traffic in Arms Regulations (ITAR). Upon application, candidates will be asked to confirm that they are a U.S. Person as defined by the following:
  • A citizen of the U.S.;
  • A lawful permanent resident of the United States;
  • A person admitted to the United States as a refugee; or
  • A person that has been granted asylum by the United States government.

The intent of this requirement is not to verify employment eligibility overall, but to ensure compliance with import/export regulations. If you do not meet these requirements, we encourage you to apply for other open roles at Deepwatch. This information will be verified upon offer of employment.

What We Offer:

Deepwatch is excited to provide benefits designed to support team members and their families. Including:
  • Medical, dental, vision, and disability insurance
  • Flexible Time Off (FTO), 12 company holidays, sick leave and 8-Weeks Paid Parental Leave
  • Unique professional development benefits with Annual "development dollars" to support our people growth and development
  • Wellness contests and monthly educational programs
  • 401(K) retirement program
  • Learn more here: Deepwatch Benefits

About deepwatch

deepwatch is a cybersecurity company that provides managed security services. The company was founded in 2015 and is headquartered in Denver, Colorado. deepwatch offers a range of cybersecurity services, including threat detection and response, vulnerability management, and compliance management. The company uses artificial intelligence and machine learning to provide advanced threat detection and response capabilities. deepwatch is committed to providing exceptional customer service and helping its clients improve their cybersecurity posture.
Learn more about deepwatch
Size
100 employees
Industry
Founded
2015
Revenue
$10 million

Similar Jobs

More Jobs at deepwatch

More Information Technology Jobs

Find similar Manager Site Reliability Engineering jobs: