Senior Site Reliability Engineer

LeanData • $130K — $160K *

Santa Clara, CA 95051 • In-Person

Information Technology

5 - 7 years of experience

More than 3 months ago

Job Overview by Ladders

Qualifications

5+ years in SRE, DevOps, or Systems Engineering with AWS expertise.
Proven leadership during outages with experience in blameless postmortems.
Extensive experience configuring New Relic (or a similar platform) to build observability dashboards, SLIs, and SLOs.
Strong background in automation with Terraform and infrastructure as code.
Ability to create a strategic roadmap for architectural best practices.
Collaborative approach to integrating reliability into engineering processes.
Bachelor's degree in Computer Science, Engineering, or related field.

Responsibilities

Lead AWS architectural modernization and transition to Infrastructure as Code using Terraform.
Design and implement disaster recovery and business continuity plans, and move services toward a zero-downtime deployment model.
Develop capacity planning and autoscaling strategies to optimize AWS resources.
Establish monitoring and alerting protocols using New Relic and IncidentIO for proactive issue resolution.
Refine CI/CD pipelines to enhance safety and predictability from code commit to production.
Fortify network architecture and application security measures including WAF management.

Benefits

Employee insurance premiums covered up to 90%.
Stock options available for all full-time employees.
Flexible PTO for work-life balance.
401K plan with employer contributions.

Full Job Description

LeanData helps the world's fastest-growing companies automate, simplify, and accelerate revenue.

We are looking for a Senior Site Reliability Engineer to lead the strategic evolution of our cloud infrastructure. Reporting directly to the SVP of Engineering, this role is designed for a builder - someone who wants to move beyond maintenance and into the realm of architectural transformation.

You will have the autonomy to evaluate our existing AWS footprint and lead the charge in modernizing our environment. Your mission is to take a high-velocity system and implement the best practices, guardrails, and automated architectures that will support our next 10x of scale. You will be the primary authority on reliability, performance, and infrastructure security.

Please note: This is a hybrid role based in our Santa Clara, CA office, with an in-office schedule of two days per week - Monday and Wednesday.

Key Responsibilities

Architectural Modernization: Lead the design and implementation of a scalable, "Cloud-First" AWS architecture. You will drive the transition toward fully automated, state-of-the-art Infrastructure as Code (Terraform).
High Availability & Resilience: Design and implement robust Disaster Recovery (DR) and Business Continuity plans, moving our services toward a zero-downtime deployment model.
Performance & Capacity Engineering: Own the strategy for capacity planning and autoscaling. You will optimize our compute resources (EC2, Lambda) to handle bursty traffic patterns with precision and cost-efficiency.
Advanced Observability: Define our monitoring and alerting philosophy using New Relic for deep APM and system insights. Pair this with IncidentIO so that we catch and resolve issues before they impact customers.
Streamlined CI/CD: Partner with feature teams to refine Change Management and CI/CD pipelines, ensuring code moves from "commit" to "production" safely and predictably.
Cloud Security: Harden our network architecture and application security posture, including WAF management and secure service-to-service communication.

The Tech Stack

Cloud Infrastructure: AWS (EC2, Lambda, SQS, SNS, ALB, API Gateway, S3, WAF).
Observability & Incident Response: New Relic (APM/Infrastructure), IncidentIO.
Automation & Tools: Terraform, Redis/ElastiCache, Shell Scripting, NPM/PM2.
Application Ecosystem: Node.js, Python, C#, Angular, Apex.
Integration: Salesforce Managed Packages, Microsoft Dynamics 365.

Who You Are

Experienced Architect: 5+ years of experience in SRE, DevOps, or Systems Engineering, with a proven track record of managing complex AWS environments.
Proven Incident Commander: You demonstrate calm, decisive leadership during high-pressure outages. You have extensive experience running blameless postmortems and, crucially, driving the remediation work needed to prevent recurrence.
Observability Pro: You have deep experience configuring New Relic (or similar platforms) to create meaningful dashboards, SLIs, and SLOs.
Automation Advocate: You believe that manual intervention is a bug. You have deep experience with Terraform and a "Code-First" approach to infrastructure.
Strategic Problem Solver: You can look at a complex, "needs-based" architecture and formulate a clear, prioritized roadmap to move it toward industry best practices.
Collaborative Leader: You enjoy working with feature engineers to help them build "reliability-by-design" into their services.
Education: A Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent professional experience).

Why work at LeanData:

LeanData covers employee insurance premiums up to 90%
Stock options in LeanData for all full-time employees
Flexible PTO
401K plan

About LeanData

LeanData is a provider of lead management software for enterprise businesses. The company's platform uses artificial intelligence and machine learning to help businesses manage their leads more effectively, by automating lead routing, lead matching, and lead-to-account matching. LeanData's customers include some of the world's largest and most successful companies, across a range of industries including technology, healthcare, and financial services.

Size

200 employees

Industry

Enterprise Technology

Founded

2012

* Ladders Estimates
