SRE/DevOps Engineer

Versana

• $120K — $150K *

New York, NY 10025Hybrid

Information Technology

5 - 7 years of experience

1 week ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years as a Site Reliability Engineer or similar role
3+ years of experience with public cloud (Azure, AWS, GCP)
3+ years in observability tools (Datadog, Elasticsearch, Grafana)
3+ years in containerization and orchestration (Docker, Kubernetes)
2+ years developing and managing CI/CD pipelines
2+ years with Infrastructure-as-Code tools (Terraform, Azure Bicep)
1+ year with site reliability tools (Gremlin, Chaos Mesh)

Responsibilities

Design, implement, and enhance system observability and monitoring tools
Monitor system performance and create incident response plans
Implement service-level objectives (SLOs) and indicators
Improve system reliability and resiliency
Conduct post-incident reviews and implement changes
Assist teams in implementing observability tools
Leverage observability for key incident management metrics
Optimize systems and workflows through architecture and automation
Collaborate with developers to ensure DevOps best practices

Benefits

Flexible working hours
Opportunity for remote work
Professional development opportunities
Support for certifications in cloud technologies
Participation in innovative engineering projects

Full Job Description

About You:
Versana is seeking a motivated SRE/DevOps Engineer with strong observability experience to join
our growing Platform Engineering squad. The squad's goal is to manage public cloud, improve
DevOps practices, and monitor Versana's real-time syndicated loan data platform. The ideal
candidate will have a deep understanding of cloud-native applications, distributed computing,
CI/CD implementation, observability tools and practices.

Key Responsibilities:
• Design, implement and enhance system observability and monitoring tools
• Monitor system performance, create incident response plans, and implement observability
practices to gain insights into system behavior.
• Implement and monitor service-level objectives (SLOs) and indicators.
• Improve system reliability and resiliency.
• Conduct post-incident reviews and implement necessary changes to prevent system
failures.
• Assist teams in implementing observability tools and leveraging available telemetry data to
troubleshoot and resolve incidents and problems.
• Leverage observability and event management to improve key incident management
metrics, such as mean time to detect and mean time to restore services.
• Continually optimize systems and workflows by improving architecture, infrastructure,
automation, CI/CD, and observability.
• Collaborate with developers to ensure applications are designed with DevOps best
practices in mind.
• Participate in a rotating on-call schedule for weekend releases and being available to
respond to production issues outside of regular working hours, including weekends and
holidays.

Must Have:
• 5+ years of experience as a Site Reliability Engineer or similar role.
• 3+ years of work experience with public cloud (Azure, AWS or GCP).
• 3+ years of direct experience with observability tools like Datadog, Elasticsearch, and
Grafana Labs, etc.
• 3+ years of experience with containerization and orchestration technologies like Docker
and Kubernetes.
• 2+ years of experience in development and management of CI/CD pipelines (e.g., Azure
DevOps, Gitlab CI/CD, Github Actions, Jenkins, etc).
• 2+ years of experience with Infrastructure-as-code tools like Terraform, Azure Bicep, Cloud
Formation, etc.
• 1+ years of experience with site reliability tools like Gremlin, Chaos Mesh, or similar.
• Proven track record leveraging core observability concepts, end-user monitoring, and
infrastructure monitoring with SaaS solutions.
• Experience with messaging services like Kafka or Azure Event Hubs.
• Good understanding of the Linux operating system.

Nice to Have:
• Experience in at least one coding language such as Java, JavaScript, Python, GoLang, or .NET.
• Certifications in cloud technologies.
• Experience with Azure cloud or Azure DevOps.
• Experience with Datadog or similar modern observability tools.

* Ladders Estimates

Similar Jobs

Site Reliability Engineer
$142K — $158K *
General Dynamics
Remote
2 days ago
Application Support Engineer, Service Reliability Engineering
$78K — $125K *
Ciena
Remote
Reposted 2 days ago
Operations Facilitator
$62K — $141K *
Chantilly, VA 20152 (Loudoun County)
Reposted 4 days ago
Site Reliability Engineer II
$100K — $130K *
Bentley Systems
Exton, PA 19341 (Chester County)
6 days ago
Site Reliability Engineer
$111K — $160K *
Mizuho Financial
New York, NY 10025 (New York County)
3 weeks ago
Site Reliability Engineer
$90K — $130K *
Arctiq, Inc.
Norfolk, VA 23503 (Norfolk City County)
3 weeks ago

Get Ready For Your
Next Interview

More Jobs at Versana

Database Administrator
$90K — $130K *
New York, NY 10025 (New York County)
1 week ago
Enterprise Technology
Hybrid
SRE/DevOps Engineer
$120K — $150K *
New York, NY 10025 (New York County)
1 week ago
Information Technology
Hybrid
Client Support Manager
$90K — $120K *
New York, NY 10025 (New York County)
1 week ago
Finance & Insurance
Hybrid
Client Success Operations Associate
$110K — $130K *
New York, NY 10025 (New York County)
1 month ago
Finance & Insurance
Hybrid

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
Yesterday
Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
1 week ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Wiki Platform Engineer
$80K — $100K *
Valnet Inc.
Saint-laurent, QC H4K 1H9
Reposted Today
Data Center Technician
$62K — $112K *
Amazon
Hermiston, OR 97838 (Umatilla County)
Reposted Today

Find similar SRE/DevOps Engineer jobs:

Nationwide New York, NY

SRE/DevOps Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar SRE/DevOps Engineer jobs:

Get Ready For Your
Next Interview