Site Reliability Engineer

DMSi Software

• $90K — $120K *

Omaha, NE 68104In-Person

Information Technology

Less than 5 years of experience

Reposted 3 days ago

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Bachelor's degree in Computer Science or a related field, or equivalent experience.
3+ years of experience in Site Reliability Engineering or a similar role.
Strong experience with monitoring and observability tools such as Nagios, Prometheus, Grafana, and ELK Stack.
Proficiency in scripting languages like Python, Bash, or PowerShell.
Familiarity with cloud platforms like AWS, Azure, or GCP.

Responsibilities

Evaluate existing monitoring systems and implement improvements for comprehensive observability.
Monitor performance indicators to ensure a smooth user experience.
Refine alerting mechanisms to minimize false positives and improve incident management.
Analyze monitoring data to identify trends and provide actionable insights.
Collaborate with teams to align monitoring and alerting systems with business goals.
Develop automation scripts and tools to enhance monitoring processes.
Document systems and processes while providing training on tool usage and data interpretation.

Benefits

Opportunities for continuous professional development and improvement.
Collaboration with diverse teams across the organization.
Exposure to cutting-edge technologies in observability and monitoring.
Flexible office environment with normal office demands.

Full Job Description

As a Site Reliability Engineer, your primary responsibility will be to review, optimize, and complete the monitoring and alerting systems for our applications. You will work closely with development, operations, and product teams to ensure that our monitoring systems provide clear, actionable data and that our alerting mechanisms are finely tuned to detect issues before they impact our customers. Your work will be pivotal in transforming raw data into actionable intelligence, improving system observability, and enhancing the overall user experience.

RESPONSIBILITIES AND DUTIES:

Monitoring and Observability: Evaluate existing monitoring systems and implement improvements to ensure comprehensive observability across all systems and environments. Develop and maintain dashboards and reports that provide real-time visibility into system health, capacity/utilization trends, and performance.
User Experience: Ensure that the overall system environment operates nominally by monitoring critical performance indicators. Provide insights into system status that help maintain a smooth and uninterrupted user experience.
Alerting Optimization: Review and refine alerting mechanisms to minimize false positives and ensure timely and accurate notifications for critical issues. Develop escalation processes and response playbooks to streamline incident management.
Data Analysis and Insights: Analyze monitoring data to identify trends, anomalies, and potential areas of improvement. Provide actionable insights to relevant teams and drive data-driven decision-making leveraging machine learning and normal versus abnormal system behaviors.
Collaboration: Work closely with software engineers, DevOps teams, and other stakeholders to ensure monitoring and alerting systems are aligned with business goals and technical requirements.
Automation and Tooling: Develop and maintain automation scripts and tools to streamline monitoring and alerting processes, reducing manual effort and improving efficiency.
Documentation and Training: Document monitoring and alerting systems, processes, and best practices. Provide training and guidance to teams on how to use monitoring tools and interpret data.
Continuous Improvement: Continuously assess and improve monitoring and alerting strategies to adapt to changing technologies and business needs. Stay updated with industry trends and emerging tools in the observability space.

KNOWLEDGE, SKILLS, AND ABILITIES:Strong experience with monitoring and observability tools (e.g., Nagios, Prometheus, Grafana, ELK Stack, Datadog, New Relic).
Proficiency in scripting languages (e.g., Python, Bash, PowerShell) for automation.
Familiarity with cloud platforms (AWS, Azure, GCP) and hybrid cloud environments.
Understanding of infrastructure-as-code tools (e.g., Terraform, Ansible).
Knowledge of CI/CD pipelines and version control systems (e.g., Git, Jenkins).
Basic understanding of networking, security, and system administration.

EDUCATION AND EXPERIENCE:Bachelor's degree in Computer Science, Engineering, a related field, or equivalent experience.
Minimum of 3 years of experience in a Site Reliability Engineering or similar role, with a focus on monitoring and alerting in a SaaS environment.

WORK ENVIRONMENT AND PHYSICAL DEMANDS:Normal office environment with use of computers and telephone systems; no unusual physical demands.
Travel as needed, including business air travel and car rental.

* Ladders Estimates

Similar Jobs

Senior Mainframe Operations
$68K — $102K *
UST
Omaha, NE 68104 (Douglas County)
Today
Sr. Site Reliability Engineer (Omaha, NE)
$89K — $148K *
First National Bank of Omaha
Omaha, NE 68104 (Douglas County)
Today
Infrastructure Systems Engineer III or Sr - Linux
$100K — $130K *
Berkshire Hathaway Energy
Omaha, NE 68124 (Douglas County)
Reposted Today
Infrastructure Systems Engineer III or Sr - Linux
$100K — $130K *
Berkshire Hathaway Energy
Des Moines, IA 50317 (Polk County)
Reposted Today
Infrastructure Systems Engineer III or Sr - Linux
$100K — $130K *
Berkshire Hathaway Energy
Sioux City, IA 51108 (Woodbury County)
Reposted Today
Senior Engineer, Network
$90K — $120K *
Hubbell Inc
Centralia, MO 65240 (Boone County)
Today

Get Ready For Your
Next Interview

More Jobs at DMSi Software

Site Reliability Engineer
$90K — $120K *
Omaha, NE 68104 (Douglas County)
Reposted 2 days ago
Information Technology
In-Person
Site Reliability Engineer
$90K — $120K *
Omaha, NE 68104 (Douglas County)
Reposted 3 days ago
Information Technology
In-Person
Sr Data Engineer
$100K — $130K *
Omaha, NE 68104 (Douglas County)
6 days ago
Enterprise Technology
In-Person
Sr Software Engineer
$100K — $130K *
Phoenix, AZ 85032 (Maricopa County)
1 month ago
Information Technology
In-Person

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Customer Support
Confidential Company
Austin, TX 78701 (Travis County)
2 weeks ago
Sr Assoc, Cyber Sec ThreatMgmt - Detection Engineer
$88K — $151K *
Northern Trust
Naperville, IL 60540 (Dupage County)
Today
Global Director – Vulnerability Management & Security Configuration
$164K — $288K *
Northern Trust
Chicago, IL 60629 (Cook County)
Today

Find similar Site Reliability Engineer jobs:

Nationwide Omaha, NE

Site Reliability Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Site Reliability Engineer jobs:

Get Ready For Your
Next Interview