SRE - Dynatrace / Splunk / AWS

Zeektek

• $80K — $120K *

Chesterfield, MO 63017In-Person

Information Technology

Less than 5 years of experience

More than 3 months ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science)
4-6 years of related experience
Proficiency in AWS, Kubernetes, and application monitoring
Experience with disaster recovery and SRE practices
Familiarity with observability tools like Dynatrace and Grafana

Responsibilities

Build and improve disaster recovery capabilities for Tier 1 applications
Review and maintain application design and architecture documents
Implement and maintain disaster recovery capabilities with development teams
Participate in disaster recovery testing exercises for continuous improvement
Lead complex projects focused on observability and monitoring

Benefits

Opportunity for contract to hire
Contribution to a key IT disaster recovery initiative
Engagement with Tier 1 application management
Collaboration with various engineering teams
Professional development in Site Reliability Engineering practices

Full Job Description

We have contract to hire position open for a Dynatrace engineer who has a background setting up monitoring, observability and dashboards for AWS, Kubernetes, Applications. This is part of a Disaster Recovery team so the candidate should know disaster recovery and SRE.

Walk me through the day-to-day responsibilities of this the role:
These resources will be working on building and improving the disaster recovery (DR) capabilities of Tier 1 applications. Common responsibilities will include:
Building, reviewing and maintaining application design and architecture documents.
Ensuring the DR capabilities are built into each system.
Working with development teams to implement and maintain the DR capabilities.
Participate in DR testing exercises and evaluate the results for continuous improvement.

Job details:
Helps lead projects that are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes and continuous delivery(CI/CD) tools, processes, and designs.

Develops complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability issues and incidents.

Understands and advocates for standardized and scalable software tools to ensure that systems operate without interruption at optimum performance and leads project teams through out the deployment process.

Troubleshoots and analyzes service disruptions to determine the root cause of issues and develop solutions for improved reliability.

Education and Experience:
• A Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science).
• Requires 4 - 6 years of related experience.

Essential Functions:
• Troubleshoots and resolves more complex problems with systems and services and initiates regular deployment of new versions of the systems and their subcomponents
• Leads more complex projects focused on building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility.
• Helps make decisions around periodic system validation and testing, service monitoring, and standing up new services/tools
• Uses knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization
• Identifies and implements necessary manual and automated procedures for improved collaborative response in real-time
• Leads lower level Engineers in stress, security, and performance testing
• Resolves issues that come up through support escalation
• Keeps documentation and runbooks up to date to effectively deal with new incidents that might arise
• Leads post incident reviews and documents findings for future informed decision making
• Reviews proposals to optimize Software Development Life Cycle (SDLC) to boost service reliability and makes decisions around which proposals should move forward.
• Communicates complex topics with development teams to investigate and document issues and leads internal team to develop solutions to mitigate them

What previous job titles or background work will in this role?
Site Reliability Engineer
Disaster Recovery Engineer
System Support Engineer
Application Architect
Cloud Systems Engineer

Any future projected positions potentially coming up? YES If yes, note: This is the board approved IT DR initiative project funded under project IDs:
IT DRR Program,
DR Modernization, and
Disaster Recover and Resilience.
Internal/External Groups with which the Candidate will interface: Required Skills/Experience: Preferred Skills/ Experience: 1. AWS, Route 53, Lambda, Mongo DB, Kafka, Kubernetes 1. Rancher, Axway API Gateway, 2. Load Balancing / Load Redirecting / Load Restricting strategies 2. 3. Monitoring and Observability tools such as Prometheus, Grafana, Dynatrace, Splunk, Elk 3. Education Requirement: Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science). Education Preferred:

* Ladders Estimates

Similar Jobs

RN- MD Live Urgent Care-Remote
$64K — $97K *
Remote
Reposted Today
Technical Training Lead Analyst, Instructional Designer - Evernorth - Remote
$71K — $118K *
Cigna
Remote
Today
Adoption Design Manager - Business Analytics - Evernorth - Remote
$108K — $181K *
Cigna
Remote
Today
Strategy and Business Development Senior Advisor - Independent Pharmacy Affairs - Evenorth - Hybrid
$100K — $130K *
Cigna
St. Louis, MO 63129 (Saint Louis County)
Today
Business Analytics Advisor, Pricing Services- CuraScript SD- Hybrid
$98K — $163K *
Cigna
Remote
Today
Network Operations Advisor - Evernorth - Remote
$79K — $132K *
Cigna
Remote
Today

Get Ready For Your
Next Interview

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Customer Support
Confidential Company
Austin, TX 78701 (Travis County)
2 weeks ago
Sr Assoc, Cyber Sec ThreatMgmt - Detection Engineer
$88K — $151K *
Northern Trust
Naperville, IL 60540 (Dupage County)
Today
Global Director – Vulnerability Management & Security Configuration
$164K — $288K *
Northern Trust
Chicago, IL 60629 (Cook County)
Today

Find similar SRE - Dynatrace / Splunk / AWS jobs:

Nationwide Chesterfield, MO

SRE - Dynatrace / Splunk / AWS

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar SRE - Dynatrace / Splunk / AWS jobs:

Get Ready For Your
Next Interview