The main purpose of this position is to implement and maintain a comprehensive Enterprise Monitoring solution supporting high availability services through proactive and predictive monitoring and alerting. The Enterprise Monitoring Engineer will assist and work with all IT departments and third party vendors to produce and document monitoring schema for network infrastructure, server hardware, OS, applications, and business processes. The Enterprise Monitoring Engineer will also identify and implement system monitoring tools across all IT systems. The Enterprise Monitoring Engineer will be focused on aligning the selection and utilization of monitoring and automation technologies to a comprehensive solution encompassing complete business transactions. This person will map the infrastructure elements and applications which ultimately execute the business functions which deliver services to our employees and guests. The Enterprise Monitoring Engineer will also be responsible for providing management reports that track key metrics and measurements and trending and notify the Enterprise Monitoring Analyst(s) of threshold changes. This position implements and maintains monitoring technology in a 24x7 enterprise production environment. The Enterprise Monitoring Engineer must have knowledge of ITIL principles and practices.
ROLES AND RESPONSIBILITIES:
- Provide system engineering for Enterprise Monitoring Systems (BMC Product Line, OEM and SCOM) including systems architecture, monitoring strategy, operational deployments, application design and maintenance/administration.
-Engage with subject matter expert teams ranging from network to applications to define, deploy and maintain system and service monitors.
-Work with other IT departments and vendors to plan and implement new features, enhancements, and upgrades.
-Document supporting policies, processes and procedures
-Provide training as needed to Enterprise Monitoring Analysts and operations team regarding alarm correlation and threshold setting.
-Assists in the installation, maintenance, and general support of monitoring systems
-Routinely review monitoring systems and services to ensure stability and security.
-Assist in interpretation of diagnostic data obtained from monitoring solutions.
Provide escalation support to Implementation team for standard monitoring implementation.
-Provide implementation support for custom monitoring orders.
-Oversee the rollout of monitoring software updates
-Oversee the rollout of new and updated monitoring scripts
-Manage the installation of new software releases, system upgrades, and patch installs that resolves monitoring related software problems
-Participates as an Enterprise Monitoring resource on Business and IT projects.
-Provide planning and monitoring engineering guidance to support teams.
-Identify, diagnose, and resolve technical monitoring problems.
-Serve as the system administrator for all Enterprise monitoring systems.
-Serve as a focal point for analysis of Enterprise Monitoring data, and collection and reporting on Enterprise Monitoring Key Performance Indicators.
-Provide technical consultation to individual contributors and customers in areas of expertise.
-Provide Capacity, Performance and Availability reports for assigned systems.
-Define and recommend monitoring standards for fault-detection, availability, capacity and performance trending for assigned applications and services.
-Develop and distribute trend reports detailing availability, performance & capacity metrics.
-Research/Design new monitors that meet the needs of the engineering teams.
-Provide regular reports on key metrics, measurements, and trending.
-Notify Monitoring Analyst(s) of threshold adjustments based on monitoring data collected.
-Lead meetings as it relates to monitoring/alerting and the DSM teams processes. Meetings include but are not limited to Change Management, Problem Management, Knowledge Management and Incident management.
REQUIRED TECHNICAL SKILLS:
-Minimum 7 years system administration of Enterprise Monitoring Systems.
-Minimum 7 years of experience in Server Management products, i.e.: BMC ProactiveNet / Patrol, OEM and Microsoft SCOM.
-Minimum 7 years UNIX administration experience (including Solaris and RedHat Linux) in an enterprise environment.
-Minimum 7 years Windows administration experience in an enterprise environment.
-Minimum 7 years networking experience in an enterprise environment.
-Advanced knowledge of Enterprise Monitoring metrics, reporting, logging and best practices.
-Experience with SNMP traps.
-Ability to perform network traffic analysis using network capture tools.
-Ability to generate management level reports