Sr Data Center Operations Engineer

Milestone Technologies • $75K — $95K *

Saint-jerome, QC J5L 0A1In-Person

Information Technology

5 - 7 years of experience

Reposted Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of experience in data center operations or related infrastructure environments
Significant hands-on experience with server hardware troubleshooting and repair
Minimum of 2 years of Linux experience in production settings
Demonstrated capability in root cause analysis for complex hardware issues
Strong organizational and documentation skills

Responsibilities

Diagnose and resolve complex hardware failures across server platforms
Utilize Linux command-line tools for system diagnostics and troubleshooting
Support hardware installation and infrastructure validation
Serve as a primary escalation point for complex infrastructure issues
Mentor junior technicians on hardware troubleshooting best practices

Benefits

Exposure to large-scale, modern infrastructure environments
Clear path for progression into advanced technical or engineering roles
Opportunities for continuous improvement initiatives
Participation in on-call rotation supporting 24x7 operations
Collaboration with cross-functional teams

Full Job Description

Job Overview

Le Québécois:

Ingénieur senior en opérations de centre de données

Aperçu du poste:
L'ingénieur senior en opérations de centre de données joue un rôle clé et très pratique dans le déploiement et l'exploitation à long terme d'un environnement de centre de données haute performance à grande échelle, supportant du calcul avancé et des infrastructures complexes.

Ce poste s'adresse à un ingénieur expérimenté possédant une expertise approfondie en matériel serveur, systèmes Linux et opérations de centre de données, dans des environnements qui exigent une haute disponibilité, de la précision et de la performance. Tu contribueras dès la phase initiale de déploiement, en supportant la mise en service de l'infrastructure, la validation et la préparation du matériel. Par la suite, lorsque l'environnement sera en mode stable, tu prendras en charge la fiabilité continue, le dépannage avancé et les initiatives d'amélioration continue.

Ce rôle demande un solide esprit d'opération - quelqu'un qui s'épanouit dans des environnements complexes et critiques pour la production et qui met l'accent sur la résolution des problèmes à la source. Tu agiras comme principal point d'escalade technique et travailleras en étroite collaboration avec les équipes d'ingénierie et d'infrastructure pour assurer la stabilité et la performance des systèmes.

Ce poste s'exerce principalement en anglais, en raison des interactions fréquentes avec des équipes interfonctionnelles globales et de l'utilisation de documentation technique en anglais.

Ce rôle permet de couvrir à la fois les phases de déploiement et d'opérations, offrant une exposition à des environnements d'infrastructure modernes à grande échelle, ainsi qu'un chemin clair vers des rôles techniques avancés ou en ingénierie.

Responsabilités principales:

Dépannage avancé du matériel et réparations

Diagnostiquer et résoudre des défaillances matérielles complexes sur différentes plateformes serveurs (cartes mères, CPU, mémoire, stockage)
Effectuer des réparations et remplacements au niveau des composantes
Exécuter les processus break/fix en minimisant les temps d'arrêt et en respectant les SLA
Réaliser des analyses de cause racine (RCA) et proposer des améliorations préventives
Identifier les tendances de défaillance et contribuer à l'amélioration des outils, de l'automatisation et des processus

Systèmes Linux et support des plateformes

Utiliser les outils en ligne de commande Linux pour le monitoring, le diagnostic et le troubleshooting
Supporter le déploiement et la configuration de serveurs sous différentes distributions Linux (RHEL, Ubuntu, etc.)
Diagnostiquer les problèmes au niveau du démarrage (boot) et du système d'exploitation en environnement de production
Collaborer avec les équipes d'ingénierie pour résoudre des problèmes complexes matériel/logiciel

Opérations de centre de données

Supporter l'installation d'équipements, le câblage structuré et la validation de l'infrastructure
Maintenir un inventaire précis des pièces, des actifs et des équipements retirés
Documenter les réparations, changements et configurations dans les outils ITSM/DCIM
Assurer le respect des normes de sécurité, sûreté et opérations
Agir comme point d'escalade principal pour les incidents complexes
Participer à une rotation de garde (on-call) pour un environnement 24/7

Collaboration et mentorat

Offrir du coaching et du mentorat aux techniciens sur le troubleshooting matériel et les meilleures pratiques
Collaborer avec les équipes réseau, stockage et infrastructure pour résoudre des enjeux interfonctionnels
Contribuer au partage de connaissances, à la documentation et à l'excellence opérationnelle
Participer aux initiatives d'amélioration continue (processus, outils, pratiques opérationnelles)

Compétences requises:

Maîtrise de l'anglais (oral et écrit) requise afin de communiquer efficacement avec des équipes interfonctionnelles globales et de consulter de la documentation technique principalement en anglais
Expertise avancée en architecture matérielle serveur et dépannage au niveau des composantes
Excellente maîtrise des systèmes Linux et des outils de diagnostic en ligne de commande
Bonne compréhension des bases en réseautique et des composantes d'infrastructure
Expérience dans des environnements opérationnels structurés (SOP, SLA, systèmes de billets)
Familiarité avec les outils ITSM/DCIM (ServiceNow, Jira ou équivalent)
Expérience en câblage structuré et connectivité fibre optique
Excellentes capacités d'analyse et de résolution de problèmes, avec souci du détail
Capacité à performer dans des environnements critiques et sous pression
Fortes compétences organisationnelles et en documentation

Expérience requise

5+ ans d'expérience en opérations de centre de données ou environnement d'infrastructure similaire
Expérience significative en dépannage et réparation de matériel serveur
Minimum 2 ans d'expérience avec des systèmes Linux en environnement de production
Expérience avec des plateformes serveurs d'entreprise et des environnements d'infrastructure
Expérience démontrée en analyse de cause racine (RCA) et résolution de problèmes complexes
Expérience avec des systèmes de billets et des processus opérationnels
Expérience en déploiement ou mise à niveau d'infrastructure de centres de données (un atout)

Certifications (un atout)

CompTIA A+, Server+ ou Linux+
Certification LPI ou équivalent
Certifications spécifiques aux fournisseurs de matériel

Exigences physiques

Capacité de soulever et déplacer des équipements jusqu'à 50 lb
Capacité de travailler dans un environnement à température contrôlée avec un niveau de bruit modéré
Capacité d'effectuer des tâches physiques (debout, marche, flexion, position à genoux) pendant de longues périodes

English:

Senior Data Center Operations Engineer

Job Overview:
The Senior Data Center Operations Engineer plays a critical, hands-on role in supporting the build-out and long-term operation of a high-performance, enterprise-scale data center environment supporting advanced compute and large-scale infrastructure deployments.

This position is designed for an experienced engineer with deep expertise in server hardware, Linux systems, and data center operations, operating within environments that demand high availability, precision, and performance. You will contribute during the initial deployment phase, supporting infrastructure bring-up, validation, and hardware readiness. As the environment transitions into steady-state operations, you will take ownership of ongoing reliability, advanced troubleshooting, and continuous improvement initiatives.

This role requires a strong operator mindset-someone who thrives in complex, production-critical environments and takes pride in resolving issues at their root. You will serve as a primary technical escalation point, working closely with engineering and infrastructure teams to maintain system stability and performance.

You will collaborate with cross-functional teams, making clear and professional communication in English (written and verbal) essential for success in this role.

This role offers continuity across both deployment and operational phases and provides exposure to large-scale, modern infrastructure environments, with a clear path for progression into advanced technical or engineering roles.

Key Responsibilities:

Advanced Hardware Troubleshooting & Repair

Diagnose and resolve complex hardware failures across server platforms (motherboards, CPUs, memory, storage)
Perform component-level repairs and replacements on servers and data center hardware
Execute break/fix processes with a focus on minimizing downtime and meeting SLAs
Conduct root cause analysis (RCA) of hardware failures and implement preventative improvements
Identify recurring failure trends and contribute to tooling, automation, and process enhancements

Linux Systems & Platform Support

Utilize Linux command-line tools for system monitoring, diagnostics, and troubleshooting
Support provisioning and deployment of servers across Linux distributions (RHEL, Ubuntu, etc.)
Troubleshoot boot-level and OS-level issues in production environments
Collaborate with engineering teams to resolve complex hardware/software interaction issues

Data Center Operations

Support hardware installation, structured cabling, and infrastructure validation
Maintain accurate inventory of spare parts, assets, and retired equipment
Document repairs, changes, and configurations in ITSM/DCIM systems
Ensure adherence to safety, security, and operational protocols
Serve as a primary escalation point for complex infrastructure issues
Participate in on-call rotation supporting 24x7 operations

Collaboration & Mentorship

Provide guidance and mentorship to technicians on hardware troubleshooting and best practices
Collaborate with network, storage, and infrastructure teams to resolve cross-functional issues
Contribute to knowledge sharing, documentation, and operational excellence initiatives
Support continuous improvement efforts across processes, tooling, and operational workflows

Required Skills:

Strong English communication skills (written and verbal) are required for coordination with cross-functional teams
Expert-level knowledge of server hardware architecture and component-level troubleshooting
Strong proficiency with Linux systems and command-line diagnostics
Solid understanding of networking fundamentals and infrastructure components
Experience working within structured operational environments (SOPs, SLAs, ticketing systems)
Familiarity with ITSM/DCIM tools (ServiceNow, Jira, or similar)
Experience with structured cabling and fiber optic connectivity
Strong analytical and problem-solving skills with attention to detail
Ability to operate effectively in high-pressure, high-availability environments
Strong organizational and documentation skills

Required Experience:

5+ years of experience in data center operations or similar infrastructure environments
Significant hands-on experience with server hardware troubleshooting and repair
Minimum of 2 years of experience working with Linux operating systems in production environments
Experience supporting enterprise server platforms and infrastructure environments
Demonstrated experience performing root cause analysis and resolving complex hardware issues
Experience working within ticketing systems and operational workflows
Exposure to data center build-outs, deployments, or infrastructure upgrades (preferred)

Preferred Certifications:

CompTIA A+, Server+, or Linux+
LPI certification or equivalent
Vendor-specific hardware certifications

Physical Requirements:

Ability to lift and move equipment up to 50 lbs
Ability to work in a temperature-controlled environment with moderate noise levels
Ability to perform physical tasks such as standing, walking, bending, and kneeling for extended periods

Compensation

Estimated Pay Range:

Exact compensation and offers of employment are dependent on circumstances of each case and will be determined based on job-related knowledge, skills, experience, licenses or certifications, and location.

About Milestone Technologies

Continental Aerospace Technologies is an aircraft engine manufacturer located at the Brookley Aeroplex in Mobile, Alabama, United States. It was originally spun off from automobile engine manufacturer Continental Motors Company in 1929 and owned by Teledyne Technologies from 1969 until December 2010. The company is now part of Aviation Industry Corporation of China, which is a Government of the People's Republic of China state-owned aerospace company headquartered in Beijing. Although Continental is most well known for its engines for light aircraft, it was also contracted to produce the air-cooled V-12 AV-1790-5B gasoline engine for the U.S. Army's M47 Patton tank and the diesel AVDS-1790-2A and its derivatives for the M48, M60 Patton, and Merkava main battle tanks. The company also produced engines for various independent manufacturers of automobiles, tractors, and stationary equipment from the 1920s to the 1960s.

Learn more about Milestone Technologies

Industry

Technical Services

Founded

1997

* Ladders Estimates

Similar Jobs

Robotics Forward Deployed Engineer
$80K — $160K *
Tutor Intelligence
Watertown, MA 02472 (Middlesex County)
Today
Spécialiste ou ingénieur/ingénieure de systèmes
$75K — $95K *
Rheinmetall
Saint-jean-chrysostome, QC G6Z 0A1
Reposted Today
National Lead (Automation)
$86K — $173K *
Abbott
Remote
Reposted Today
Network Engineer
$70K — $95K *
Rockland Trust Company
Lowell, MA 01852 (Middlesex County)
Today
Diesel Technician
$79K — $87K *
Blasius Chevrolet Cadillac
Waterbury, CT 06708 (Naugatuck Vly County)
Reposted Today
Deployment Engineer - ERP
$70K — $95K *
Tyler Technologies
Yarmouth, ME 04096 (Cumberland County)
Today

Get Ready For Your
Next Interview

More Jobs at Milestone Technologies

Sr Data Center Operations Engineer
$75K — $95K *
Saint-jerome, QC J5L 0A1
Reposted Today
Information Technology
In-Person
Senior Data Center Operations Engineer
$97K — $141K *
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Today
Information Technology
In-Person
Data Center Technician L3
$70K — $101K *
Sunnyvale, CA 94087 (Santa Clara County)
Reposted Today
Information Technology
In-Person
Senior Data Center Operations Engineer
$80K — $110K *
Saint-jerome, QC J5L 0A1
Reposted Today
Information Technology
In-Person
Data Center Manager
$100K — $130K *
Minneapolis, MN 55407 (Hennepin County)
Reposted Today
Information Technology
In-Person

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Principal Software Engineer
$155K — $185K *
StoneX Group Inc.
Birmingham, AL 35242 (Shelby County)
Today
Senior Control Engineer
$175K — $200K *
Rockefeller Capital Management
New York, NY 10025 (New York County)
Today
Senior Security Engineer
$175K — $200K *
Rockefeller Capital Management
New York, NY 10025 (New York County)
Today
Lead Software Configuration
$116K — $196K *
AT&T
Dallas, TX 75217 (Dallas County)
Reposted Today

Find similar Sr Data Center Operations Engineer jobs:

Nationwide Saint-jerome, QC

Sr Data Center Operations Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Sr Data Center Operations Engineer jobs:

Get Ready For Your
Next Interview