Job Description
Summary:
The HCI Platform Analyst is responsible for supporting and operating enterprise hyperconverged infrastructure (HCI) environments across data center and hybrid platforms. This role focuses on day-to-day operations, monitoring, provisioning, and troubleshooting across integrated stacks including compute, storage, and networking.
The ideal candidate is a hands-on operator with foundational HCI knowledge who can execute standard procedures, support infrastructure stacks end-to-end, and maintain stable, secure, and performant environments under established engineering standards.
Duties and Responsibilities:
- Support day-to-day operations of hyperconverged infrastructure (HCI) stacks, including compute nodes, storage nodes, and integrated networking components.
- Assist with building, deploying, and expanding HCI clusters under the guidance of senior engineers.
- Install, configure, and support hypervisor platforms (VMware ESXi/vCenter preferred) across HCI environments.
- Perform routine system administration tasks including patching, upgrades, monitoring, and lifecycle maintenance of HCI systems.
- Execute code currency and patching activities, ensuring environments remain compliant with vendor and security standards.
- Support provisioning and management of virtual machines, storage resources, and cluster configurations.
- Assist with management of Layer 2 switching, VLAN configurations, and basic network connectivity within HCI stacks.
- Support and troubleshoot infrastructure components including compute, storage, and network layers.
- Monitor infrastructure performance (CPU, memory, storage, latency) and escalate issues when thresholds are exceeded.
- Assist with OS installations, system builds, and server provisioning activities.
- Support license management tracking and validation for infrastructure platforms and software.
- Execute standard migration activities (VM migrations, storage moves, cluster workload balancing) following runbooks.
- Coordinate with vendors for support cases, troubleshooting, and hardware/software issue resolution.
- Provide support for observability and monitoring platforms, ensuring accurate alerting and visibility into system health.
- Perform remote hands activities, including coordination with data center teams for hardware tasks and troubleshooting.
- Support infrastructure services including DNS, VLANs, VIPs, and load balancing integrations required by application workloads.
- Follow operational runbooks for incident response, change implementation, and maintenance activities.
- Troubleshoot L2-level infrastructure issues and escalate complex problems with clear diagnostics.
- Maintain system documentation, runbooks, and inventory records.
- Participate in 24x7 on-call rotation, providing after-hours support for infrastructure incidents.
- Perform other duties as assigned.
Skills and Competencies:
- Foundational knowledge of hyperconverged infrastructure (HCI) concepts (compute + storage + networking stack).
- Working knowledge of VMware virtualization platforms (ESXi, vCenter).
- Basic understanding of server hardware, compute nodes, and storage integration within HCI environments.
- Familiarity with Layer 2 networking concepts, including VLANs and basic switching.
- Exposure to infrastructure services such as DNS, VIPs, and load balancing.
- Understanding of infrastructure lifecycle management (patching, upgrades, firmware updates).
- Basic understanding of infrastructure monitoring and observability tools.
- Exposure to automation and scripting (PowerShell, Python, or similar) is a plus.
- Strong troubleshooting and problem-solving skills in a production support environment.
- Ability to follow standard operating procedures and execute tasks with discipline.
- Strong attention to detail and ownership in execution of operational tasks.
- Ability to collaborate across infrastructure, network, and application teams.
- Willingness to learn and grow into deeper infrastructure engineering responsibilities.
- Experience with Dell VxRail or HPE dHCI.
Minimum Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or equivalent experience.
- 2 to 5+ years of experience in infrastructure operations, system administration, or data center support roles.
- Hands-on experience with virtualization platforms (VMware preferred).
- Exposure to enterprise infrastructure environments (compute, storage, or networking).
Preferred Qualifications:
- Exposure to hyperconverged infrastructure platforms (Nutanix, VMware vSAN, or similar).
- Experience with server provisioning, OS installation, and lifecycle management.
- Familiarity with VLANs, DNS, and basic networking concepts.
- Exposure to monitoring, observability, or infrastructure tools.
- Basic automation or scripting experience.
- Familiarity with ITIL processes (incident, change, problem management).
Required Licensing or Certifications:
- HPE or Dell/EMC certifications
- Cloud certifications
- HCI certifications
Working Conditions:
- May require after-hours, weekend, or off-shift work during incidents, maintenance windows, or critical business events.
- Participation in a rotating 24x7 on-call support model is required.
- Hybrid schedule (4:1): four days on-site and one day working from home.
Travel:
- Light travel, 10%