Technical Lead - Server and Compute Platform Engineer

Hewlett Packard Enterprise Company

Austin, TX

Industry: Technology

11 - 15 years

Posted 177 days ago

This job is no longer available.

The Technical Lead - Server and Compute Platform Engineer position in the HPE Global IT organization is being created with overall responsibility for designing and solutioning global infrastructure compute platforms to meet evolving business needs. As the technical lead for this group, you will build upon and showcase our state-of-the-art technology and craft the processes HPE needs to operate, innovate, and grow. You will play a key role in solutioning and deploying HPE’s Global IT infrastructure technology roadmap and in its ongoing lifecycle support.

This position reports to the Senior Director of Infrastructure Services and combines the roles of lead engineer and technical architect. It requires a thorough understanding of, and sufficient background in, server technologies, compute, enterprise architecture, and operations to lead a team of employee and contractor subject matter experts.

Duties and Responsibilities:

  • Participates as a member of and leads cross-functional engineering teams. Performs analysis of cross-functional and complex business requirements. Designs complex cross-functional solutions for others to build. Provides mentoring and guidance to other engineers.
  • A preponderance of time is spent in strategic and creative problem solving. Demonstrates broad technical leadership, impacting significant technical direction; exerts influence outside of immediate team and drives change.
  • Applies in-depth or broad technical knowledge to manage global maintenance services across various technology areas (e.g. Compute High-Availability Administration) or functions. Performs solution design.
  • Applies company and third-party technologies and leads the design of highly complex infrastructure and software solutions, while driving innovation.
  • Independently implements end-user or enterprise infrastructure or services of significant complexity.
  • Integrates technical expertise and business understanding to create superior solutions for the company and customers. Consults with team members and other organizations, customers, and vendors on complex issues. Mentors others in the technology community; may publish or otherwise engage professionally outside of the company.
  • Lead the design and solutioning of production, staging, QA and development IT Infrastructures running in 24x7 environments
  • Support the enterprise architects by contributing subject matter expertise to architectural strategies, roadmaps with latest industry trends, technology standardization, modernization and virtualization
  • Acts as the primary interface with the business and application teams to consult on and deploy infrastructure solutions and services that meet their needs and requirements
  • Analyze business and technical requirements to determine detailed system design, potential issues, and related cost for each project request
  • Develop implementation and migration strategies that preserve the availability, performance, integrity, stability, and scalability of systems, based on guidance from the enterprise architect
  • Lead the successful delivery of complex IT Infrastructure projects from planning and detailed design through implementation
  • Design high-availability systems through high-availability zones or site-to-site HA and DR solutions for application hosting
  • Design disaster recovery compute solutions with optimized automated failover to meet business continuity targets
  • Develop and maintain detailed design documentation for deployed IT infrastructure
  • Lead the solutioning of complex, cross-functional issues that cross technologies in server, storage, O/S, virtualization, operations, and security
  • Implement and deliver high-availability compute platforms to support “container-as-a-service” (CaaS), “infrastructure-as-a-service” (IaaS) and “platform-as-a-service” (PaaS).
  • Develop and implement compute and infrastructure services, including management, monitoring, automation, backup and tooling
  • Design and implement new server and compute services, as well as providing capacity planning and management of the existing infrastructure.
  • Provide 24x7 technical support of global server, cloud, and management components in both a proactive and reactive manner, ensuring systems are stable and performing within the terms of their respective Service Level Agreements.
  • Support the build, deployment, and ongoing operation of Active Directory, enterprise directory, Windows servers, Linux, UNIX, DDI (DNS, DHCP, IPAM), and service proxy components.
  • Manage the technical team and vendors for infrastructure services on premises and in the cloud.
  • Design and implement near- and long-term strategy to ensure that platform capacity and performance meet existing and future requirements.
  • Guide and provide work direction to a team of employee and contract operations and project staff IT subject matter experts.
  • Provide technical expertise in the areas of servers and compute, and serve as an escalation resource for all technical, delivery, and execution needs.
  • Coordinate and, upon approval, execute adjustments and changes that increase performance and availability.
  • Provide platform design, definition and coordination of standards, project management, and lead technology research for infrastructure and cloud related platforms.
  • Assist with vendor supported projects and ongoing support.
  • Review capacity utilization on an ongoing basis to align with budget planning.
  • Maintain up-to-date system and platform specific documentation and inventories across the enterprise.
  • Assist with ensuring availability and optimal performance of services through continuous assessment of operations, including incident, change, and problem management.
  • Assess technical risk and technical debt and provide timely mitigation and remediation plans.
  • Assist with evaluation of new infrastructure technology and tools and build business cases for their use.
  • Oversees the technical design documentation process for correctness and timeliness.
  • Participate in cross-functional project reviews and identify high-risk areas.
  • Contribute to and maintain system standards including Change Management.
  • Research and recommend possible automated approaches for administration tasks.
  • Ensure daily system monitoring, verify integrity and availability of all compute services hardware, resources, systems and key processes, and review system and application logs.
  • Assist the infrastructure council in designing, documenting, and publishing standards, policies, and guidelines for hardware, application hosting, and disaster recovery design and testing

Requirements:

  • Technical Bachelor’s degree or equivalent experience and a minimum of 10 years of related experience or a Master’s degree and a minimum of 8 years of experience.
  • 10 years of experience with broad and deep technical knowledge of compute services and server platform design and implementation, and ongoing operations for global scale organizations.
  • 5 years of experience managing outsourced vendors in infrastructure or a closely related IT scope, for both project and steady-state services.
  • Experience implementing multi-tier application hosting and automated provisioning on premises and in the cloud
  • Experience in best practices of SDLC methodologies such as Agile, Scrum, and Waterfall, and of DevOps/Cloud processes
  • Knowledge of Build/Release/Deployment/Operations (DevOps) engineering
  • Strong understanding across Cloud and compute infrastructure components (server, storage, network, data, and applications) to deliver compute platforms and IaaS
  • Experience with implementation of hypervisor technologies (VMware and OpenStack) and OS administration
  • Experience in Data Center Infrastructure design – a thorough understanding of physical/virtual layout and relationships between server, network & SAN infrastructure, power requirements, and server racking.
  • Experience with blade systems, hyperconverged systems, and mid-range and high-end HPE compute platforms.
  • Experience with out-of-band management, jump stations, and console models – this includes iLO access and configuration for servers
  • Experience with OneView setup – this includes both Synergy integrated OV as well as standalone OV systems that manage other systems.
  • Experience with Image Streamer setup – basic setup to bring an I3S system online and ready for systems to be imaged.
  • Experience with Server LAN interface configuration for Linux and Windows systems.
  • Project management skills – working with team management and members to ensure timelines and project goals are achieved. Applies a high level of critical thinking and problem solving to address any problems that arise during the build phase
  • Proven ability to develop documented target infrastructure designs, standards, and guiding principles
  • Experience with OS installation & patching, firmware/driver updates and building strategies to mitigate vulnerabilities
  • Experience creating hardware bills of materials
  • Hands-on and team management experience with Active Directory, Windows and Linux operating systems, certificates, and related components.
  • Experience with storage arrays, 3PAR, Nimble, and LUN deployment and configuration
  • A self-starter with a strong interest in technology and its practical application along with the ability to deal with a fast-paced environment and ambiguity.
  • Strong understanding of technical troubleshooting methodology.
  • Strong understanding of usage and performance management.
  • Ability to facilitate problem solving among administrative groups with varying needs and priorities, and to communicate well with administrative users, technical staff, and senior management.
  • Strong team building, leadership, coaching and mentoring skills.
  • Excellent oral, written, and interpersonal communication and presentation skills across organizational boundaries.
  • Considerable experience with multi-site, global datacenter rollouts, transformations and migrations.
  • Strong customer service skills.
  • Ability to work with a range of technical staff to develop joint solutions in project and ongoing support situations.
  • Ability to achieve efficiencies through workflow improvement.
  • Preferred: Knowledge of and experience with Azure, AWS, and cloud management, monitoring, and security.

Job ID 1027899