Deloitte

HPC AI Solution Architect (S2S)

Deloitte$141K — $278K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 10+ years in infrastructure architecture or engineering for large-scale platforms
  • 4+ years with GPU-accelerated platforms for AI or ML
  • 3+ years of Linux system administration in production
  • 3+ years designing distributed compute clusters for AI in hybrid cloud
  • 2+ years with high-performance networking or storage for AI/HPC
  • 2+ years building containerized platforms using Kubernetes
  • 2+ years automating infrastructure as code with Terraform or Ansible
  • Experience in pre-sales or sales engineering for technology solutions

Responsibilities

  • Lead architecture for pursuits, including requirements and design
  • Define reference architectures for GPU platforms across environments
  • Drive architecture trade-offs on performance and risk
  • Own technical solution strategies in proposals and RFPs
  • Facilitate client workshops and technical reviews
  • Architect innovative technology solutions focused on business outcomes
  • Engage with C-Suite client leadership during sales discussions
  • Support go-to-market strategies, including industry events

Benefits

  • Broad range of employee benefits to support well-being
  • Opportunities for professional growth and development
  • Participation in discretionary annual incentive program
  • Exposure to cutting-edge AI technologies
  • Collaborative and innovative work environment
Full Job Description
Lead Cloud Integrated Infra Engineer- AI Infrastructure(S2S)

As a Lead Cloud Integrated Infra Engineer on the Silicon2Service team in Deloitte's AI & Engineering practice, you will design and drive deployment of fully integrated architectures for GPU-accelerated AI factories and high-performance computing infrastructure in close partnership with Deloitte AI specialists and our ecosystem partners. You will shape end-to-end solutions-from discovery and reference architecture mapping through sizing and implementation. You will partner with Sales Executives, AI application specialists, delivery engineering, and managed services to help clients achieve measurable outcomes from private AI assets. You will lead technical solution strategy for pursuits and active opportunities and translate complex client needs into clear, complete solutions and delivery requirements.

Recruiting for this role ends on 6/26/2026.

Work you'll do
As a Lead Cloud Integrated Infra Engineer on the Silicon2Service team, you will be responsible for:
  • Leading architecture for pursuits and active opportunities, including discovery, requirements, constraints, and target-state design
  • Creatively defining reference architectures for on-premises, cloud, and hybrid GPU platforms across compute, network, storage, security, software and operations
  • Driving architecture trade-offs and decisions across performance, scalability, reliability, locality, total cost of ownership, time-to-value, and risk
  • Owning the technical solution strategy in proposals and RFPs, including architecture narrative, assumptions, dependencies, sizing guidance, and delivery approach
  • Facilitating client workshops and technical reviews and translating engineering detail into executive-ready communications
  • Architecting complex, innovative technology solutions with a focus on business outcomes, cost of quality, and long-term scalability and sustainability.
  • Engaging with C-Suite client leadership during sales and delivery, including leading technical pre-sales discussions, shaping proposals, and supporting the closing of new business opportunities
  • Supporting go-to-market strategies, including participation in industry events, conferences, and client briefings
The Team

The Silicon to Service team at Deloitte delivers end-to-end AI factories and advanced technology services that help organizations build, deploy, and operate large-scale, private AI and data platforms. We enable the next phase of enterprise AI adoption through private AI economics with cloud-like ese of use. Join this unique opportunity to work on innovative AI platforms and emerging technologies in the rapidly evolving AI market while solving complex enterprise problems for some of the world's largest organizations.

Qualifications

Required:
  • 10+ years of experience in infrastructure architecture or engineering for large-scale platforms including design, implementation, operations, and optimization.
  • 4+ years designing or delivering GPU-accelerated platforms for AI, ML, or high-performance computing
  • 3+ years Linux system administration in production environments
  • 3+ years designing or operating distributed compute clusters for AI/HPC in hybrid cloud setups, including multi-GPU topologies, partitioning, scheduler integration, and scalability for edge-to-cloud workloads.
  • 2+ years with high-performance networking or storage for AI/HPC
  • 2+ years building containerized platforms using Kubernetes or Red Hat OpenShift, including GPU operators/drivers, CUDA container runtime, and cluster lifecycle automation
  • 2+ years automating infrastructure as code(IaC) with tools like Terraform and Ansible
  • At least 2 end-to-end deployments of reference architectures in the cloud or on-prem, including variants with security controls, network segmentation, operational runbooks, and validation testing
  • Experience in pre-sales or sales engineering, including discovery, solution demonstrations, and proposal/RFP contributions
  • Ability to travel 50%, on average, based on the work you do and the clients and industries/sectors you serve.
  • Limited immigration sponsorship may be available.
Preferred:
  • 2+ years implementing AI/HPC cluster scheduling (Slurm and Kubernetes), including multi-tenant queues, quotas, and GPU-aware policies
  • 2+ years supporting generative AI infrastructure patterns, including multi-node distributed training
  • Experience with AI agents and frameworks
  • Experience with high-throughput storage for AI/HPC
  • Experience executing NVIDIA co-sell motions with OEMS (Dell, HPC, Lenovo), CSPs ( AWS, Azure, Google Cloud), or independent software vendors ( Run:ai, OpenShift, Weights & Biases)
The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Deloitte, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $141,200 to $278,300.

You may also be eligible to participate in a discretionary annual incentive program, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.

Benefits

At Deloitte, we know that great people make a great organization. We value our people and offer employees a broad range of benefits. Learn more about what working at Deloitte can mean for you.

Requisition code: 350635

Job ID 350635

About Deloitte

Deloitte is a multinational professional services network that provides audit, tax, consulting, enterprise risk and financial advisory services. The company was founded in London in 1845 and has since grown to become one of the largest professional services firms in the world. Deloitte has over 330,000 employees in more than 150 countries and territories. The company's mission is to help clients achieve their goals and make an impact that matters in their businesses and communities.
Learn more about Deloitte
Size
330,000 employees
Industry
Founded
1999

Similar Jobs

More Jobs at Deloitte

More Information Technology Jobs

Find similar HPC AI Solution Architect (S2S) jobs: