Infosys

On-Prem LLM Platform Engineer (OpenShift AI / GPU)

Infosys$90K — $130K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree or foreign equivalent in a relevant field.
  • 5-7 years of hands-on experience with OpenShift (OCP) and OpenShift AI.
  • Deep understanding of Kubernetes operations and model serving.
  • Expertise in GPU orchestration and performance optimization techniques.
  • Experience with NVIDIA H200 GPUs and inference optimization techniques.

Responsibilities

  • Participate in data extraction and transformation for model readiness.
  • Ensure data quality for model development and testing.
  • Develop models using statistical or machine learning techniques.
  • Test and validate models, selecting the best algorithms based on metrics.
  • Collaborate with teams to translate business requirements into analytics solutions.
  • Document model development and deployment processes for reproducibility.
  • Deploy analytics tools in both test and production environments.

Benefits

  • Comprehensive Medical/Dental/Vision/Life Insurance coverage.
  • Long-term and short-term disability benefits.
  • Health and Dependent Care Reimbursement Accounts available.
  • Additional insurance options including Critical Illness and Accident coverage.
  • 401(k) plan with contributions based on salary level.
  • Generous paid holidays and paid time off.
Full Job Description
Job details

Job Role

Data Science Consultant 1

Career Role

Analyst - Data Science

Work Location

Charlotte, NC

State / Region / Province

North Carolina

Country

USA

Domain

Delivery

Interest Group

Infosys Limited

Company

ITL USA

Requisition ID

148550BR

Technical Skills 1

Technology|Generative AI|Generative AI for Data Analytics

Technical Skills 2

Technology|Analytics - Packages|Python - Big Data

Technical Skills 3

Technology|Agentic AI|Agent Engineering

In the assigned Job Role of Data Science Consultant 1, your Area Of Responsibility will be as below:
• Participate in data extraction, transformation, and preparation.
• Resolve common data issues and ensure quality for model development.
• Participate in developing models using statistical or machine learning techniques and collaborate with technology teams to operationalize them into analytics tools or scripts.
• Participate in model testing and validation, selecting the best-performing algorithms based on statistical and business metrics.
• Participate in the development of advanced analytics and machine learning or deep learning models including LLMs using predefined processes and tools like SAS and R/ Python.
• Participate in defining analytics problems; execute visualization, analysis, and predictive modeling with senior support.
• Identify data sources and extract from RDBMS and develop UI/UX for client usage.
• Participate in model performance, while making minor adjustments, and escalate risks or compliance concerns and generate reports on deviations or schedule slippages.
• Proactively participate in detailed documentation of model development, testing, and deployment activities for reproducibility.
• Work closely with business and technology teams to translate requirements into actionable models, while communicating results effectively.
• Apply predefined quality measurement frameworks, if any, to individual project tasks.
• Participate in deploying analytics tools in test and production environment, while ensuring they meet operational requirements.

Your contribution to the team:
• Strong analytical and problem-solving mindset with hands-on model development skills.
• Ability to translate business needs into actionable analytics solutions.
• Focus on data quality, validation and performance optimization.
• Effective collaboration with business and technology stakeholders.
• Commitment to continuous learning, knowledge sharing, fostering team development.

Required Skill and Experience

Strong hands-on experience with OpenShift (OCP) and OpenShift AI

Deep understanding of Kubernetes cluster operations and model serving

Experience with vLLM, Triton, TensorRT-LLM, or SGLang
Expertise in inference optimization techniques including:

Proficiency in model quantization techniques (FP8, AWQ, GPTQ) and performance tuning

Experience with GPU orchestration and scheduling
Working knowledge of CUDA, NCCL, and MIG concepts
Exposure to NVIDIA H200 GPUs (highly preferred)

Preferred Skill and Experience

Exposure to NVIDIA H200 GPUs (highly preferred)
• Deploy and run LLMs on-prem using:
o OpenShift (OCP), OpenShift AI, Kubernetes ML serving patterns
o Model serving stacks: vLLM, Triton, TensorRT-LLM, SGLang
• "Unfold an LLM" end-to-end:
o Package model artifacts, containerize, configure runtime, serve endpoints
o Implement authn/authz, secrets, networking, and endpoint routing patterns
• GPU-first platform engineering:
o GPU orchestration, scheduling, resource quotas
o Performance tuning for throughput/latency, memory utilization
o Awareness of CUDA/NCCL, tensor parallelism, MIG concepts

Additional Required Qualifications
• Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
• This position may require relocation and/or travel to work/project location.
• Candidates authorized to work for any employer in the United States without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role now or in the future.

Benefits

Along with competitive pay, as a full-time Infosys employee you are also eligible for the following benefits:
  • Medical/Dental/Vision/Life Insurance
  • Long-term/Short-term Disability
  • Health and Dependent Care Reimbursement Accounts
  • Insurance (Accident, Critical Illness , Hospital Indemnity, Legal)
  • 401(k) plan and contributions dependent on salary level
  • Paid holidays plus Paid Time Off

About Infosys

Infosys Limited is an Indian multinational corporation that provides business consulting, information technology and outsourcing services. It has its headquarters in Bangalore, Karnataka, India. Infosys is the second-largest Indian IT company after Tata Consultancy Services by 2017 revenue figures and the 596th largest public company in the world based on revenue. On 31 March 2018, its market capitalisation was $37.32 billion. The credit rating of the company is A? (rating by Standard & Poor's).
Learn more about Infosys
Size
314,015 employees
Market Cap
$77.5 billion
Industry
Net Income
$178.5 billion
Founded
2004
5 Year Trend
+12.2%
Revenue
$945.9 billion
NASDAQ

Similar Jobs

More Jobs at Infosys

More Information Technology Jobs

Find similar On-Prem LLM Platform Engineer (OpenShift AI / GPU) jobs: