GenAI Platform / LLM Inference Optimization Engineer (Cloud)

Infosys Limited Digital$100K — $130K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree or equivalent experience in a relevant field.
  • 5-7 years of hands-on experience in data science, machine learning, or related areas.
  • Strong command of tools such as vLLM, TensorRT-LLM, and Triton.
  • Experience with cloud platforms like GCP and Azure, along with Terraform.
  • Knowledge of MLOps practices and AI/ML deployment techniques.

Responsibilities

  • Develop tasks for data preparation and identify patterns or anomalies.
  • Ensure data is ready for advanced modeling applications.
  • Create and refine complex models for various use cases, ensuring scalable deployment.
  • Test and optimize algorithms for performance and scalability while mentoring team members.
  • Design predictive models to solve business challenges and ensure effective knowledge management.

Benefits

  • Medical, dental, and vision insurance coverage.
  • Long-term and short-term disability insurance options.
  • Health and dependent care reimbursement accounts available.
  • 401(k) plan with employer contributions based on salary level.
  • Generous paid holidays and paid time off.
Full Job Description
Job Description

In the assigned Job Role of Data Science Consultant 2, your Area Of Responsibility will be as below:
• Develop data preparation tasks, while identifying patterns or anomalies.
• Ensure data readiness for advanced modeling.
• Develop models for complex use cases (e.g., forecasting models, LLM-based solutions), while refining algorithms to meet business needs, and ensure smooth deployment into scalable, production-ready solutions.
• Conduct testing and optimize algorithms for performance, reliability, and scalability, while providing guidance to team members in best practices.
• Design and develop predictive models and data-driven analyses to address business challenges.
• Build, evaluate, and deploy models, standardize code, and contribute to knowledge management.
• Leverage tools like SAS and R/Python to create reusable customizations for non-ML, ML, and deep learning algorithms, while enhancing analytics including LLMs, and create innovative, cost-effective solutions.
• Define analytics problems for projects; execute visualization, analysis, and predictive modeling under guidance.
• Proactively maintain models and implement improvements for accuracy and reliability.
• Apply governance controls to mitigate risks and ensure compliance.
• Analyze performance trends, recommend improvements, and document discrepancies for escalation.
• Maintain comprehensive documentation standards, while participating in knowledge transfer sessions.
• Participate in discussions with stakeholders to refine requirements, provide insights, and guide implementation of models.
• Apply the predefined quality measurement framework at an individual task level in the project.
• Deploy complex analytics tools or multi-system integration, while validating deployment success.
• Participate in developing scripts or templates for repeated deployments tasks.
• Contribute to analytic solutions, IP asset creation, and training initiatives.
• Contribute to thought leadership such as papers, innovative non-ML, ML, deep learning or LLM models, and proofs of concepts.
• Participate in and deliver analytics training, while contributing to content creation.
• Provide input for segment and unit-level business plans.

Your contribution to the team:
• Deliver scalable, high-quality analytics solutions aligned to business needs.
• A knack for optimization, deployment and performance improvement of models.
• The ability to drive innovation through advanced analytics, automation and thought leadership.
• Enable team growth through knowledge sharing, training and standardization.
• Support business planning with data-driven insights.

Benefits

Along with competitive pay, as a full-time Infosys employee you are also eligible for the following benefits:
  • Medical/Dental/Vision/Life Insurance
  • Long-term/Short-term Disability
  • Health and Dependent Care Reimbursement Accounts
  • Insurance (Accident, Critical Illness , Hospital Indemnity, Legal)
  • 401(k) plan and contributions dependent on salary level
  • Paid holidays plus Paid Time Off

Required Skill and Experience
• vLLM, TensorRT-LLM, Triton, SGLang
• Quantization (FP8/AWQ/GPTQ), tensor parallelism
• Performance benchmarking & tuning
• Kubernetes, GKE, KServe / ML serving patterns
• Helm, Operators
• GPU orchestration concepts and scheduling patterns
• GCP and/or Azure (strong hands-on)
• Terraform
• Cloud networking, landing zones, governance/org policies
• HashiCorp Vault (secrets management)
Observability & SRE
• Prometheus/Grafana, logging, tracing
• SRE/SLO mindset, reliability engineering

Preferred Skill and Experience

Experience in Big Data technologies (e.g., BigQuery, Hadoop).
Expertise in ML model development, data engineering, and software engineering principles.
Knowledge of MLOps and AI/ML deployment (e.g., SageMaker, Snowflake). Familiarity with CI/CD, DevOps, and automation tools in AI/ML contexts.
• Design and implement LLM inference serving stacks using:
o vLLM, TensorRT-LLM, Triton Inference Server, SGLang
o Inference optimization techniques: continuous batching, speculative decoding, KV/prefix caching
o Quantization: FP8 / AWQ / GPTQ and tuning for GPU utilization
• Build Kubernetes-based serving platforms:
o KServe, Kubernetes ML Serving, GKE, OpenShift (OCP) (where applicable)
• Enable GenAI platforms and RAG use cases:
o Integrate LLM services with RAG pipelines
o Provide reusable internal libraries, templates, and developer enablement assets
• Collaborate with cross-functional teams and client stakeholders to productionize LLM workloads at scale

Additional Required Qualifications
• Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
• This position may require relocation and/or travel to work/project location.
• Candidates authorized to work for any employer in the United States without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role now or in the future.

About Infosys Limited Digital

Infosys Limited Digital Careers

Joining Infosys Limited Digital offers a unique opportunity to be part of a global team of professionals who are at the forefront of digital innovation and transformation. Infosys Limited Digital is renowned for its commitment to leadership in technology services and consulting, making it an ideal place for ambitious individuals seeking growth and career advancement.

Explore Job Opportunities

Infosys Limited Digital is actively seeking talented individuals to join their team. With a variety of job opportunities available, candidates can find positions that match their skills and interests. The company values diversity and inclusivity, ensuring that all team members have the opportunity to contribute to their fullest potential.

Experience Professional Growth

At Infosys Limited Digital, professional growth is a priority. The company offers comprehensive benefits and diversity training programs that support both personal and professional development. Employees are encouraged to take leadership roles within projects, fostering a culture of innovation and continuous improvement.

Engage in Meaningful Work

Employees at Infosys Limited Digital engage in projects that transform businesses and industries worldwide. By leveraging cutting-edge technology and deep industry expertise, the team delivers solutions that meet the complex demands of today's digital landscape.

Internship Programs

For those starting their careers, Infosys Limited Digital’s internship programs provide a robust platform for learning and development. Interns gain hands-on experience, working alongside seasoned professionals on real-world projects. This not only enhances their skills but also prepares them for future employment opportunities within the company.

Join a Collaborative Team

Infosys Limited Digital prides itself on its collaborative culture. Team members from around the globe come together to share ideas and strategies, ensuring the best outcomes for clients and stakeholders. Networking within the company is highly encouraged, opening doors to myriad opportunities for career advancement.

Prepare for Your Interview

When applying for a position at Infosys Limited Digital, it is crucial to tailor your resume to highlight relevant experience and skills. Preparation for the interview process is key, as it is a chance to demonstrate how your background aligns with the company’s needs and values.

Stay Connected with Career Insights

Keep up to date with the latest in career opportunities and industry trends by subscribing to Infosys Limited Digital’s career blog. Gain insights from insiders and stay informed about new openings and company news.

Apply Now

Discover the exciting and rewarding career opportunities at Infosys Limited Digital. Search for open positions that align with your professional skills and interests. Infosys Limited Digital is looking for passionate, curious, and innovative team players ready to make a significant impact in the digital world.

SEARCH INFOSYS LIMITED DIGITAL JOBS

READ CAREERS BLOG

Job Alert Emails

Customize your subscription to receive job alerts and insider tips tailored to your preferences. Explore the diverse and dynamic career paths available at Infosys Limited Digital.
Learn more about Infosys Limited Digital

Similar Jobs

More Jobs at Infosys Limited Digital

More Enterprise Technology Jobs

Find similar GenAI Platform / LLM Inference Optimization Engineer (Cloud) jobs: