IT - Senior Technology Architect

Siri InfoSolutions Inc

$129K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science or related field.
  • 4+ years of experience in AI model fine-tuning and optimization.
  • Proficient in Generative AI technologies, particularly LangGraph and Hugging Face.
  • Experience with PyTorch, LangChain, and BentoML for developing AI solutions.
  • Solid understanding of retrieval-augmented generation and semantic search techniques.
  • Ability to work with large-scale structured and unstructured datasets.

Responsibilities

  • Collaborate with Data Science and Quantitative Analytics teams to enhance model performance.
  • Build proof-of-concept AI solutions tailored to banking workflows.
  • Implement semantic search with vector databases for context improvement.
  • Conduct experiments to optimize model generation and retrieval accuracy.
  • Evaluate the performance of language models and rerankers in high-volume environments.
  • Design and integrate safety measures for compliant model behavior.
  • Engineer embedding strategies to enhance relevance in responses.

Benefits

  • Comprehensive health, dental, and vision insurance options.
  • Retirement savings plan with employer matching.
  • Generous paid time off policies including holidays.
  • Employee training and development opportunities.
  • Flexible work arrangements and remote work options.
Full Job Description
Charlotte, North Carolina

May 26, 2026

Openings - 1

Collaborate with Data Science and Quantitative Analytics teams at Wells Fargo on model fine-tuning, retrieval optimization, model guidance, and inference tuning to improve response quality and consistency. Build proof-of-concept Generative AI solutions using LangGraph, Hugging Face Transformers and Tokenizers, PyTorch, LangChain, and BentoML to prototype domain-specific banking AI workflows, evaluate model behaviour, and support model serving readiness. Develop and optimize Retrieval-Augmented Generation workflows by implementing semantic search with vector databases and designing customized reranking approaches to improve context relevance and answer quality. Conduct model experimentation, evaluation, and optimization to improve generation quality, retrieval accuracy, inference consistency, and domain-specific response performance. Evaluate LLM and reranker performance across high-volume inference workflows using model quality checks, retrieval metrics, and output consistency analysis to ensure precise and stable responses.. Design and integrate prompt-level guardrails using Model Armor, Prompt Guard, and LLaMA Guard 3 to support safe, compliant, and reliable model behaviour in enterprise banking use cases. Engineer embedding strategies and optimize semantic similarity retrieval mechanisms to improve contextual relevance across large-scale structured and unstructured banking datasets.

Job Overview

Collaborate with Data Science and Quantitative Analytics teams at Wells Fargo on model fine-tuning, retrieval optimization, model guidance, and inference tuning to improve response quality and consistency. Build proof-of-concept Generative AI solutions using LangGraph, Hugging Face Transformers and Tokenizers, PyTorch, LangChain, and BentoML to prototype domain-specific banking AI workflows, evaluate model behaviour, and support model serving readiness. Develop and optimize Retrieval-Augmented Generation workflows by implementing semantic search with vector databases and designing customized reranking approaches to improve context relevance and answer quality. Conduct model experimentation, evaluation, and optimization to improve generation quality, retrieval accuracy, inference consistency, and domain-specific response performance. Evaluate LLM and reranker performance across high-volume inference workflows using model quality checks, retrieval metrics, and output consistency analysis to ensure precise and stable responses.. Design and integrate prompt-level guardrails using Model Armor, Prompt Guard, and LLaMA Guard 3 to support safe, compliant, and reliable model behaviour in enterprise banking use cases. Engineer embedding strategies and optimize semantic similarity retrieval mechanisms to improve contextual relevance across large-scale structured and unstructured banking datasets. $129,854.00 /year

Education and Experience Requirements

Bachelor's degree in Computer Science or related with 4 years of experience.

Similar Jobs

More Jobs at Siri InfoSolutions Inc

More Information Technology Jobs

Find similar IT - Senior Technology Architect jobs: