AI Foundation Model Engineer

Compunnel

$130K — $180K *
Finance & Insurance
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years in AI/ML engineering or related fields
  • Hands-on experience with LLMs and transformative AI technologies
  • Strong Python skills with relevant frameworks like PyTorch and TensorFlow
  • Experience deploying AI services using cloud-native solutions and CI/CD processes
  • Practical knowledge of model evaluation and inference optimization

Responsibilities

  • Design and implement LLM-powered applications for diverse business needs
  • Build RAG pipelines leveraging advanced data processing techniques
  • Adapt models using cutting-edge methods for performance improvement
  • Develop scalable APIs and microservices for cloud or containerized environments
  • Optimize inference workloads for enhanced performance and user experience
  • Implement observability measures for AI applications to ensure quality
  • Embed security and compliance best practices in AI application design

Benefits

  • Work in a dynamic, innovative field focusing on advanced AI technologies
  • Opportunity to contribute to high-stakes projects in the regulated financial sector
  • Engage in a collaborative environment with a focus on professional growth
  • Chance to work with cutting-edge tools and frameworks
  • Involvement in shaping the future of responsible AI and model governance
Full Job Description
JOB SUMMARY
Design, build, deploy, and optimize enterprise-grade AI systems powered by foundation models, LLMs, retrieval-augmented generation, and agentic AI workflows. The role converts AI concepts into secure, scalable, observable, and supportable production systems suitable for a regulated financial-services environment.

Key Responsibilities
Design and implement LLM-powered applications such as knowledge assistants, document intelligence solutions, workflow agents, summarization tools, and decision-support systems.
Build RAG pipelines using embeddings, chunking strategies, vector databases, semantic retrieval, reranking, response grounding, and citation patterns.
Adapt and optimize models using LoRA, PEFT, instruction tuning, distillation, transfer learning, quantization, and domain adaptation techniques.
Develop scalable APIs, microservices, model-serving components, and integration patterns across cloud, hybrid, or containerized environments.
Optimize inference workloads for latency, throughput, token efficiency, cost, reliability, and user experience.
Implement model and application observability, including prompt logs, retrieval quality, hallucination indicators, drift signals, feedback loops, cost telemetry, and service health.
Embed security, privacy, Responsible AI, and model risk controls into AI application design and delivery.
Create production documentation, runbooks, release notes, test evidence, and audit-ready implementation records.

Required Qualifications
7+ years in AI/ML engineering, platform engineering, software engineering, or applied machine learning.
Hands-on experience with LLMs, transformers, embeddings, RAG, semantic search, and GenAI application patterns.
Strong Python engineering skills with PyTorch, TensorFlow, Hugging Face, LangChain, LlamaIndex, Semantic Kernel, or equivalent frameworks.
Experience deploying production AI services using APIs, containers, Kubernetes, CI/CD, cloud-native services, and monitoring platforms.
Practical knowledge of model evaluation, fine-tuning, inference optimization, and secure data handling.

Preferred Qualifications
Banking, risk, compliance, financial crime, operations, or enterprise technology background.
Experience with Azure OpenAI, AWS Bedrock, Vertex AI, Databricks, vLLM, Triton, MLflow, Kubeflow, or model gateways.
Exposure to model risk, AI governance, audit controls, AI cost governance, and private or open-source LLM deployments.

Similar Jobs

More Jobs at Compunnel

More Finance & Insurance Jobs

Find similar AI Foundation Model Engineer jobs: