Role: Principal Architect - Gen AI/Practice Lead - Gen AI
Experience Level: 10+ years
Employment type: Full Time
Location: Remote (Canada/USA)
What you will do:
As Principal Architect - Gen AI/Practice Lead - Gen AI at Quantiphi, you will be responsible for designing and developing advanced machine learning models and algorithms to solve complex business problems. You will work on optimizing and deploying these models on AWS infrastructure, ensuring scalability and reliability.
Basic Qualifications (BQ):
- 10+ years of relevant hands-on technical experience implementing and developing cloud ML solutions on AWS.
- Hands-on experience with AWS services, including proven experience using AWS SageMaker and Bedrock with different types of data sources, training jobs, and real-time and batch applications.
- Design and implement agentic AI architectures using frameworks such as LangChain and Strands Agents, enabling autonomous task planning, decision-making, and multi-step reasoning.
- Hands-on experience with Amazon Bedrock AgentCore for building, deploying, and scaling production-grade agentic AI applications, including agent memory management, tool registry, and observability.
- Architect and deploy scalable AI solutions on AWS, leveraging services like Lambda, Bedrock, Step Functions, S3, API Gateway, and SageMaker.
- Proficiency in working with LLM APIs (e.g., Claude, Nova, and other third-party LLM providers), including API integration and multi-model orchestration strategies.
- Hands-on experience fine-tuning or optimizing large language models (LLMs).
- Familiarity with LLM tool use, prompt templating, and context management.
- Strong expertise in Vector Databases, including indexing strategies, embedding generation, similarity search, and integration with RAG architectures.
- Model Evaluation & Optimization: Evaluate LLMs' zero-shot and few-shot capabilities, tune hyperparameters, ensure task generalization, and explore model interpretability for robust web app integration.
- Develop and maintain Model Context Protocol (MCP) implementations to manage state, context windows, memory, and prompt orchestration across distributed agent systems.
- Experience with at least one workflow orchestration tool, such as Airflow, Step Functions, SageMaker Pipelines, or Kubeflow.
- Experience implementing secure, scalable APIs and integrating with third-party data sources and tools.
- Ability to collaborate with cross-functional teams such as Developers, QA, Project Managers, and other stakeholders to understand their requirements and implement solutions.
- Experience with deep learning concepts: Transformers, BERT, attention models, tokenization, and embeddings.
Other Qualifications (OQ):
- Experience with software development, including exposure to frontend and backend frameworks and communication protocols.
- Experience working on Infrastructure as Code (IaC) and CI/CD pipelines.
- Experience with NLP concepts: syntactic/semantic analysis, NER, etc.
What is in it for you:
- Join one of the world's fastest-growing AI-first digital engineering companies and make a real impact at scale.
- Lead and collaborate with a high-energy team of talented, driven individuals solving complex, meaningful challenges.
- Work with Fortune 500 companies and disruptive innovators in a research-driven environment with 60+ patents.
- Stay ahead of the curve by gaining hands-on experience with cutting-edge AI, ML, data, and cloud technologies while continuously upskilling.
If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!