Job DescriptionWe are hiring a Staff Engineer to join our AI Platform team building and operating our enterprise AI platform for engineering workflows. This role is focused on the foundational platform layer - Agentic systems, MCP ecosystem, LLM gateway, memory and knowledge infrastructure, and platform observability - that powers AI applications across Flash Product's Group.
Essential Duties and Responsibilities:- Platform Development: Design, build, and maintain core components of the Nexus platform, including agentic orchestration (LangGraph / Deep Agents), MCP servers and gateway integrations, memory and knowledge systems, and the LLM gateway layer.
- Agentic Systems: Develop and extend multi-agent workflows, tool-calling pipelines, and conversational AI experiences across Nexus Builder, Nexus Chat, and purpose-built applications.
- MCP Ecosystem: Build OIDC-compliant MCP servers that expose enterprise data and tools (Jira, test management, internal systems) to agents in a secure, governed way.
- Reliability & Observability: Drive platform stability through environment separation, monitoring, tracing, evaluation pipelines, and incident response practices.
- Technical Leadership: Own technical decisions within assigned workstreams, conduct design reviews, and mentor engineers on agentic patterns, LLM application design, and production AI best practices.
- Cross-Functional Collaboration: Partner with InfoSec, Cloud Infrastructure, IAM, and product engineering teams to ship platform capabilities that meet enterprise requirements.
- Continuous Innovation: Stay current with advances in LLMs, agentic frameworks, and AI infrastructure; evaluate and integrate new technologies into the Nexus roadmap.
QualificationsRequired:- Master's degree in Artificial Intelligence, Machine Learning, Data Science, Computer Science, or a related field.
- Approximately 5 years of professional software engineering experience, with demonstrated impact building production AI/ML systems or developer platforms.
- Strong proficiency in Python; working knowledge of TypeScript/JavaScript and React.
- Hands-on experience with modern AI/ML frameworks: LangChain, LangGraph, LlamaIndex, PyTorch or TensorFlow, and the Hugging Face ecosystem.
- Practical understanding of LLMs, transformers, embeddings, RAG architectures, and agentic design patterns.
- Experience integrating with LLM providers and gateways (Anthropic, OpenAI, Portkey, or equivalent), and familiarity with the Model Context Protocol (MCP).
- Solid grasp of distributed systems, microservices, REST/GraphQL APIs, and event-driven architectures.
- Experience deploying and operating workloads on Kubernetes with Docker, including hybrid on-prem / cloud topologies.
- Comfort with relational (PostgreSQL), NoSQL (MongoDB, Elasticsearch), in-memory (Redis / Valkey), and vector databases.
- Familiarity with OAuth / OIDC, SSO, and enterprise security and compliance practices.
Preferred: - Strong written and verbal communication, with the ability to articulate technical tradeoffs to engineering and leadership audiences.
- Proven ability to lead initiatives end-to-end, work independently, and collaborate effectively across teams.
- Bias for action, ownership mindset, and a track record of shipping in fast-moving environments.
- Problem Solving: Demonstrated ability to decompose ambiguous problems, design pragmatic solutions, and debug complex issues across the AI/ML and infrastructure stack.
Summary of skill sets:- Programming: Python, TypeScript/JavaScript, React, GraphQL
- AI/ML & Agents: LangGraph, LangChain, LlamaIndex, PyTorch/TensorFlow, RAG, MCP, agentic frameworks
- Platform & Infra: Kubernetes, Docker, microservices, REST APIs, CI/CD
- Data: RDBMS (PostgreSQL), NoSQL (MongoDB, Elasticsearch), in-memory (Redis/Valkey), vector databases
- Security & Identity: OAuth, OIDC, SSO, enterprise auth patterns
Compensation & Benefits Details- An employee's pay position within the salary range may be based on several factors including but not limited to (1) relevant education; qualifications; certifications; and experience; (2) skills, ability, knowledge of the job; (3) performance, contribution and results; (4) geographic location; (5) shift; (6) internal and external equity; and (7) business and organizational needs.
- The salary range is what we believe to be the range of possible compensation for this role at the time of this posting. We may ultimately pay more or less than the posted range and this range is only applicable for jobs to be performed in California, Colorado, New York or remote jobs that can be performed in California, Colorado and New York. This range may be modified in the future.
- You will be eligible to participate in Sandisk's Short-Term Incentive (STI) Plan, which provides incentive awards based on Company and individual performance. Depending on your role and your performance, you may be eligible to participate in our annual Long-Term Incentive (LTI) program, which consists of restricted stock units (RSUs) or cash equivalents, pursuant to the terms of the LTI plan. Please note that not all roles are eligible to participate in the LTI program, and not all roles are eligible for equity under the LTI plan. RSU awards are also available to eligible new hires, subject to Sandisk's Standard Terms and Conditions for Restricted Stock Unit Awards.
- We offer a comprehensive package of benefits including paid vacation time; paid sick leave; medical/dental/vision insurance; life, accident and disability insurance; tax-advantaged flexible spending and health savings accounts; employee assistance program; other voluntary benefit programs such as supplemental life and AD&D, legal plan, pet insurance, critical illness, accident and hospital indemnity; tuition reimbursement; transit; the Applause Program, employee stock purchase plan, and the Sandisk's Savings 401(k) Plan.
- Note: No amount of pay is considered to be wages or compensation until such amount is earned, vested, and determinable. The amount and availability of any bonus, commission, benefits, or any other form of compensation and benefits that are allocable to a particular employee remains in the Company's sole discretion unless and until paid and may be modified at the Company's sole discretion, consistent with the law.