Senior AI Platform Engineer - Frisco

McAfee • $107K — $176K *

San Jose, CA 95123 • In-Person

Information Technology

8 - 10 years of experience

Today

Job Overview by Ladders

Qualifications

10+ years in platform engineering with AI/ML or GenAI experience.
Hands-on experience with LLM ecosystems (AWS Bedrock, OpenAI, Anthropic).
Strong Kubernetes expertise (EKS/GKE), including GPU scheduling and multi-tenant isolation.
Proficient in Python and Go; familiarity with FastAPI and gRPC.
Deep knowledge of AWS services and Infrastructure as Code (Terraform).
Experience integrating platforms using Backstage and self-service patterns.
Strong understanding of distributed systems and CI/CD automation.

Responsibilities

Design, build, and scale generative AI platforms for LLM applications.
Architect secure, scalable cloud-native AI infrastructure (AWS, GCP, Kubernetes).
Enable self-service AI capabilities through standardized services and APIs.
Manage AI gateways including model routing and policy enforcement.
Integrate GenAI services into CI/CD pipelines for lifecycle management.
Build observability platforms for tracking token usage and performance metrics.
Drive multi-cloud AI platform strategy and modernization initiatives.

Benefits

Bonus Program
401k Retirement Plan
Medical, Dental, Vision, and Disability Coverage
Paid Parental Leave
Support for Community Involvement
14 Paid Company Holidays
Unlimited Paid Time Off for Exempt Employees
96 Hours of Sick Time and 120 Hours of Vacation for Non-Exempt Employees

Full Job Description

Job Title:
Senior AI Platform Engineer - Frisco

Role Overview:

This role is responsible for designing, building, and scaling enterprise-grade Generative AI platforms and developer ecosystems. The focus is on enabling secure, scalable, reliable, and production-ready GenAI capabilities across the organization by leveraging LLMs, AI gateways, Kubernetes, and cloud-native infrastructure.

The role combines deep expertise in platform engineering, AI infrastructure, and generative AI at enterprise scale. It operates with a platform-as-a-product mindset, enabling self-service AI capabilities through developer portals (e.g., Backstage templates and plugins) to accelerate adoption and standardization.

The engineer will partner closely with Security and Governance teams to embed responsible AI practices, enforce policy-driven controls, and provide token-level usage and cost visibility. This role also drives consistency in model access patterns, observability, and lifecycle management of AI services across environments.

This is a hybrid position located in Frisco, TX. You will be required to be onsite on an as-needed basis; when not working onsite, you will work from your home office. We are only considering candidates within a commutable distance of the Frisco office and are not offering relocation assistance at this time.

About The Role:

Design, build, and scale enterprise-grade Generative AI platforms supporting LLM applications, AI agents, RAG architectures, and multi-model routing.
Architect and implement secure, scalable AI infrastructure leveraging cloud-native technologies (AWS, GCP, Kubernetes, GKE/EKS).
Enable self-service AI capabilities for engineering teams through standardized platform services, APIs, and Backstage templates/plugins.
Build and operate Retrieval-Augmented Generation (RAG) infrastructure, including embedding pipelines and vector stores (OpenSearch, Aurora pgvector).
Develop and manage enterprise AI gateway capabilities, including model routing, rate limiting, token tracking, and policy enforcement.
Integrate GenAI services into CI/CD pipelines and platform workflows to enable seamless deployment and lifecycle management.
Build observability platforms for GenAI systems, tracking token usage, latency, response quality, failure rates, throughput, and cost visibility.
Own lifecycle management of Kubernetes-based AI platforms, including upgrades, patching, and scaling.
Define SLIs/SLOs and reliability benchmarks for AI platform services.
Implement AI security guardrails including PII redaction, prompt injection defenses, and policy-driven controls.
Integrate DevSecOps and AI security scanning into deployment pipelines to enforce secure-by-design practices.
Design AI release validation, risk analysis, and governance frameworks for production readiness.
Build reusable infrastructure modules and platform automation frameworks using Infrastructure as Code (Terraform or equivalent).
Develop upgrade and patching strategies for AI platforms with minimal downtime and operational risk.
Ensure platform security posture, compliance, and lifecycle governance across environments.
Drive multi-cloud AI platform strategy and lead modernization initiatives across AWS and GCP.
Partner with Security and Governance teams to enforce responsible AI practices and enterprise standards.
Drive measurable improvements in developer productivity, platform adoption, and AI cost efficiency through standardized platform capabilities.

About You:

10+ years of experience in platform engineering, with hands-on AI/ML or GenAI platform experience.
Hands-on experience with at least one LLM ecosystem (AWS Bedrock, OpenAI, Anthropic).
Strong Kubernetes experience (EKS/GKE), including GPU scheduling, autoscaling, and multi-tenant isolation.
Strong programming expertise in Python and Go; experience building services using FastAPI and gRPC.
Deep expertise in AWS (IAM, VPC, KMS) and Infrastructure as Code (Terraform).
Experience building and integrating platforms using Backstage (plugins, templates, self-service patterns).
Strong understanding of distributed systems and event streaming (Apache Kafka).
Expertise in CI/CD automation and platform engineering best practices.
Experience with multi-model orchestration frameworks (LangChain, LlamaIndex).
Exposure to LLMOps / MLOps tooling for model lifecycle management, evaluation, and versioning.
Experience building or integrating AI agent frameworks and orchestration patterns.
Familiarity with AI cost optimization strategies (token efficiency, caching, adaptive routing).
Experience with prompt engineering frameworks, guardrails, and evaluation techniques.
Exposure to AI model evaluation frameworks (quality scoring, hallucination detection, benchmarking).
Experience with vector databases beyond OpenSearch (e.g., Pinecone, Weaviate).
Familiarity with event-driven architectures for AI workflows (Kafka-based streaming pipelines).
Experience exposing platform capabilities as reusable APIs, SDKs, templates, and developer tooling.
Strong understanding of cloud-native architectures and microservices design patterns.
Experience implementing AI security controls, governance frameworks, and risk mitigation.
Experience with enterprise AI gateway patterns for model access and control.
Exposure to agentic AI concepts (MCP, A2A, AI agents) and emerging GenAI orchestration patterns.
Proven ability to lead architecture reviews, drive platform governance, and influence engineering standards.
Demonstrated experience driving large-scale engineering transformation initiatives.
AI/ML certifications such as AWS Machine Learning Specialty or Google Cloud ML Engineer are a plus.
Cloud architecture certifications (AWS/GCP Solutions Architect) are a plus.
Kubernetes certifications (CKA, CKAD, CKS) are a plus.

Company Benefits and Perks:

We work hard to embrace diversity and inclusion and encourage everyone at McAfee to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.

Bonus Program
401k Retirement Plan
Medical, Dental, Vision, Basic Life, Short Term Disability and Long-Term Disability Coverage
Paid Parental Leave
Support for Community Involvement
14 Paid Company Holidays
Unlimited Paid Time Off for Exempt Employees
96 Hours of Sick Time and 120 Hours of Vacation Accrued Each Year for Non-Exempt Employees

The starting pay range for this position is $107,430.00-$176,490.00. McAfee takes into consideration an individual's skillset, experience and location in making final salary determinations. For further details, please discuss with the Talent Acquisition Partner.

About McAfee

McAfee is a cybersecurity company that provides antivirus, encryption, and other security solutions. The company was founded in 1987 and is headquartered in Santa Clara, California. McAfee's products are designed to protect against a variety of cyber threats, including malware, phishing, and ransomware. The company serves customers in a variety of industries, including healthcare, finance, and government. In 2011, McAfee was acquired by Intel; it was spun off as an independent company in 2017 and went public on NASDAQ in 2020.

Size: 7,000 employees
Market Cap: $4.7 billion
Industry: Enterprise Technology
Net Income: -$118 million
Founded: 1987
Revenue: $2.9 billion
NASDAQ: MCFE

* Ladders Estimates
