Cloudera

AI Solutions Architect - FS or CI Polygraph Required

Cloudera$120K — $150K *
US-AnywhereRemote in Virginia, US
Education, Government & Non-Profit
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years in Data Engineering, Machine Learning, or Software Engineering; 2+ years focused on Generative AI or Deep Learning.
  • Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face).
  • Hands-on experience with Cloudera Data Platform (CDP), Spark, or similar big data tools.
  • Proficiency in orchestration tools like LangChain, LlamaIndex, or Haystack.
  • Experience developing visual data representations using Django, React, or Angular.
  • Expertise in building ETL/ELT pipelines with SQL and NoSQL/Vector databases.
  • Knowledge of government security frameworks (NIST AI RMF, FedRAMP).

Responsibilities

  • Evaluate and select optimal model architectures based on mission requirements.
  • Guide customers on decisions regarding 'Build vs. Buy vs. Fine-tune' models.
  • Design and implement robust data pipelines within CDP.
  • Develop and optimize Vector Databases and Retrieval-Augmented Generation architectures.
  • Optimize model inference using quantization and pruning techniques.
  • Collaborate with customer’s AI Center of Excellence for ethics and compliance governance.
  • Translate complex technical AI concepts into briefings for senior stakeholders.

Benefits

  • Generous PTO Policy
  • Support for work-life balance
  • Flexible Work From Home Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
Full Job Description

Business Area:

Professional Services

Seniority Level:

Mid-Senior level

Job Description:

As an AI Solutions Engineer within Cloudera’s Public Sector Consulting team, you will be the technical architect and execution lead for agencies moving from "data chaos" to "agentic autonomy." You will work directly with government organizations to design, build, and deploy mission-critical AI applications on the Cloudera Data Platform (CDP).

This is not a "theoretical" role. You will be on the front lines of Phase 2 and Phase 3 adoption journeys—helping customers clean legacy data silos, select the right model architectures, and industrialize MLOps pipelines in highly secure, often air-gapped or hybrid-cloud environments.

As the AI Solutions Engineer you will: 

1. AI Model Strategy, Selection and Implementation

  • Evaluate and select optimal model architectures (LLMs, SLMs, or traditional ML) based on mission requirements, considering tradeoffs between accuracy, latency, and cost.

  • Guide customers on "Build vs. Buy vs. Fine-tune" decisions, prioritizing open-source models (Llama, Mistral, Falcon) that can run securely within a sovereign data perimeter.

  • Experience building Agentic Workflows (AI agents that can execute API calls and multi-step tasks).

2. End-to-End Data Engineering

  • Design and implement robust data pipelines within CDP to transform "messy" legacy data into AI-ready formats.

  • Develop and optimize Vector Databases and Retrieval-Augmented Generation (RAG) architectures to ground AI responses in verified agency facts.

  • Build Data pipelines with Spark, Nifi, Kafka or other ETL tools.

3. Optimization & Performance Tuning

  • Optimize model inference for production environments using quantization, pruning, and hardware acceleration (NVIDIA GPU orchestration).

  • Implement LLMOps to monitor model performance, detect hallucination rates, and manage model versioning and drift.

4. Public Sector Advisory & Governance

  • Collaborate with the customer’s AI Center of Excellence (CoE) to establish automated guardrails for ethics, bias mitigation, and FedRAMP/IL5 compliance.

  • Translate complex technical AI concepts into mission-value briefings for GS-level stakeholders and agency leadership.

We’re excited about you if you have: (Minimum Qualifications): 

  • Experience: 5+ years in Data Engineering, Machine Learning, or Software Engineering, with at least 2 years focused on Generative AI or Deep Learning.

  • Technical Stack: Expertise in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face).

    • Hands-on experience with Cloudera (CDP), Spark, or similar big data ecosystems.

    • Proficiency in orchestration tools like LangChain, LlamaIndex, or Haystack.

    • Experience developing visual data representations and dashboards (Django, React, or Angular)  

    • Experience using a compiled programming language, preferably one that runs on the JVM (Java, Scala, etc)

  • Data Expertise: Proven ability to build ETL/ELT pipelines and work with both SQL and NoSQL/Vector databases (e.g., Pinecone, Milvus, or PGVector).

  • Public Sector Knowledge: Understanding of government security frameworks (NIST AI RMF, FedRAMP, SRGs, STIGs).

  • Active Top Secret Security Clearance

You may also have: (Preferred Qualifications)

  • Experience fine-tuning of foundational models using techniques such as PEFT (Parameter-Efficient Fine-Tuning) and LoRA to adapt AI to domain-specific government nomenclature.

  • Experience training of specialized models on proprietary datasets while ensuring strict adherence to data privacy and sensitivity labels.

  • Experience installing and operating Cloudera Data Platform 

  • Experience installing and operating Kubernetes

  • Experience in Air-Gapped deployments and managing AI workloads in disconnected environments.

  • Advanced degree (MS or PhD) in Computer Science, Data Science, or a related field.

  • Active Counterintelligence (CI) or Full Scope (FS) Poly is required.

This role is not eligible for immigration sponsorship.

What you can expect from us:

  • Generous PTO Policy 

  • Support work life balance with

  • Flexible WFH Policy 

  • Mental & Physical Wellness programs 

  • Phone and Internet Reimbursement program 

  • Access to Continued Career Development 

  • Comprehensive Benefits and Competitive Packages 

  • Employee Resource Groups

#LI-Remote

#LI-ND3

About Cloudera

Cloudera is a software company that provides a unified platform for data management and analytics. The company's platform enables organizations to store, process, and analyze large amounts of data in a variety of formats, including structured, semi-structured, and unstructured data. Cloudera's platform is used by some of the world's largest and most innovative organizations, including Airbus, Credit Suisse, and Expedia. The company was founded in 2008 and is headquartered in Palo Alto, California.
Learn more about Cloudera
Size
2,728 employees
Market Cap
$4.6 billion
Industry
Net Income
-$162.7 million
Founded
2008
5 Year Trend
+39.2%
Revenue
$869.2 million
NASDAQ

Similar Jobs

More Education, Government & Non-Profit Jobs

Find similar AI Solutions Architect - FS or CI Polygraph Required jobs: