(USA) Principal, Data Engineer

Walmart, Inc.

$143K — $286K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 10-15+ years in large-scale distributed data platform development
  • Expertise in cloud-native ecosystems like GCP or Azure
  • Strong experience with both batch and streaming systems
  • Deep understanding of semantic modeling and agentic systems
  • Fluency in Python, Java, or Scala, with strong Spark/PySpark skills

Responsibilities

  • Architect and evolve core data platform strategy for diverse workloads
  • Establish architectural standards for data across all domains
  • Define enterprise data patterns for AI and multi-agent workflows
  • Partner with teams to align platform strategy with decision-making goals
  • Develop standards for governance and observability across data systems
  • Lead initiatives to modernize legacy systems for composability
  • Mentor engineering teams and elevate technical standards

Benefits

  • Opportunities for mentorship and professional growth
  • Work with cutting-edge technologies in AI and data engineering
  • Impact the design and functionality of data systems across the enterprise
  • Be part of a mission-focused organization driving innovation
  • Contribute to creating inclusive digital experiences and accessibility standards
Full Job Description
Position Summary...

What you'll do...
Principal Data Engineer - Agentic Data Platforms
The Mission
As a Principal Data Engineer, you are the enterprise-scale technical authority responsible for shaping and evolving Sam's Club's data platform architecture to power intelligent, autonomous systems at scale.
Your mission is to architect and steward the foundational data systems that enable AI agents, copilots, and large-scale analytics to operate with reliability, semantic clarity, and enterprise-grade trust.
You do not simply build systems-you define how the organization builds systems.

What You'll Do
Define Enterprise Data Architecture
  • Architect and evolve the core data platform strategy across batch, streaming, and hybrid systems to support both traditional analytics and AI-native workloads.
  • Establish and steward architectural standards that unify structured, semi-structured, and unstructured data across domains.


Lead Agentic Data Enablement
  • Define enterprise patterns for agent-ready data, ensuring systems are discoverable, semantically rich, and optimized for LLMs, copilots, and multi-agent workflows.
  • Shape reference architectures for RAG, real-time feature pipelines, vector indexing, and graph-augmented reasoning.


Drive Cross-Domain Platform Strategy
  • Partner with Engineering, AI/ML, Product, and Platform leaders to align roadmaps with long-term autonomous decision-making goals.
  • Influence and guide multiple teams in adopting scalable patterns without direct authority.


Architect for Trust, Reliability, and Scale
  • Define standards for observability, telemetry, lineage, governance, and AI auditability across enterprise data systems.
  • Design resilient, fault-tolerant systems that support millions of users and mission-critical retail operations.


Modernize Legacy Systems
  • Lead modernization initiatives that transform traditional data lake systems into composable, event-driven, and agent-aware platforms.
  • Balance short-term delivery with long-term scalability and maintainability.


Elevate Technical Excellence
  • Mentor and guide Staff Engineers and Senior Engineers.
  • Lead design reviews and steward architectural decisions across high-risk, high-impact initiatives.
  • Raise the technical bar for the organization through principled engineering standards.


What You'll Bring
Enterprise-Scale Experience
  • 10-15+ years building and evolving large-scale distributed data platforms.
  • Proven track record of shaping architecture across multiple domains or organizations.


Deep Data Platform Expertise
  • Strong experience in cloud-native ecosystems (GCP or Azure preferred), including BigQuery, Dataflow, Pub/Sub, or equivalent.
  • Expertise across batch and streaming systems (Kafka, Spark Structured Streaming, Flink, Druid, etc.).
  • Experience designing hybrid real-time + analytical architectures.


Agentic & AI-Ready Systems Knowledge
  • Strong understanding of semantic modeling, embeddings, knowledge graphs, and vector search.
  • Experience designing data systems that support RAG, context enrichment, and agent orchestration.
  • Ability to reason about schema, latency, storage format, and their impact on AI reasoning quality.


Governance & Observability Leadership
  • Advanced knowledge of data quality frameworks, access control, lineage, compliance, and auditability.
  • Experience defining trust and safety standards for AI-enabled systems.


Strong Engineering Foundation
  • Fluency in Python, Java, or Scala.
  • Deep experience with Spark/PySpark and SQL optimization at scale.
  • Strong systems thinking and performance optimization skills.


Influence Without Authority
  • Demonstrated ability to align diverse stakeholders and drive architectural adoption across teams.
  • Clear, executive-level communication and the ability to translate technical strategy into business value.


Minimum Qualifications...

Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications.

Option 1: Bachelor's degree in Computer Science and 5 years' experience in software engineering or related field. Option 2: 7 years' experience in software engineering or related field. Option 3: Master's degree in Computer Science and 3 years' experience in software engineering or related field.
4 years' experience in data engineering, database engineering, business intelligence, or business analytics.

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

Data engineering, database engineering, business intelligence, or business analytics, ETL tools and working with large data sets in the cloud, Master's degree in Computer Science or related field and 5 years' experience in software engineering or related field, We value candidates with a background in creating inclusive digital experiences, demonstrating knowledge in implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, assistive technologies, and integrating digital accessibility seamlessly. The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmart's accessibility standards and guidelines for supporting an inclusive culture.

Primary Location...

809 11th Ave, Sunnyvale, CA 94089-4731, United States of America

Similar Jobs

More Jobs at Walmart, Inc.

More Information Technology Jobs

Find similar (USA) Principal, Data Engineer jobs: