(USA) Senior, Software Engineer

Walmart, Inc.

$117K — $234K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Master's Degree in Computer Science, Engineering, or related field
  • 5+ years of experience building scalable eCommerce applications or distributed backend services
  • 3+ years of industry experience in application releases, CI/CD pipelines, and distributed system testing
  • Strong expertise in containerization and orchestration using Kubernetes, including multi-cluster and GPU-node management
  • Experience with modern CI/CD platforms and GitOps workflows

Responsibilities

  • Build, manage, and evolve QE & Release Automation frameworks with AI-assisted capabilities
  • Support Kubernetes-based containerization in production, including GPU-backed workloads
  • Lead investigation and resolution of high-impact incidents for search and AI services
  • Implement comprehensive monitoring and observability for applications and AI models
  • Maintain and improve automation pipelines for software deployment cycles
  • Integrate AI tools into engineering workflows to enhance development speed
  • Drive execution of medium- to large-scale projects from Dev to Ops, focusing on AI initiatives

Benefits

  • Opportunity to work with cutting-edge AI technologies and tools
  • Be part of a team influencing a high-availability service used by millions worldwide
  • Collaborative environment with engineers and AI/ML professionals
  • Focus on operational excellence in a high-impact role
  • Involvement in large-scale projects with a direct effect on business metrics
Full Job Description
What you'll do...
About Team:

The Search PTE-DevOps team processes billions of queries for millions of products on Walmart sites and apps worldwide. Whenever a user types in a query or browses through product categories on the web or mobile, our service goes to work. We mine structured and semi-structured data from product catalogs, the social web, transactions, query logs, and AI-generated signals at an unprecedented scale. We work on big data problems and apply cutting-edge relevance algorithms from information retrieval, machine learning, and AI-powered ranking to deliver a high-availability, low-latency service that directly impacts business metrics.
Position Summary

Being part of the Search PTE-DevOps team at Walmart provides deep insight into the full lifecycle of a product - from content acquisition to being sold on Walmart.com. As a Senior Software Engineer in DevOps & AI Platform, you must support all systems and services to ensure high availability and reliability, while embracing AI-augmented workflows to accelerate engineering velocity.
You will work closely with developers, AI/ML engineers, and platform teams to support new application features, AI model deployments, and service launches. You will design, build, and operate the tools that help in developing, scaling, and monitoring cutting-edge technology - including GenAI and LLMOps pipelines.
You must be able to triage complex technical issues in collaboration with engineering, NOC, NetEng, and Platform teams. If you are passionate about five-nines (99.999%) reliability and excited about the intersection of AI and platform engineering, this position is for you.

We are looking for an expert in continuous integration and delivery pipelines, containerized infrastructure, and AI-assisted development practices. You will play a critical role in all search application and AI model release cycles, working closely with Engineering, QE, and DevOps.

What You'll Do:
  • Build, manage, and evolve QE & Release Automation frameworks, incorporating AI-assisted test generation and self-healing test capabilities
  • Build and support Kubernetes-based containerization in production, including GPU-backed workloads for AI/ML inference
  • Independently lead the investigation and resolution of high-impact search system and AI service incidents
  • Build, manage, and support comprehensive monitoring and observability for applications and AI model performance (drift, latency, accuracy)
  • Maintain and improve automation pipelines supporting application build, release, and AI model deployment cycles (CI/CD + MLOps/LLMOps)
  • Integrate AI coding assistants and GenAI tooling (e.g., Wibey, GitHub Copilot) into engineering workflows to accelerate development
  • Design and implement AI-powered observability solutions using intelligent alerting, anomaly detection, and predictive incident management
  • Collaborate with AI/ML teams to operationalize LLM-based features within search, including prompt pipeline management and vector search infrastructure
  • Drive execution and lead medium- to large-scale projects from Dev to Ops, including AI/ML platform initiatives
  • Analyze, design, and build frameworks using cutting-edge technology and AI tools to fulfill Operational Excellence
  • Lead and independently handle high-impact, critical search system and AI service incidents
  • Improve, optimize, and identify opportunities within the software development and AI deployment lifecycle (SDLC + MLOps)
  • Provide engineering and QE teams with architectural guidance on solutions, automation frameworks, and AI integration patterns
  • Work with product and engineering teams to review new functional and AI-driven requirements; develop comprehensive test plans and automate test cases - including AI model validation
  • Perform quality assurance for large-scale eCommerce backend search services and AI-powered features
  • Write programs and scripts to automate testing and validation of search backend services and LLM/AI inference pipelines
  • Apply expertise in WCNP, Concord, Looper, Python, Golang, and Java, along with hands-on experience in AI/ML tooling, LLMOps, and GenAI platforms


What You'll Bring:
  • Bachelor's or Master's Degree in Computer Science, Engineering, or related field
  • 5+ years of experience building scalable eCommerce applications or distributed backend services
  • 3+ years of industry experience in application releases, CI/CD pipelines, and distributed system testing
  • Strong expertise in containerization and orchestration using Kubernetes (including multi-cluster and GPU-node management)
  • 2+ years of programming experience in Python, Go, Java, and Shell scripting, with exposure to REST and gRPC API frameworks
  • Experience with modern CI/CD platforms (e.g., Concord, GitHub Actions, Looper) and GitOps workflows (e.g., ArgoCD, Flux)
  • Working knowledge of AI/ML workflows: model serving, inference optimization, or LLM deployment pipelines
  • Familiarity with observability stacks: OpenTelemetry, distributed tracing, log aggregation (e.g., Splunk, OpenObserve), and AI-assisted anomaly detection

Additional Preferred Qualifications
  • Experience with LLMOps and GenAI platforms: prompt engineering, RAG pipelines, vector databases (e.g., Pinecone, Weaviate, Elasticsearch KNN), and LLM evaluation frameworks
  • Hands-on experience with AI coding assistants (e.g., Wibey, GitHub Copilot) and AI-augmented DevOps tooling
  • Proficiency with WCNP (Walmart Cloud Native Platform) and cloud-native infrastructure on GCP or Azure
  • Knowledge of eBPF-based observability tools (e.g., Cilium, Pixie) and advanced networking concepts (VIP, TCP, Envoy/Istio service mesh)
  • Experience with GPU infrastructure management for AI workloads (CUDA, NVIDIA device plugins for Kubernetes)
  • Familiarity with MLflow, Kubeflow, Ray, or similar MLOps platforms for experiment tracking and model lifecycle management
  • Experience with performance and load testing tools (e.g., Gatling, k6, Locust) to measure server and client-side metrics
  • Knowledge of AI safety and responsible AI practices in production environments (guardrails, content filtering, bias monitoring)
  • Contributions to open-source DevOps, AI/ML, or platform engineering projects are a strong plus


Minimum Qualifications...

Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications.

Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 3 years' experience in software engineering or related area.
Option 2: 5 years' experience in software engineering or related area.

Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

Master's degree in Computer Science, Computer Engineering, Computer Information Systems, Software Engineering, or related area and 1 year's experience in software engineering or related area. We value candidates with a background in creating inclusive digital experiences who demonstrate knowledge of implementing Web Content Accessibility Guidelines (WCAG) 2.2 AA standards, working with assistive technologies, and integrating digital accessibility seamlessly. The ideal candidate would have knowledge of accessibility best practices and join us as we continue to create accessible products and services following Walmart's accessibility standards and guidelines for supporting an inclusive culture.

Primary Location...

1395 Crossman Ave, Sunnyvale, CA 94089-1114, United States of America

Walmart and its subsidiaries are committed to maintaining a drug-free workplace and have a no-tolerance policy regarding the use of illegal drugs and alcohol on the job. This policy applies to all employees and aims to create a safe and productive work environment.
