Principal Data Engineer

Walmart, Inc.

• $178K — $312K *

Seattle, WA 98115In-Person

Information Technology

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Bachelor's in Computer Science, Software Engineering, or related field; or Master's with relevant experience
5+ years of experience in software/data engineering or analytics; or 3+ years with a Master's
Proficient in designing and implementing data pipelines using Apache Spark and PySpark
Experienced with AWS for building and maintaining data lakes
Skilled in developing ETL workflows with Apache Airflow and writing complex SQL queries
Background in data modeling, warehousing solutions like Snowflake and Databricks
Experience in mentoring junior engineers and conducting code reviews

Responsibilities

Define data requirements and service level agreements prioritization
Identify suitable data sources and perform initial data quality checks
Build infrastructure for optimal data transformation and integration
Automate repetitive data preparation using modern tools and techniques
Create and maintain critical data documentation to leverage data as an asset
Revise application design to align with business and technical requirements
Monitor production applications and develop necessary patches

Benefits

Medical, vision, and dental coverage
401(k) and stock purchase options
Company-paid life insurance
Paid time off including sick leave, parental leave, and family care leave
Education assistance, including covered college degrees
Short-term and long-term disability benefits
Company discounts and adoption expense reimbursement

Full Job Description

What you'll do...

Position: Principal Data Engineer

Job Location: 300 Elliott Avenue W., Seattle, WA 98119

Duties: Understands the priority order of requirements and service level agreements. Defines and identifies the most suitable sources for required data that is fit for purpose, referring to external sources as required. Performs initial data quality checks on the extracted data. Reviews the deliverables of junior associates and provides guidance on data source and quality. Builds the infrastructure required for optimal transformation and integration from a wide variety of data sources using appropriate data integration technologies. Uses modern tools, techniques, and architectures to partially or completely automate the most common, repeatable and tedious data preparation and integration tasks. Deploys pipelines using scheduling and orchestration frameworks. Evaluates impacts of data issues and risks at an early stage. Identifies needs and creates methods to fuse and reshape complex, multi-source data and make it usable for modeling. Updates knowledge of current and emerging big data analytics and data science trends and techniques. Assembles large, complex data across all data platforms (for example, relational, dimensional, NoSQL) and data tools. Builds complex logical and conceptual models and provides guidance to team on physical data models. Identifies and defines the appropriate techniques for exposing data toother systems. Reviews and provides guidance and inputs on all data modeling activities to team members. Creates and maintains critical data documentation and metadata that allows data to be understood and leveraged as a shared asset. Assists in defining data modeling standards and foundational best practices. Provides inputs to the architectural design to make best use of the available resources, given goals, and expected loads. Reviews the solution and application design to ensure it meets business, technical, and data requirements. Identifies language and libraries to use in the development process. Maps test cases to business and functional requirements. Creates proof of concepts. Reviews and troubleshoots code in line with final designs. Identifies and recommends the appropriate testing methodology. Identifies the environment(s) for deployment. Identifies and recommends modifications of application based on different environment requirements. Identifies modifications needed for scalability and drives the change. Monitors applications in production and leads development of patches where required. Reviews and ensures all code documentation is complete and updated periodically. Analyzes the business problem within one's discipline and questions assumptions to help the business identify the root cause. Identifies and recommends approach to resolve the business problem to create effective technology focused solutions. Sets relevant deliverables based on the established success criteria and define key metrics to measure progress and effectiveness of the solution. Quantifies business impact.

Minimum education and experience required: Bachelor's degree or the equivalent in Computer Science, Software Engineering or related field and 5 years of post-bachelor's progressively responsible experience in software engineering, data engineering, database engineering, business intelligence, business analytics or related field; OR Master's degree or the equivalent in Computer Science, Software Engineering or related field and 3 years of experience in software engineering, data engineering, database engineering, business intelligence, business analytics or related field.

Skills required: Experience designing and implementing data pipelines using Apache Spark and PySpark. Experience building and maintaining data lakes using AWS. Experience developing ETL workflows using Apache Airflow. Experience writing complex SQL queries and performance tuning. Experience implementing data modeling and data warehousing solutions (Snowflake and Databricks). Experience programming in Python for data engineering tasks. Experience using CI/CD tools (GitHub Actions and Terraform) for data pipeline deployment. Experience monitoring, and alerting data quality and issues across Data warehouse with tools like Monte Carlo and AWS CloudWatch. Experience performing cost optimization and resource tagging for data lake infrastructure. Experience mentoring junior engineers and conducting code reviews. Experience leading data architecture design and strategy across teams. Employer will accept any amount of experience with the required skills.

Salary Range: $178,069/year to $312,000/year. Additional compensation includes annual or quarterly performance incentives.

Benefits: At Walmart, we offer competitive pay as well as performance-based incentive awards and other great benefits for a happier mind, body, and wallet. Health benefits include medical, vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty and voting. Other benefits include short-term and long-term disability, education assistance with 100% company paid college degrees, company discounts, military service pay, adoption expense reimbursement, and more.

Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms. For information about benefits and eligibility, see One.Walmart.com.

#LI-DNI #LI-DNP

* Ladders Estimates

Similar Jobs

Principal Applied Scientist, Selling Partner Selection Success
$198K — $269K *
Amazon
Seattle, WA 98115 (King County)
Yesterday
AV Simulation Domain Expert (Sr. Principal) - US (Remote) or Chicago, IL
$130K — $180K *
HERE Technologies
Remote
2 days ago
Principal Applied AI Engineer, Finance
$193K — $340K *
Genesys
Remote
Reposted 3 days ago
[Remote] Principal Applied Scientist
$120K — $251K *
Oracle Corporation
Remote
Reposted 3 days ago
Principal Applied Scientist, Agentforce Operations
$197K — $313K *
Salesforce
Seattle, WA 98115 (King County)
4 days ago
Principal Forward Deployed Architect - AI Data Foundations
$158K — $237K *
CUNA Mutual Group
Remote
4 days ago

Get Ready For Your
Next Interview

More Jobs at Walmart, Inc.

Area Manager
$65K — $98K *
Oquossoc, ME 04964 (Franklin County)
Today
Retail & Consumer Goods
In-Person
(USA) Principal, Product Manager
$143K — $286K *
Sunnyvale, CA 94087 (Santa Clara County)
Today
Consumer Technology
In-Person
(USA) Senior Manager, Technology Operations
$117K — $234K *
San Bruno, CA 94066 (San Mateo County)
Today
Retail & Consumer Goods
In-Person
(USA) Manager, Automation Engineering
$90K — $180K *
Bentonville, AR 72712 (Benton County)
Today
Manufacturing & Automotive
In-Person
(USA) Realty Project Coach
$60K — $110K *
Woodburn, OR 97071 (Marion County)
Today
Retail & Consumer Goods
In-Person

More Information Technology Jobs

Software Engineer II - Python
$90K — $120K *
7Eleven
Irving, TX 75061 (Dallas County)
Today
Solutions Architect
$143K — $179K *
Old Dominion Freight Line Inc
Lithia Springs, GA 30122 (Douglas County)
Today
Sr. Principal Software Scientist
$185K — $280K *
Cerence Inc.
Remote
Today
Associate Director, AI Performance & Operations
$157K — $215K *
Devoted Health
Remote
Today
Manager, Cloud Operations
$100K — $130K *
Avnet
Chandler, AZ 85225 (Maricopa County)
Today

Find similar Principal Data Engineer jobs:

Nationwide Seattle, WA

Principal Data Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Principal Data Engineer jobs:

Get Ready For Your
Next Interview