Engineer II, Data

CarMax, Inc.

$90K — $120K *
Plano, TX 75024
In-Person
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 2+ years of experience as a Data Engineer or Software Engineer working with data
  • Strong proficiency in writing production-ready Python code
  • Experience building pipelines on Azure or similar cloud platforms (AWS/GCP)
  • Hands-on experience with distributed data processing (e.g., Apache Spark, Databricks)
  • Familiarity with event streaming platforms (e.g., Azure Event Hubs, Kafka, Kinesis)
  • Practical experience with CI/CD, version control, and automated deployments
  • Understanding of data modeling, schema management, and reliability concepts

Responsibilities

  • Design, build, and maintain production-grade data pipelines
  • Implement event-driven ingestion and real-time processing
  • Build data transformations using distributed processing frameworks
  • Write and maintain clean and testable Python code
  • Support CI/CD pipelines and automated deployment processes
  • Monitor and troubleshoot pipeline failures and data quality issues
  • Collaborate with Data Scientists, Analysts, and platform engineers

Benefits

  • Hybrid work arrangement with varied onsite requirements based on location
  • Opportunities for professional growth and continuous learning
  • Collaboration with Analytics and Data Science teams
  • Involvement in building cloud-native and streaming data platforms
Full Job Description
8117 - Dallas Technology Hub - 5830 Granite Parkway, Plano, Texas, 75024

The bulk of the data engineer's work is building, managing, and optimizing data pipelines and then moving those pipelines effectively into production for key data and analytics consumers (such as business/data analysts, data scientists, or anyone who needs curated data for operational and analytical use cases).

Data engineers must also ensure compliance with data governance and data security requirements while creating, improving, and operationalizing these integrated and reusable data pipelines. This enables faster data access, integrated data reuse, and vastly improved time-to-solution for CarMax's operational and analytical initiatives.

Data Engineer II - Enterprise Data Services

We are seeking a Data Engineer with hands-on, production experience building and operating data pipelines in a cloud environment. This role sits on the Enterprise Data Services team, which owns streaming data ingestion, curation, and delivery for Analytics and Data Science teams.

Key Responsibilities
  • Design, build, and maintain production-grade data pipelines (streaming and batch)
  • Implement event-driven ingestion and near real-time processing
  • Build transformations using distributed data processing frameworks
  • Write and maintain clean, testable Python code following software engineering best practices
  • Support CI/CD pipelines, automated deployments, and environment promotions
  • Monitor and troubleshoot pipeline failures, latency issues, and data quality problems
  • Collaborate closely with Data Scientists, Analysts, and platform engineers


Required Experience & Skills

Candidates must have professional, hands-on experience in most of the following:
  • 2+ years of experience as a Data Engineer or Software Engineer working with data
  • Strong experience writing production Python code
  • Experience building and supporting pipelines on Azure or an equivalent cloud platform (AWS/GCP)
  • Hands-on experience with distributed data processing, such as:
    • Apache Spark
    • Databricks (or Spark runners such as EMR, Dataproc)
  • Experience working with event streaming or messaging platforms, such as:
    • Azure Event Hubs
    • Apache Kafka / Confluent Kafka
    • Amazon Kinesis
  • Practical experience with CI/CD pipelines, version control, and automated deployments
  • Familiarity with data modeling, schema management, and data reliability concepts
  • Experience operating production systems as part of an agile, collaborative engineering team


Strongly Preferred
  • Supporting production streaming systems with uptime and latency expectations
  • Azure-native services such as:
    • Azure Databricks
    • Azure Functions (or equivalent serverless frameworks: AWS Lambda, Cloud Functions)
    • Cosmos DB (or comparable NoSQL stores such as DynamoDB, Cassandra)
  • Experience working with Analytics or Data Science platform teams
  • Debugging live pipelines and handling operational ownership/on-call responsibilities


Technology Environment (Primary & Equivalent)
  • Languages: Python (required), SQL, SparkSQL
  • Streaming: Event Hubs | Kafka | Kinesis
  • Processing: Databricks | Apache Spark | EMR | Dataproc
  • Storage: Cosmos DB | DynamoDB | Cassandra
  • Serverless: Azure Functions | AWS Lambda
  • Cloud: Azure (primary) | AWS | GCP
  • DevOps: CI/CD pipelines, automated deployments


Who This Role Is For

This role is a strong fit for someone who:
  • Has already worked on real, production data systems
  • Wants to deepen experience in streaming and cloud-native data platforms
  • Brings a software engineering mindset to data problems
  • Values ownership, reliability, and continuous learning


Work Location and Arrangement: This role will be based at the CarMax Midtown office in Richmond, VA, or the CarMax Technology Hub in Plano, TX, with a hybrid work arrangement.
  • Associates based in Richmond work onsite 4 days per week.
  • Associates based in Plano work onsite 2 days per week.


Work Authorization: Applicants must be currently authorized to work in the United States on a full-time basis. Sponsorship will not be considered for this specific role.
