Senior Data Platform Engineer

Stack AV

$120K — $150K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years of experience building and managing distributed storage systems or modern data platforms.
  • Experience with streaming platforms like Kafka or Pulsar.
  • Proficient in Python and SQL, with a track record of developing high-availability data applications using Trino and Apache Spark.
  • Familiarity with table formats such as Iceberg, Delta Lake, Hudi, and Xtable.
  • Hands-on experience with at least one RDBMS, such as Postgres or MySQL.
  • Strong skills in debugging and problem-solving within complex distributed systems.
  • Excellent team collaboration and clear communication of technical concepts.

Responsibilities

  • Design and operate distributed storage systems for large-scale batch workload execution.
  • Build and maintain a modern, open-source data platform.
  • Optimize storage resource utilization and enhance reliability and fault tolerance.
  • Collaborate with various teams to understand workload requirements and enhance platform capabilities.
  • Contribute to platform tooling, automation, and CI/CD workflows.

Benefits

  • Flexible working hours and potential for remote work.
  • Access to continuous learning and development opportunities.
  • Collaboration with a diverse and innovative team of engineers and researchers.
  • Opportunities to work with cutting-edge technology in autonomous systems.
  • Participation in building open-source solutions and contributions to the tech community.
Full Job Description
About the Role:

In the Compute Platform team, our mission is to provide the foundational compute platform that powers large-scale autonomous systems development. The team is responsible for enabling engineers and researchers to efficiently run compute and data intensive workloads on Stack AV infrastructure.

The Data Platform team is responsible for designing, implementing and maintaining the Stack AV on-premises data platform. The team supports large scale OLAP/OLTP and feature engineering workloads for multiple Product Development groups across the company. You will work at the intersection of infrastructure, distributed systems, and developer experience-ensuring that our critical services and pipelines are reliable, efficient, and easy to run.

As a Senior Data Platform Engineer, you will design and operate high scale data systems that power engineers across the company.

Responsibilities:
  • Design and operate distributed storage systems for scheduling and executing large-scale batch workloads.
  • Build and maintain an open source, modern data platform.
  • Optimize utilization of storage resourcesImprove reliability and fault tolerance of large-scale storage systems and data platform components.
  • Collaborate with teams across the company to understand workload requirements and improve platform capabilities.
  • Contribute to platform tooling, automation, and CI/CD workflows.

Qualifications:
  • 7+ years of experience building and operating distributed storage systems or modern data platforms.
  • Experience operating streaming platforms such as Kafka or Pulsar.
  • Fluent in Python, and SQL, with experience writing and maintaining highly available data applications using Trino and Apache Spark.
  • Knowledge of table formats (Iceberg, Delta Lake, Hudi, Xtable).
  • Experience operating and optimizing at least one RDBMS (Postgres, MySQL).
  • Strong debugging and problem-solving skills in complex distributed systems.
  • Ability to collaborate across teams and communicate technical concepts clearly.


Similar Jobs

More Jobs at Stack AV

More Information Technology Jobs

Find similar Senior Data Platform Engineer jobs: