Senior Data Platform Engineer

Stack AV

$120K — $160K *
US-Anywhere / Remote in Pittsburgh, PA
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 7+ years of experience with distributed storage systems or modern data platforms.
  • Experience with streaming platforms like Kafka or Pulsar.
  • Proficient in Python and SQL, skilled in high-availability data applications using Trino and Apache Spark.
  • Knowledge of table formats such as Iceberg, Delta Lake, Hudi, or XTable.
  • Experience with RDBMS optimization (Postgres, MySQL).
  • Strong debugging capabilities in complex distributed systems.
  • Effective team collaboration and communication of technical concepts.

Responsibilities

  • Design and operate distributed storage systems for large-scale batch workloads.
  • Build and maintain an open-source modern data platform.
  • Optimize storage resource utilization for better efficiency.
  • Enhance reliability and fault tolerance of storage and data platform components.
  • Collaborate with various teams to align on workload needs and platform improvements.
  • Contribute to platform tooling, automation, and CI/CD workflows.

Benefits

  • Support for work-life balance and flexible working hours.
  • Access to continuous learning and professional development opportunities.
  • Potential for remote work and collaboration across diverse teams.
  • Innovative work environment focused on state-of-the-art technology.
  • Opportunity to impact large-scale autonomous systems development.
Full Job Description
About the Role:

On the Compute Platform team, our mission is to provide the foundational compute platform that powers large-scale autonomous systems development. The team is responsible for enabling engineers and researchers to efficiently run compute- and data-intensive workloads on Stack AV infrastructure.

The Data Platform team is responsible for designing, implementing, and maintaining the Stack AV on-premises data platform. The team supports large-scale OLAP/OLTP and feature engineering workloads for multiple Product Development groups across the company. You will work at the intersection of infrastructure, distributed systems, and developer experience, ensuring that our critical services and pipelines are reliable, efficient, and easy to run.

As a Senior Data Platform Engineer, you will design and operate high-scale data systems that power engineers across the company.

Responsibilities:
  • Design and operate distributed storage systems for scheduling and executing large-scale batch workloads.
  • Build and maintain an open source, modern data platform.
  • Optimize utilization of storage resources.
  • Improve reliability and fault tolerance of large-scale storage systems and data platform components.
  • Collaborate with teams across the company to understand workload requirements and improve platform capabilities.
  • Contribute to platform tooling, automation, and CI/CD workflows.

Qualifications:
  • 7+ years of experience building and operating distributed storage systems or modern data platforms.
  • Experience operating streaming platforms such as Kafka or Pulsar.
  • Fluent in Python and SQL, with experience writing and maintaining highly available data applications using Trino and Apache Spark.
  • Knowledge of table formats (Iceberg, Delta Lake, Hudi, XTable).
  • Experience operating and optimizing at least one RDBMS (Postgres, MySQL).
  • Strong debugging and problem-solving skills in complex distributed systems.
  • Ability to collaborate across teams and communicate technical concepts clearly.

