Plaid

Senior Data Engineer - Data Engineering

Plaid$130K — $160K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 4+ years of experience in data engineering, focusing on large-scale data pipelines.
  • Proficient in building data models on extensive datasets (from 500TB to petabytes).
  • Strong SQL skills and familiarity with orchestration tools like DBT, Mode, and Airflow.
  • Experience with data warehouses and lakes, notably Redshift, Snowflake, and Databricks.
  • Knowledge of batch and real-time pipeline construction using technologies like Spark and Kafka.
  • Ability to design schemas for analytics over unstructured data.
  • A proactive attitude towards trying new technologies and creating proof-of-concepts.

Responsibilities

  • Analyze Plaid's product strategy to shape dataset design and usage principles.
  • Prioritize data quality and performance during dataset creation.
  • Lead collaborative data engineering projects that span the company.
  • Promote the adoption of relevant industry tools and practices.
  • Manage and enhance core SQL and Python data pipelines for data lake and warehouse systems.
  • Ensure comprehensive documentation and defined quality standards for datasets.

Benefits

  • Opportunity to influence and define the ownership of internal datasets and visualizations.
  • Strong professional growth through mentorship from an experienced DE team and the broader Data Platform team.
  • Collaborative environment working across various teams including Engineering, Product, and Marketing/Finance.
  • Empowerment to voice ideas and impact the data strategy directly.
Full Job Description


The main goal of the DE team in 2024-25 is to build robust golden data sets to power our business goals of creating more insights based products. Making data-driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while maintaining correct and complete data. We provide tooling and guidance to teams across engineering, product, and business and help them explore our data quickly and safely to get the data insights they need, which ultimately helps Plaid serve our customers more effectively. Data Engineers heavily leverage SQL and Python to build data workflows. We use tools like DBT, Airflow, Redshift, ElasticSearch, Atlanta, and Retool to orchestrate data pipelines and define workflows. We work with engineers, product managers, business intelligence, data analysts, and many other teams to build Plaid's data strategy and a data-first mindset. Our engineering culture is IC-driven -- we favor bottom-up ideation and empowerment of our incredibly talented team. We are looking for engineers who are motivated by creating impact for our consumers and customers, growing together as a team, shipping the MVP, and leaving things better than we found them.

You will be in a high impact role that will directly enable business leaders to make faster and more informed business judgements based on the datasets you build. You will have the opportunity to carve out the ownership and scope of internal datasets and visualizations across Plaid which is a currently unowned area that we intend to take over and build SLAs on. You will have the opportunity to learn best practices and up-level your technical skills from our strong DE team and from the broader Data Platform team. You will collaborate with and have strong and cross functional partnerships with literally all teams at Plaid from Engineering to Product to Marketing/Finance etc.

Responsibilities
  • Understanding different aspects of the Plaid product and strategy to inform golden dataset choices, design and data usage principles.
  • Have data quality and performance top of mind while designing datasetsLeading key data engineering projects that drive collaboration across the company.
  • Advocating for adopting industry tools and practices at the right timeOwning core SQL and python data pipelines that power our data lake and data warehouse.
  • Well-documented data with defined dataset quality, uptime, and usefulness.

Qualifications
  • 4+ years of dedicated data engineering experience, solving complex data pipelines issues at scale.
  • You've have experience building data models and data pipelines on top of large datasets (in the order of 500TB to petabytes)
  • You value SQL as a flexible and extensible tool, and are comfortable with modern SQL data orchestration tools like DBT, Mode, and Airflow.
  • You have experience working with different performant warehouses and data lakes; Redshift, Snowflake, Databricks.
  • You have experience building and maintaining batch and realtime pipelines using technologies like Spark, Kafka.
  • You appreciate the importance of schema design, and can evolve an analytics schema on top of unstructured data.
  • You are excited to try out new technologies. You like to produce proof-of-concepts that balance technical advancement and user experience and adoption.
  • You like to get deep in the weeds to manage, deploy, and improve low level data infrastructure.
  • You are empathetic working with stakeholders. You listen to them, ask the right questions, and collaboratively come up with the best solutions for their needs while balancing infra and business needs.
  • You are a champion for data privacy and integrity, and always act in the best interest of consumers.

About Plaid

Plaid is a financial services company based in New York City. The company builds a technology platform, which enables applications to connect with users' bank accounts. Plaid focuses on enabling consumers and businesses to interact with their bank accounts, check balances, and make payments through financial technology applications. The company was founded in 2013 by Zach Perret and William Hockey. In January 2020, Visa announced that it would acquire Plaid for $5.3 billion. The acquisition was completed in January 2021.
Learn more about Plaid
Size
600 employees
Industry
Founded
2011

Similar Jobs

More Jobs at Plaid

More Information Technology Jobs

Find similar Senior Data Engineer - Data Engineering jobs: