New York Blood Center

Senior Data Engineer

New York Blood Center$127K — $137K *
Rye, NY 10580In-Person
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor’s degree in computer science, Data Science, Information Technology, or a related quantitative field.
  • 6+ years in data engineering with complex, production-grade data platforms ownership.
  • Expert-level skills in SQL (query optimization) and Python (PySpark).
  • Deep experience with Azure data services: Azure Data Factory, Databricks, Synapse.
  • Proven design experience in dimensional data models and data lake architecture.
  • Strong background in data quality engineering and automated validation.
  • Experience with relational databases and legacy to cloud migration.

Responsibilities

  • Architect and own complex data pipelines across NYBCe's data platform.
  • Lead the design and implementation of ELT/ETL frameworks using SQL and Azure tools.
  • Establish and monitor pipeline reliability standards and protocols.
  • Drive design of scalable data models for dimensional warehousing on Azure.
  • Contribute to architectural decisions for data storage and optimization.
  • Lead migrations from legacy systems to cloud-native platforms.
  • Design feature pipelines to support machine learning and analytics.

Benefits

  • Opportunities for mentorship and professional development.
  • Involvement in cutting-edge analytics and AI projects.
  • Contribution to significant architectural decisions affecting the enterprise.
  • Collaboration with a skilled team on advanced data engineering solutions.
  • Access to modern tools and technologies for data engineering.
Full Job Description
Responsibilities

As a Senior Data Engineer on DAPI's Tetris team, you will own the design and delivery of complex data engineering solutions that power NYBCe's enterprise analytics, AI, and reporting capabilities. Reporting to the Lead Data Engineer, you will drive technical decisions, set engineering standards, and ensure the reliability and scalability of DAPI's data platform across 49+ integrated enterprise source systems.

 

This role demands deep technical mastery in SQL, Python, and Azure cloud data engineering, combined with a product orientation—understanding how the data assets you build translate into decisions, reports, and AI outputs for the business. You will mentor Data Engineers, contribute to architectural direction, and serve as a technical anchor for delivery across the Tetris team's sprint cycles.

 

Advanced Pipeline Development & Ownership

  • Architect, build, and own complex data pipelines for high-volume, high criticality workstreams across NYBCe's enterprise data platform.
  • Lead the design and implementation of ELT/ETL frameworks using SQL, Python, Azure Data Factory, Databricks, and Azure Synapse Analytics.
  • Establish pipeline reliability standards—monitoring, alerting, error handling, and recovery protocols—and ensure adherence across the team.

Data Architecture & Platform Evolution

  • Drive the design of scalable data models supporting dimensional warehousing, data lake architectures on Azure.
  • Contribute to architectural decisions on data storage, partitioning, compute optimization, and consumption layer design.
  • Lead migrations from legacy data solutions to modern cloud-native platforms, managing risk and business continuity throughout.

AI & Analytics Enablement

  • Design and deliver feature pipelines and data preparation frameworks that support machine learning model development and deployment.
  • Partner with Data Scientists to translate model requirements into production-grade data assets and feature stores.
  • Collaborate with Analytics Engineers to ensure data models are optimized for analytical consumption and reporting performance.

Data Quality & Governance Leadership

  • Define and implement data quality frameworks—validation rules, SLAs, anomaly detection, and automated testing for pipeline outputs.
  • Lead data governance initiatives including metadata management, lineage tracking, data cataloging (Microsoft Purview), and access control.
  • Ensure platform compliance with HIPAA, NYBCe data policies, and applicable regulatory requirements.

Mentorship & Technical Leadership

  • Mentor Data Engineers—providing code reviews, technical guidance, and architectural feedback that elevates team capability.
  • Contribute to DAPI's engineering standards, reusable frameworks, and technical documentation.
  • Participate in Agile ceremonies and model strong engineering discipline—clear DevOps hygiene, sprint commitment, and delivery accountability.
Qualifications

 

Education:

  • Bachelor’s degree in computer science, Data Science, Information Technology, or a related quantitative field.

Experience:

  • 6+ years of progressive experience in data engineering with demonstrated ownership of complex, production-grade data platforms.
  • Expert-level SQL (query optimization, indexing strategy, execution plans) and Python (PySpark, pipeline frameworks, testing).
  • Deep hands-on experience with Azure data services: Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Azure Data Lake Storage.
  • Proven experience designing dimensional data models and data lake architecture at enterprise scale.
  • Experience building data pipelines that directly support machine learning feature engineering and model serving.
  • Strong background in data quality engineering—automated validation, SLA enforcement, and lineage tracking.
  • Experience with relational databases (SQL Server, Oracle) and migration from legacy to cloud-native platforms.

 

Certifications & Licenses:

No certifications are required. The following are considered favorable:

  • Microsoft Certified: Azure Data Engineer Associate
  • Databricks Certified Associate Developer for Apache Spark

Knowledge:

  • Advanced SQL and Python for enterprise-scale data engineering—optimization, testing, and framework design.
  • Azure data platform architecture in depth—ADF, Databricks, Synapse, ADLS, and their integration patterns.
  • Modern data platform paradigms—data lake, medallion architecture, data mesh concepts, and consumption layer design.
  • Machine learning pipeline requirements—feature engineering, training data preparation, and model data dependencies.
  • Data governance frameworks—metadata management, lineage, cataloging, access control, and regulatory compliance (HIPAA).
  • Agile engineering practices—sprint delivery, DevOps hygiene, CI/CD for data pipelines, and technical documentation standards.

Skills:

  • Cultural competency and the ability to communicate effectively in a culturally sensitive manner with both individuals and groups from diverse backgrounds.
  • Architect and deliver complex, production-grade data pipelines that meet enterprise reliability and performance standards.
  • Design scalable data models and platform structures that serve analytics, reporting, and AI consumption patterns simultaneously.
  • Lead data quality engineering—automated testing, validation frameworks, SLA monitoring, and incident response.
  • Mentor and elevate Data Engineers through code review, architectural feedback, and knowledge transfer.
  • Translate product and analytical requirements into sound engineering designs and delivery plans.

Abilities:

  • Communicate complex data concepts clearly to both technical and non-technical stakeholders.
  • Work independently and manage competing priorities in a lean, fast-paced team environment.
  • Embrace accountability and take ownership of deliverables end-to-end.
  • Incorporate feedback constructively and seek continuous improvement.

Any combination of education, training, and experience equivalent to the requirements above that has supplied the necessary knowledge, skills, and experience to perform the essential functions of the job.

 

For applicants who will perform this position in Westchester County, the proposed annual salary is $127,000.00 to $137,000.00 per year.  For applicants who will perform this position outside of New York City or Westchester County, salary will reflect local market rates and be commensurate with the applicant’s skills, job-related knowledge, and experience.

 

Unless otherwise specified, all posted opportunities are located in the New York or Greater Tri-State office locations.

About New York Blood Center

The New York Blood Center (NYBC) is a nonprofit community, regional blood center that provides blood and transfusion-related medical services to patients in New York, New Jersey, Connecticut, Pennsylvania, Delaware, and Maryland. NYBC also conducts research in hematology, blood banking, transfusion medicine, and cellular therapies. The center was founded in 1964 and is headquartered in New York City. NYBC operates 13 blood donation centers and has partnerships with over 40 hospitals in the Northeastern United States. The center collects approximately 2,000 units of blood each day and serves over 200 hospitals in the region.
Learn more about New York Blood Center
Size
2,500 employees
Industry
Founded
1964

Similar Jobs

More Jobs at New York Blood Center

More Enterprise Technology Jobs

Find similar Senior Data Engineer jobs: