Collective Health

Lead Data Engineer

Collective Health: $168K - $210K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years of data engineering experience in fast-paced, data-driven environments.
  • Expertise in building scalable ETL pipelines with Spark (PySpark or Scala) and SQL.
  • Deep understanding of data architecture, schema design, and dimensional modeling for analytics and machine learning.
  • Proficiency in distributed systems such as Spark, Databricks, or Snowflake.
  • Experience with event-driven architectures and streaming platforms like Kafka or Kinesis.
  • Excellent communication skills to collaborate cross-functionally and convey technical concepts in a business context.
  • Mentorship experience guiding engineers and promoting an inclusive team culture.
  • Familiarity with data privacy and compliance in healthcare or regulated industries.

Responsibilities

  • Architect scalable data solutions by designing, developing, and optimizing data pipelines.
  • Advance data modeling and architecture to support analytical and machine-learning requirements.
  • Enhance performance and reliability of data processing, ensuring quality and governance.
  • Drive collaboration across Product, Engineering, Data Science, and Analytics teams for impactful data solutions.
  • Mentor junior and mid-level engineers, conducting code reviews and establishing best practices.
  • Implement data governance and security measures for sensitive healthcare data.
  • Influence data strategy with insights on infrastructure, technologies, and process improvements.

Benefits

  • Mission-driven culture valuing innovation and collaboration in healthcare.
  • Opportunities for impactful projects that shape the organization.
  • Professional development through internal mobility and mentorship programs.
  • Flexible work arrangements supporting work-life balance.
Full Job Description
At Collective Health, we're transforming how employers and their people engage with their health benefits by seamlessly integrating cutting-edge technology, compassionate service, and world-class user experience design.

We deliver a connected healthcare experience for over 600,000 members and 70+ companies nationwide who want the best for their employees. Our data engineering team is pivotal in this mission, constructing and managing scalable, high-performance data pipelines and architectures that drive analytics, operational workflows, and machine learning applications.

As a Lead Data Engineer, you will drive the development of robust, scalable, and efficient data solutions, collaborating closely with cross-functional teams. You will provide thought leadership on data architecture, mentor junior engineers, and optimize our data ecosystem for performance and reliability.
What you'll do:
  • Architect Scalable Data Solutions - Design, develop, and optimize large-scale data pipelines using Spark (PySpark, Scala), Databricks, and distributed data processing frameworks.
  • Advance Data Modeling & Architecture - Lead the design and evolution of data models to support analytical, operational, and machine-learning requirements.
  • Enhance Data Performance & Reliability - Improve data processing performance, scalability, and reliability, while ensuring data quality and governance.
  • Drive Cross-Functional Collaboration - Partner with Product, Engineering, Data Science, and Analytics teams to deliver high-impact data solutions that generate actionable business and clinical insights.
  • Mentor & Provide Technical Leadership - Guide junior and mid-level engineers, conduct code reviews, and establish best practices in data engineering.
  • Ensure Data Governance & Security - Implement robust security, privacy, and compliance measures for sensitive healthcare data, ensuring adherence to industry regulations.
  • Influence Data Strategy - Provide input on data infrastructure decisions, emerging technologies, and process improvements.
To be successful in this role, you'll need:
  • 8+ years of data engineering experience in fast-paced, data-driven environments.
  • Expertise in building scalable ETL pipelines with Spark (PySpark or Scala) and SQL.
  • Deep understanding of data architecture, schema design, and dimensional modeling for analytics and machine learning.
  • Proficiency in distributed systems such as Spark, Databricks, or Snowflake.
  • Experience with event-driven architectures and streaming platforms like Kafka or Kinesis.
  • Excellent communication skills - ability to collaborate cross-functionally and translate complex technical concepts into business impact.
  • Mentorship experience - a track record of guiding engineers and fostering a collaborative, inclusive team culture.
  • Security-first mindset - familiarity with data privacy, encryption, and compliance in healthcare or other regulated industries is a plus.
Pay Transparency Statement

This is a hybrid position based out of one of our offices: San Francisco, CA; Plano, TX; or Lehi, UT. Hybrid employees are expected to be in the office two days per week.

The actual pay rate offered within the range will depend on factors including geographic location, qualifications, experience, and internal equity. In addition to the salary, you will be eligible for stock options and benefits like health insurance, 401k, and paid time off. Learn more about our benefits at https://jobs.collectivehealth.com/benefits/.

San Francisco, CA Pay Range

$168,000-$210,000 USD

Lehi, UT Pay Range

$134,500-$168,000 USD

Plano, TX Pay Range

$147,800-$185,500 USD

Why Join Us?
  • Mission-driven culture that values innovation, collaboration, and a commitment to excellence in healthcare
  • Impactful projects that shape the future of our organization
  • Opportunities for professional development through internal mobility, mentorship programs, and courses tailored to your interests
  • Flexible work arrangements and a supportive work-life balance

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Collective Health is committed to providing support to candidates who require reasonable accommodation during the interview process. If you need assistance, please contact [email protected].
Privacy Notice

For more information about why we need your data and how we use it, please see our privacy policy: https://collectivehealth.com/privacy-policy/.

About Collective Health

Collective Health is a technology company that provides a cloud-based platform for self-insured employers to manage their employee health benefits. The platform includes tools for plan design, enrollment, claims processing, and member engagement. Collective Health was founded in 2013 and is headquartered in San Francisco, California. The company has raised over $400 million in funding and has partnerships with several major insurance carriers, including Aetna, Cigna, and Anthem.
Size: 500 employees
Founded: 2013
