HeadLight

Data Engineer

HeadLight$150K — $165K *
Healthcare
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years of data engineering experience in a production environment
  • Strong SQL skills including complex procedures and optimization
  • Proficient in Python, especially for ETL tasks
  • Hands-on experience with data warehouse and ETL workflows
  • Experience in building robust data pipelines with error handling
  • Ability to resolve production failures effectively under pressure
  • Familiarity with Azure Blob Storage for ETL staging

Responsibilities

  • Own and enhance the Python-based ETL orchestration engine
  • Build and maintain data pipelines for operational reporting
  • Integrate various data sources into the data warehouse
  • Optimize SQL workloads for high-frequency queries
  • Support and extend custom AI automation agents
  • Ensure HIPAA compliance in data handling processes
  • Write and maintain version-controlled code on GitHub

Benefits

  • Medical, Dental, and Vision insurance effective soon after employment
  • Paid Vacation, Sick, and Holiday time
  • Employee Assistance Program for personal and professional support
  • 401(k) plan with company contributions
  • Professional development and training opportunities
  • Collaborative and supportive work culture
  • Impactful role in improving patient care and healthcare processes
Full Job Description

Headlight Health is a fast-growing behavioral healthcare company providing therapy and psychiatry services across multiple states. Our BI and data engineering function sits at the heart of operations: from the real-time provider scheduling engine that matches patients to clinicians, to AI-powered real time agents, to the financial and clinical reporting that drives every major business decision.

We are looking for a Data Engineer to join a lean, high-output team. You will work directly alongside the VP of Analytics and CTO on production systems that run 24/7 in a HIPAA-regulated environment. This is a hands-on engineering role - you will own pipelines, write stored procedures, debug integrations, and ship improvements every week. You will develop AI-powered automation as we continuously improve our data and operations activities.

Tech Stack:

  • Primary Languages: Python (pandas, pyodbc, requests) and T-SQL (Azure SQL Server stored procedures, 300+ version-controlled scripts)


  • Data Warehouse: Azure SQL Server - primary production warehouse with 200+ tables


  • Cloud Infrastructure: Azure


  • Custom python-based ETL system.


  • CRM: HubSpot - deep integration across contacts, companies, deals, tickets, referrals, and referring physician hierarchies


  • EHR: NextGen - raw feed ingestion for patients, appointments, pharmacy, and clinical transactions


Our Pillars

  • Make things easier.
  • Forge genuine connections.
  • Elevate the standard.


Roles and Responsibilities

  • Own and extend the Python-based ETL job orchestration engine: add new job types, monitor execution, and resolve production failures


  • Build and maintain data pipelines powering operational reporting for scheduling, finance, credentialing, and clinical operations


  • Integrate data sources into the warehouse - EHR (NextGen), CRM (HubSpot), HR platforms (ADP, Lever), credentialing (Modio), call center (Five9), and other third-party APIs


  • Optimize high-frequency SQL workloads


  • Support and extend custom AI agents


  • Maintain HIPAA compliance across all data handling: enforce access controls, audit logging, and PHI segregation in pipelines and reporting layers


  • Write and maintain version-controlled code in the GitHub repository


Required Qualifications

  • 3+ years of data engineering experience in a production environment


  • Strong SQL skills: complex stored procedures, CTEs, window functions, temp tables, index optimization, execution plan analysis


  • Python proficiency for ETL workloads: pandas, pyodbc or SQLAlchemy, REST API consumption, file handling, scheduling


  • Hands-on experience data warehouse/ETL workloads


  • Experience building and maintaining scheduled data pipelines with robust error handling, retry logic, and logging


  • Ability to debug production failures independently, communicate status clearly, and resolve issues quickly under pressure


  • Comfort working in a HIPAA-regulated environment and handling PHI with appropriate care and controls


  • Familiarity with Azure Blob Storage or equivalent object storage for ETL staging workflows


Preferred Qualifications

  • HubSpot CRM data integration experience (Contacts, Deals, Tickets, Company hierarchy, API rate limit handling)


  • Healthcare data experience: EHR integrations (NextGen, Epic, or Cerner), credentialing systems, claims data, HIPAA BAA contexts


  • Experience building or maintaining AI/LLM-powered applications (Claude, OpenAI) in a production context


  • Familiarity with voice or telephony data pipelines, conversational AI systems, or patient intake automation


  • Experience with complex provider or resource scheduling systems and the data modeling they require


  • Exposure to CAC, LTV, or patient funnel analytics in a B2C healthcare or SaaS context


  • Strong communication skills - this team works directly with clinical ops, finance, and marketing stakeholders


Benefits

  • W2 role with competitive compensation
  • Medical, Dental and Vision on the first of the month after employment
  • Paid Vacation, Sick, and Holiday time
  • Employee Assistance Program (EAP) provides confidential counseling services, resources, and support to help you navigate personal or professional challenges.
  • 401(k) plan with company contribution
  • Opportunity to work in a cutting-edge healthcare technology environment
  • Professional development opportunities and training
  • Collaborative and supportive work culture
  • Impactful role contributing to the enhancement of patient care and healthcare processes


$150,000 - $165,000 a year

Not meeting all the requirements? Research indicates that women, communities of color, and historically underrepresented individuals are often hesitant to apply for jobs unless they meet every qualification. We are committed to cultivating a diverse, inclusive, and genuine workplace. If you're enthusiastic about this position but your previous experience doesn't precisely match every qualification listed, we enthusiastically encourage you to submit your application. You could be the ideal candidate for this role or others!

Headlight is committed to the principles of diversity, equity, and inclusiveness and seeks to create a working environment reflective of this commitment. We seek to provide a diverse clinician base to support the diversity of our clients. Headlight supports and respects diversity of people, culture, and ideas throughout our organization. Headlight thrives to be a welcoming, diverse and discrimination- and harassment-free workplace.

By applying for this position, you consent to receive future communications from Headlight via email or text regarding this application and related employment opportunities. You may opt-out at anytime by contacting us directly.

Job Postings on Indeed and other job boards may post with total compensation (base + bonus). For the exact base salary range please check our website or our job-site

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Similar Jobs

More Jobs at HeadLight

More Healthcare Jobs

Find similar Data Engineer jobs: