AI Data Engineer

DEEPREC.AI

US-Anywhere | Remote in United States
Healthcare
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or related field
  • Experience with large-scale or big data systems, particularly Apache Spark
  • Proficient in programming languages such as Python, Scala, or Java
  • Experienced in ETL pipelines, data warehousing, and data modelling
  • Familiarity with cloud platforms (AWS, GCP, or Azure) and associated tools
  • Strong problem-solving skills

Responsibilities

  • Collaborate with Data Scientists and ML Engineers to clarify data requirements for models
  • Develop and maintain scalable data pipelines for healthcare datasets
  • Ensure high data quality through cleaning, validation, and monitoring
  • Design data structures and schemas to optimize model training
  • Source and manage new data while adhering to healthcare regulations

Benefits

  • Remote work flexibility
  • Opportunity to work with cutting-edge HealthTech AI projects
  • Engagement in innovative data solutions for healthcare
  • Collaboration with a team of skilled data professionals
  • Focus on compliance with important regulations (e.g., HIPAA)

Full Job Description

AI Data Engineer - HealthTech AI

Up to $180,000 | U.S. (Remote) | Full-Time

This is a remote Data Engineering role focused on building and maintaining scalable pipelines that ingest, clean, and structure large, complex healthcare datasets.

What You'll Do
  • Work with Data Scientists and ML Engineers to define data needs for LLM and ML models.
  • Build and maintain scalable data pipelines for large healthcare datasets.
  • Ensure data quality through cleaning, validation, and monitoring.
  • Design efficient data structures and schemas for model training and use.
  • Source new data while ensuring compliance with healthcare regulations (e.g., HIPAA).

Requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Experience as a Data Engineer working with large-scale or big data systems such as Apache Spark.
  • Strong programming skills in Python, Scala, or Java.
  • Experience with ETL pipelines, data warehousing, and data modelling.
  • Familiarity with cloud platforms (AWS, GCP, or Azure) and tools like Apache Spark.
  • Strong problem-solving skills.

Nice to Have
  • Master's degree in Computer Science, Engineering, Data Science, or a related field.
  • Experience working with healthcare data and standards such as FHIR or HL7.
  • Familiarity with machine learning concepts and LLM fine-tuning workflows.
  • Experience using data orchestration tools such as Apache Airflow.
