Informatics Lead/Cloud Data Engineer

Seneca Holdings

$100K — $130K *
Healthcare
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Master's degree in Health Informatics, Computer Science, Data Engineering, Software Engineering, or related field.
  • 8+ years of experience in designing cloud systems and big data infrastructures for federal health programs.
  • 5+ years of experience modernizing legacy federal data networks to cloud configurations.
  • Expertise in Databricks, Delta Lake, pySpark, and complex SQL querying.
  • Experience with implementation of federal SHARE IT Act and automated metadata systems.
  • Microsoft Certified: Azure Data Engineer Associate or Azure Solutions Architect Expert required.
  • Databricks Certified Data Engineer Professional or Databricks Certified Professional Data Scientist required.

Responsibilities

  • Optimize the One CDC Data Platform (1CDP) environment for NHM&E tracking systems.
  • Direct full-lifecycle data operations including cloud ingestion and predictive model deployment.
  • Design and implement secure multi-source data engineering architecture and validation processes.
  • Operationalize advanced analytics SaaS environments for unified data insights.
  • Architect real-time data validation controls within cloud pipelines to ensure data integrity.
  • Collaborate to scale and deploy predictive modeling scripts into production environments.
  • Lead the implementation of the SHARE IT Act for federal software registration and compliance.

Benefits

  • Flexible work environment promoting a balanced lifestyle.
  • Opportunities for professional development and certifications.
  • Contributions to impactful federal health initiatives.
  • Collaborative team culture focused on innovation and excellence.
  • Access to cutting-edge technology and resources in data engineering.
Full Job Description
Seneca Federal Health is seeking a Informatics Lead/Cloud Data Engineer in Atlanta, GA. Responsibilities include, but are not limited to: 1CDP Platform Optimization: Serve as the primary technical architect responsible for the operation, continuous tuning, and platform optimization of the One CDC Data Platform (1CDP) environment supporting NHM&E tracking systems. Full-Lifecycle Pipeline Engineering: Direct full-lifecycle data operations, from automated cloud ingestion and feature engineering to predictive model deployment and automated data factory pipeline tracking (Azure Data Factory, Databricks). Full-Lifecycle Data Architecture: Design and implement full lifecycle data engineering work-spanning secure multi-source ingestion, programmatic feature engineering, automated validation, and continuous platform pipeline monitoring. Modern Analytics SaaS Deployment: Operationalize advanced analytics SaaS environments and scalable cloud workflows that transform scattered surveillance, research, and program tracking elements into strategic, unified data insights. Automated Quality Enforcement: Architect real-time data completeness and data validation check blocks inside cloud pipelines, eliminating anomalies, missing fields, or structural non-compliance early at the data lake boundary. Predictive Analytics Operationalization: Collaborate with biostatisticians and program evaluation staff to scale and deploy predictive modeling scripts into live production environments, ensuring scalable pipeline integration. SHARE IT Act Compliance Lead: Take ultimate accountability for the implementation of the Federal SHARE IT Act (Public Law 118-187). Ensure all newly developed software code is registered in CDC repositories, complete with machine-readable README.md metadata, and discoverable for federal reuse. EPLC Stage-Gate Technical Alignment: Develop and update project architecture documentation, system catalogs, and data flow sheets to satisfy HHS Enterprise Architecture audits and CDC Enterprise Performance Life Cycle gate reviews. IAM Control & Role-Based Access: Incorporate zero-trust access profiles across all database networks and cloud environments, tracking provisioning rules through the secure SAMS portal and executing automated 4-hour deactivation closures upon staff reassignments. Secure DevSecOps Repository Governance: Enforce uniform secure coding principles (OWASP standards) across all development paths. Manage the CI/CD pipeline infrastructure (GitHub Actions, Jenkins, Azure DevOps), mandating automated vulnerability scans and rigorous peer-review controls before deployment resets. Basic Qualifications: Master's degree in Health Informatics, Computer Science, Data Engineering, Software Engineering, or a related computational field required. Ability to obtain and/or maintain an active favorable Tier I background check 8+ years of progressive data architecture experience designing cloud systems, big data infrastructure, and enterprise ETL pipelines for federal health programs. 5+ years of production experience leading modernization initiatives moving legacy federal data networks into cloud cloud configurations (Azure Government, AWS GovCloud). Expert level command of Databricks lakehouse frameworks, Delta Lake engine optimization, pySpark clustering, and complex SQL relational querying. Direct experience implementing the federal SHARE IT Act, Capital Planning and Investment Control (CPIC) models, and automated metadata systems. Microsoft Certified: Azure Data Engineer Associate or Azure Solutions Architect Expert required. Databricks Certified Data Engineer Professional or Databricks Certified Professional Data Scientist required. AWS Certified Data Specialist or PMP Certification preferred.

Similar Jobs

More Jobs at Seneca Holdings

More Healthcare Jobs

Find similar Informatics Lead/Cloud Data Engineer jobs: