Full Job Description
Seneca Federal Health is seeking a Informatics Lead/Cloud Data Engineer in Atlanta, GA.
Responsibilities include, but are not limited to:
1CDP Platform Optimization: Serve as the primary technical architect responsible for the operation, continuous tuning, and platform optimization of the One CDC Data Platform (1CDP) environment supporting NHM&E tracking systems.
Full-Lifecycle Pipeline Engineering: Direct full-lifecycle data operations, from automated cloud ingestion and feature engineering to predictive model deployment and automated data factory pipeline tracking (Azure Data Factory, Databricks).
Full-Lifecycle Data Architecture: Design and implement full lifecycle data engineering work-spanning secure multi-source ingestion, programmatic feature engineering, automated validation, and continuous platform pipeline monitoring.
Modern Analytics SaaS Deployment: Operationalize advanced analytics SaaS environments and scalable cloud workflows that transform scattered surveillance, research, and program tracking elements into strategic, unified data insights.
Automated Quality Enforcement: Architect real-time data completeness and data validation check blocks inside cloud pipelines, eliminating anomalies, missing fields, or structural non-compliance early at the data lake boundary.
Predictive Analytics Operationalization: Collaborate with biostatisticians and program evaluation staff to scale and deploy predictive modeling scripts into live production environments, ensuring scalable pipeline integration.
SHARE IT Act Compliance Lead: Take ultimate accountability for the implementation of the Federal SHARE IT Act (Public Law 118-187). Ensure all newly developed software code is registered in CDC repositories, complete with machine-readable README.md metadata, and discoverable for federal reuse.
EPLC Stage-Gate Technical Alignment: Develop and update project architecture documentation, system catalogs, and data flow sheets to satisfy HHS Enterprise Architecture audits and CDC Enterprise Performance Life Cycle gate reviews.
IAM Control & Role-Based Access: Incorporate zero-trust access profiles across all database networks and cloud environments, tracking provisioning rules through the secure SAMS portal and executing automated 4-hour deactivation closures upon staff reassignments.
Secure DevSecOps Repository Governance: Enforce uniform secure coding principles (OWASP standards) across all development paths. Manage the CI/CD pipeline infrastructure (GitHub Actions, Jenkins, Azure DevOps), mandating automated vulnerability scans and rigorous peer-review controls before deployment resets.
Basic Qualifications:
Master's degree in Health Informatics, Computer Science, Data Engineering, Software Engineering, or a related computational field required.
Ability to obtain and/or maintain an active favorable Tier I background check
8+ years of progressive data architecture experience designing cloud systems, big data infrastructure, and enterprise ETL pipelines for federal health programs.
5+ years of production experience leading modernization initiatives moving legacy federal data networks into cloud cloud configurations (Azure Government, AWS GovCloud).
Expert level command of Databricks lakehouse frameworks, Delta Lake engine optimization, pySpark clustering, and complex SQL relational querying.
Direct experience implementing the federal SHARE IT Act, Capital Planning and Investment Control (CPIC) models, and automated metadata systems.
Microsoft Certified: Azure Data Engineer Associate or Azure Solutions Architect Expert required.
Databricks Certified Data Engineer Professional or Databricks Certified Professional Data Scientist required.
AWS Certified Data Specialist or PMP Certification preferred.