Master's degree in Biostatistics, Statistics, Mathematics, or similar quantitative field
7+ years of SAS macro development and statistical programming in federal health or clinical trials
5+ years of data manipulation and multi-stage sampling calculation expertise using SAS PROC SURVEY
Experience with CDC data stores and architectures, particularly MMP and NHBS
Direct experience in dual-reviewer validation with GitLab/GitHub version control systems
SAS Certified Advanced Programmer for SAS 9 or SAS Viya required
Preferred: SAS Certified Statistical Business Analyst or Advanced Analytics Professional
Responsibilities
Lead the development and maintenance of a Master SAS Macro Library
Drive migration of legacy SAS frameworks to cloud environments, utilizing Python and other modern tools
Conduct parallel runs to evaluate discrepancies between legacy and new cloud pipelines
Optimize automated data cleaning processes applying complex logic rules
Calibrate complex survey weights and execute advanced sampling methodologies
Embed automated data suppression rules within SAS and Python code to ensure confidentiality
Validate data precision through dual-reviewer protocols for urgent ad hoc queries
Extract schema attributes from datasets and automate cataloging within the CDC Data Platform
Ensure comprehensive documentation for compliance with federal regulations
Benefits
Comprehensive health plans
Retirement savings options
Flexible work environment
Professional development opportunities
Collaborative team culture
Access to cutting-edge technology in data science
Supportive leadership and mentorship programs
Full Job Description
Seneca Federal Health is seeking a Senior SAS Programmer/Biostatistician in Atlanta, GA.
Responsibilities include, but are not limited to:
Master SAS Macro Library Control: Serve as the principal architecture lead for the development, maintenance, and structural optimization of DHP's Master SAS Macro Library, applying semantic versioning via GitLab/GitHub.
Data modernizations & Pipeline Lead: Drive the programmatic migration of legacy SAS macro frameworks into optimized cloud environments (Python, R, pySpark, Databricks), acting as a primary engine for division-wide data modernization goals.
Parallel Discrepancy Evaluation: Design, deploy, and execute parallel production runs between legacy code systems and newly engineered cloud pipelines, running detailed equivalence tests and managing discrepancy remediation down to zero tolerance variance.
MMP Rule Engine Execution: Direct and optimize automated cleaning suites applying 400+ distinct field-level logic rules across complex clinical and behavioral surveillance domains to generate analysis-ready files.
Complex Survey Weights Calibration: Calculate Respondent-Driven Sampling (RDS) and venue-based complex weights for NHBS cycles, running advanced sample calibrations via PROC SURVEYFREQ, SURVEYMEANS, and SURVEYREG workflows.
Section 308(d) Table Suppression: Programmatically embed automated small-cell suppression rules and data-masking constraints directly inside SAS and Python reporting code blocks, mitigating confidentiality disclosure prior to clearance routing.
Dual-Reviewer Equivalence Testing: Operate rigid dual-reviewer validation protocols for all high-priority ad hoc data queries from leadership or Congress, certifying data precision and traceability under 5 business day turnarounds.
1CDP Metadata Harvesting: Coordinate the programmatic extraction of data dictionaries and schema attributes from frozen analytical datasets, automating catalog integration with the One CDC Data Platform (1CDP).
Audit-Ready Technical Documentation: Generate comprehensive code annotations, lineage maps, change tracking tables, and reproducibility records to ensure full compliance with federal data integrity and code harmonization rules.
Basic Qualifications:
Master's degree in Biostatistics, Statistics, Mathematics, or a highly quantitative mathematical discipline required.
Ability to obtain and/or maintain an active favorable Tier I background check
7+ years of continuous SAS macro software development and structural statistical programming experience within federal health or clinical trial settings.
5+ years of hands-on data manipulation and multi-stage sampling weight calculation experience utilizing specialized SAS PROC SURVEY tools.
Direct technical familiarity working with massive historical CDC data stores, including MMP data architectures, NHBS repositories, and eHARS files.
Proven experience executing dual-reviewer validation architectures using formal version control systems (GitLab/GitHub).
SAS Certified Advanced Programmer for SAS 9 or SAS Viya required
SAS Certified Statistical Business Analyst or Advanced Analytics Professional preferred.