The Department of Surgery is seeking a detail-oriented and highly motivated Database Analyst I to join our team. The Database Analyst I will play a critical role in advancing our computational medicine research by architecting efficient data pipelines and managing, cleaning, and preparing complex, large-scale clinical datasets. Working with both Protected Health Information (PHI) and de-identified (non-PHI) data, this individual will ensure that rigorous data governance and security protocols are met. This position is essential for building a robust, secure data infrastructure that maximizes computational efficiency and supports high-performance machine learning, AI-driven predictive modeling, and advanced clinical analytics.
Job Responsibilities:
- Data Consolidation & Management: Clean, manage, and consolidate both sensitive PHI and non-PHI datasets from diverse clinical sources (e.g., continuous physiological monitors, electronic medical records). Ensure strict adherence to institutional and federal data privacy regulations (e.g., HIPAA) while maintaining high data quality, accuracy, and structural integrity.
- Dataset Harmonization: Map and transform disparate data types into unified formats. Incorporate harmonized data standards, specifically utilizing the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) and the Medical Event Data Standard (MEDS), to build automated, reproducible data processing pipelines that safely handle varying levels of data sensitivity.
- Compute Architecture & Efficiency: Architect data workflows and optimize the lab's computational infrastructure to support high-throughput processing. Improve the efficiency of compute environments by optimizing resource allocation (including CPU/GPU utilization), parallelizing data pipelines, and resolving processing bottlenecks to accelerate large-scale machine learning tasks.
- Scalable Infrastructure & Tooling: Structure and index large-scale harmonized datasets for highly efficient querying within distributed computing environments. Develop, test, and maintain robust codebases (e.g., Python, SQL) for ongoing analytical tasks, implement version control, and comprehensively document architectural decisions and data extraction procedures.
Minimum Requirements:
Work requires a bachelor's degree in mathematics, computer science, or a computer-related field, or equivalent coursework or technical training.
This position is Onsite. The work is performed on-site or at a designated assignment location.
Duke Surgery is #1 in NIH Grant Funding. We are an internationally recognized leader in laboratory and clinical investigation. Duke Surgery's faculty and trainees are consistently among the top-funded researchers nationwide. This position is 100% grant funded. Beyond the engaging work, you'll also benefit from Duke's competitive benefits package, including health insurance plans, generous paid time off, retirement programs with employer contributions, tuition assistance for employees and their children, and more. Join our award-winning team and be part of an inclusive culture that values excellence, innovation, and discovery.
Anticipated Pay Range: Duke University provides an annual base salary range for this position as USD $59,829.00 to USD $104,550.00. Duke University considers factors such as (but not limited to) scope and responsibilities of the position; candidate's work experience, education/training, and key skills; internal peer equity; as well as market and organizational considerations when extending an offer.
Your total compensation goes beyond the dollars on your paycheck. Duke provides comprehensive and competitive medical and dental care programs, generous retirement benefits, and a wide array of family-friendly and cultural programs to eligible team members. Learn more at: https://hr.duke.edu/benefits/