Job Summary
A position for a Bioinformatics Data Integration Analyst is available in the Department of Biomedical Informatics at The Children’s Hospital of Philadelphia to work with Dr. Deanne Taylor’s research group. This permanent position supports a project to integrate and uniformly process large-scale NIH biological datasets, providing a database and “crosswalk” between several disparate data sources. The successful candidate will have strong programming skills in Python and demonstrated experience in building complex databases. Basic familiarity with common biological ontologies and experience working with biological datasets are a plus, as is experience with Neo4j graph databases. The successful candidate will have good communication skills and will be comfortable with project development in a supportive and interactive team environment.
Job Responsibilities
- Collaborate with biomedical researchers to identify and analyze scientific problems that require integration of disparate, highly dimensional data types such as clinical and genomic data.
- Quickly acquire biomedical domain knowledge in order to understand requirements for data integration, optimizing data resources and applications to meet scientific needs.
- Create requirements for complex data integration and application development projects, and translate requirements into deliverables.
- Develop and implement innovative data models that represent complex biomedical data types in usable and accessible schemas.
- Write extract, transform, and load (ETL) procedures that combine and recombine biomedical data into new, more useful formats.
- Build and optimize scientific data management, data discovery, reporting, and analysis applications using a combination of off-the-shelf and custom tools.
- Manage small projects and subprojects within larger initiatives, identifying, tracking, and reporting on tasks and deliverables against project timelines.
Required Licenses, Certifications, Registrations
Required Education and Experience
Required Education: Bachelor's degree in Computer Science, Information Science, Informatics, Biomedical Engineering, Biological Science, or a related discipline.
Required Experience: At least three (3) years of experience in database development, database administration, data management, or a related discipline, with progressively more complex projects.
The successful candidate will have strong programming skills in Python and demonstrated experience in building complex databases.
Basic familiarity with common biological ontologies and experience working with biological datasets are a plus, as is experience with Neo4j graph databases.
The successful candidate will have good communication skills and will be comfortable with project development in a supportive and interactive team environment.
Preferred Education, Experience & Cert/Lic
Preferred Education: Master's degree in Computer Science, Information Science, Informatics, Biomedical Engineering, Biological Science, or a related discipline.
Preferred Experience:
- Four (4) or more years of experience in database development, database administration, data management, or a related discipline, with progressively more complex projects, preferably in a biomedical science or healthcare environment.
- Experience with source code management, continuous integration, containerization, and automated testing tools and processes is preferred.
- Project management experience preferred.
- Previous experience in data modeling, ETL, and applications of highly dimensional data types, such as those derived from genomics and observational clinical or human subjects research data, preferred.
Additional Technical Requirements
- Working knowledge of SQL required.
- Working knowledge of relational database management systems such as PostgreSQL, MySQL, or Oracle required.
- Working knowledge of one or more of the following preferred: Python/Django, JavaScript/HTML (as used in Single Page Applications), Java, Scala.
- Working knowledge of biomedical or healthcare data standards such as HL7, ICD, CPT, SNOMED preferred.
- Must exhibit excellent oral, presentation, and written communication skills.