OverviewWe seek a Mid-level Data Scientist to support a client in
hybrid remote / Reston, VA.
ResponsibilitiesWhat you get to do everyday:- Support the Advanced Data Consortium (ADC), a member-driven program that enables participating organizations to share, analyze, and act on complex data sets across a secure, cloud-hosted environment.
- Contribute to a SAFe/Scrum delivery team, participating in sprint planning, standups, reviews, and retrospectives.
- Analyze large and complex structured and unstructured data sets to identify trends, patterns, and anomalies that support member decision-making.
- Build and maintain automated data collection and ingestion pipelines using AWS GovCloud services.
- Develop and refine predictive models and machine learning algorithms, including ensemble approaches, to address Key Intelligence Questions (KIQs) posed by ADC members.
- Design and deliver data visualizations that communicate findings clearly to both technical teams and non-technical stakeholders.
- Support the ADC member directory and opt-in/opt-out data governance framework, ensuring data is surfaced appropriately based on member permissions.
- Partner with the ADC Portal engineering team to process member data samples and match against customer requirements, in coordination with the program's vendor management team.
- Propose data-driven solutions and analytic strategies in response to evolving member requirements.
- Assist in developing and documenting data suitability processes, tracking analytic requirements, and conducting follow-up engagements with customer and vendor on technical access and quality issues.
QualificationsRequired Qualifications: - Bachelor's degree in Data Science, Computer Science, Mathematics, Statistics, Engineering, or a related technical discipline; equivalent combination of education, technical training, or military experience will be considered.
- 3 to 5 years of experience in a data science, data analytics, data engineering, or related analytical role.
- Experience working with or supporting U.S. government programs, federal agencies, or the Intelligence Community is strongly preferred.
- Proficiency in Python for data analysis, automation, and pipeline development.
- Proficiency in SQL for querying, joining and transforming data in relational stores such as PostreSQL.
- Experience building and managing automated data pipelines and collection workflows on AWS GovCloud (e.g., Lambda, Glue, S3, Step Functions).
- Ability to preprocess, clean, and prepare both structured and unstructured data sets for analysis.
- Experience developing machine learning models and applying data visualization techniques to communicate analytical results.
- Familiarity with containerized environments and microservices architecture.
- Working knowledge of access management and data governance principles, including role-based permissions and secure data sharing practices.
- Strong written and verbal communication skills, with the ability to present findings to varied audiences.
- Ability to work independently and manage multiple priorities in a fast-paced program environment.
Desired Qualifications: - Prior experience supporting Intelligence Community programs or military intelligence organizations.
- Familiarity with consortium, membership, or directory-based data platforms and opt-in/opt-out data governance models.
- Experience with AWS SageMaker or similar cloud-native ML toolsets within a GovCloud environment.
- Experience building data pipelines using Apache Spark, Kafka, or Apache NiFi.
- Knowledge of IAM policies, S3 bucket security, VPN, and SSH access controls.
- Experience with data visualization platforms such as Tableau, Power BI, or custom web-based dashboards.
What you can expect from us