Overview
Data Scientist
Reston, VA (hybrid/remote) - this role requires that you are onsite 2 days a week
Active TS/SCI
Are you ready to lean into analytic approaches that show customers the power of both technical and methodological innovation? Join our growing team supporting customer missions as a Data Scientist in Reston, VA (remote/hybrid, 2 days a week onsite required).
Responsibilities
What you get to do every day:
- Support the Advanced Data Consortium (ADC), a member-driven program that enables participating organizations to share, analyze, and act on complex data sets across a secure, cloud-hosted environment.
- Contribute to a SAFe/Scrum delivery team, participating in sprint planning, standups, reviews, and retrospectives.
- Analyze large and complex structured and unstructured data sets to identify trends, patterns, and anomalies that support member decision-making.
- Build and maintain automated data collection and ingestion pipelines using AWS GovCloud services.
- Develop and refine predictive models and machine learning algorithms, including ensemble approaches, to address Key Intelligence Questions (KIQs) posed by ADC members.
- Design and deliver data visualizations that communicate findings clearly to both technical teams and non-technical stakeholders.
- Support the ADC member directory and opt-in/opt-out data governance framework, ensuring data is surfaced appropriately based on member permissions.
- Partner with the ADC Portal engineering team to process member data samples and match them against customer requirements, in coordination with the program's vendor management team.
- Propose data-driven solutions and analytic strategies in response to evolving member requirements.
- Assist in developing and documenting data suitability processes, tracking analytic requirements, and conducting follow-up engagements with the customer and vendors on technical access and quality issues.
Qualifications
Required Qualifications:
- Active TS/SCI
- Bachelor's degree in Data Science, Computer Science, Mathematics, Statistics, Engineering, or a related technical discipline
- 3-5 years of experience in a data science, data analytics, data engineering, or related analytical role
- Proficiency in Python for data analysis, automation, and pipeline development
- Proficiency in SQL for querying, joining, and transforming data in relational stores such as PostgreSQL
- Experience building and managing automated data pipelines and collection workflows on AWS GovCloud (e.g., Lambda, Glue, S3, Step Functions)
- Ability to preprocess, clean, and prepare both structured and unstructured data sets for analysis
- Experience developing machine learning models and applying data visualization techniques to communicate analytical results
- Familiarity with containerized environments and microservices architecture
- Working knowledge of access management and data governance principles, including role-based permissions and secure data sharing practices
- Strong written and verbal communication skills, with the ability to present findings to varied audiences
- Ability to work independently and manage multiple priorities in a fast-paced program environment
Desired Qualifications:
- Prior experience supporting Intelligence Community programs or military intelligence organizations
- Familiarity with consortium, membership, or directory-based data platforms and opt-in/opt-out data governance models
- Experience with AWS SageMaker or similar cloud-native ML toolsets within a GovCloud environment
- Experience building data pipelines using Apache Spark, Kafka, or Apache NiFi
- Knowledge of IAM policies, S3 bucket security, VPN, and SSH access controls
- Experience with data visualization platforms such as Tableau, Power BI, or custom web-based dashboards
What you can expect from us