Data Scientist

Bespoke Technologies, Inc

$100K — $130K *
Aerospace & Defense
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in data science with a focus on natural language processing (NLP)
  • Proficient in Python and SQL, with hands-on experience in NLP and machine learning libraries
  • Experience with deep learning frameworks such as PyTorch, TensorFlow, or Keras
  • Familiarity with the Hugging Face Transformers library for NLP tasks
  • Strong background in creating and evaluating machine learning models for text processing
  • Ability to communicate complex technical methodologies and results effectively
  • Active Poly clearance required for this role

Responsibilities

  • Conduct advanced analysis using NLP and other deployed tools
  • Process and clean both structured and unstructured data, particularly text data
  • Design and implement ETL pipelines for complex datasets
  • Develop and organize relevant information within Oracle databases using SQL
  • Author analytic publications and create ad-hoc reports, including visualizations
  • Stay updated with the client’s metadata collection tools and processes
  • Provide technical training to staff as needed

Benefits

  • Work is performed fully on-site in McLean, VA
  • Engage in cutting-edge NLP and data science projects
  • Opportunity to influence senior organizational decisions through data insights
  • Collaborate with a team of experts in a dynamic environment
  • Access to advanced tools and technologies in data analysis
Full Job Description
BT-285 - Data Scientist
Skill Level: Expert
Location: McLean, VA (fully on-site, no remote option)

**Please do NOT apply if you do not have an active Poly clearance. Those without a Poly will not be considered.**

Introduction:
The client provides data-driven business analysis to support senior organizational leaders. The client requires support specializing in natural language processing (NLP) and associated data preparation to help identify challenges and opportunities for the client's customers. The client needs a contractor with strong SQL and Python skills to transform the client's structured and unstructured data into clear, well-supported analytic insights that help customers make decisions about production, resources, and personnel. The work may be performed independently or within a team environment.

Work Requirements:
  • The Contractor shall conduct sophisticated analysis using deployed tools and natural language processing.
  • The Contractor shall analyze large amounts of raw data, including text data, to provide business insights.
  • The Contractor shall preprocess or clean structured and unstructured client data, including text data.
  • The Contractor shall design and implement advanced ETL code and table configurations for complex data sets.
  • The Contractor shall use Structured Query Language (SQL) in the organization's Oracle database to develop and organize relevant information with supporting analytics.
  • The Contractor shall independently, or with a team, author analytic publications and produce ad-hoc reports to include data visualizations using the client's templates.
  • The Contractor shall stay current with the client's enterprise metadata collection tools.
  • The Contractor shall implement the client's existing coordination process.
  • The Contractor shall provide technical education to staff on an ad-hoc basis.
  • The Contractor shall provide subject matter expertise in NLP to support the client's initiatives.

Required Skills:
  1. Demonstrated professional or academic experience performing NLP tasks, including selecting the best Python libraries for a given task, choosing appropriate pre-processing actions, performing analysis, and assessing model performance.
  2. Demonstrated professional or academic experience using Python NLP packages such as spaCy, Gensim, or NLTK to analyze or process collections of documents.
  3. Demonstrated professional or academic experience with deep learning frameworks such as PyTorch, TensorFlow, or Keras.
  4. Demonstrated professional or academic experience with the Hugging Face Transformers library and hub.
  5. Demonstrated experience creating machine learning models that conduct text classification and topic modeling in Python using standard machine learning (scikit-learn) or deep learning models.
  6. Demonstrated academic or professional experience using encoder-decoder and generative language models to perform NLP tasks.
  7. Demonstrated academic or professional experience communicating methodological choices and model results.
  8. Demonstrated professional or academic experience and proficiency with SQL, including common table expressions, set operations, aggregate functions, and nested subqueries.
  9. Demonstrated professional or academic experience with version control and continuous integration tools such as GitHub and Jenkins.
  10. Demonstrated experience leveraging GPUs for accelerated computing.
  11. Develop practical approaches for measuring performance.
  12. Assist in developing types of measures, collecting and analyzing the data, and presenting that data to senior leadership.
  13. Conduct advanced statistical analysis on personnel and performance metrics.
  14. Assist in selection or development of appropriate methodology to conduct research.
  15. Analyze information and provide research findings in a manner that is easily grasped by the customer and consumers.

Desired Skills:
  1. Demonstrated experience writing Python scripts that pull data from web-based APIs and relational databases.
  2. Demonstrated experience with cloud computing development and architecture.
  3. Demonstrated experience with web development frameworks such as Flask.
  4. Demonstrated experience developing applications for semantic search.
  5. Demonstrated experience tuning LLMs on custom data sets and applying results to specific use cases.
  6. Demonstrated professional or academic experience and proficiency with Tableau to produce visualizations and dashboards.
