Data Scientist

GRVTY

$100K — $130K *
Aerospace & Defense
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in building production data pipelines and ETL/ELT workflows at scale.
  • Proficiency in Apache Spark and PySpark for distributed data processing.
  • Advanced skills in Python, especially with data manipulation libraries (Pandas, NumPy).
  • Deep understanding of data security, privacy, governance, and compliance principles.
  • Experience with workflow orchestration tools like Step Functions and Airflow.
  • Familiarity with containerization technologies such as Docker or Podman for cloud deployments.
  • Experience with AWS services such as S3, Lambda, and Step Functions for data applications.
  • Hands-on experience with PostgreSQL and MySQL, including performance tuning.
  • Expertise in SQL and query optimization for complex analytical workloads.
  • Understanding of version control (Git) and CI/CD for data pipelines.

Responsibilities

  • Build and maintain scalable data pipelines for large-scale data processing.
  • Collaborate with software development teams to integrate analytical models into applications.
  • Develop and optimize ETL processes to extract and transform critical data.
  • Ensure data security and compliance adherence in all data-related projects.
  • Implement workflow orchestration solutions to improve data workflows.
  • Deploy containerized applications in cloud environments to support scalability.
  • Work closely with stakeholders to gather data requirements and design effective solutions.

Benefits

  • Robust health plan including medical, dental, and vision coverage.
  • Health Savings Account with company contribution to support employee health.
  • Annual Paid Time Off and Paid Holidays to promote work-life balance.
  • Paid Parental Leave to support family growth and bonding.
  • 401k plan with a generous company match for future financial stability.
  • Opportunities for Training and Development to grow skills and advance careers.
  • Recognition through Award Programs to celebrate employee accomplishments.
  • Variety of Company Sponsored Events for team building and camaraderie.

Full Job Description

What You'll Be Owning

GRVTY is seeking a Data Scientist with a TS/SCI + Poly clearance (applicable to this customer) to join one of our top projects in McLean, VA. The Data Scientist will work in a fast-paced, dynamic, agile software development environment. The multi-disciplinary project team works together on multiple projects that include automating the processing of large forensic images, extracting and enriching metadata, and displaying the resulting information in meaningful ways for analysts to conduct assessments. Team members use a mix of COTS and GOTS tools and technologies and build integrations with a variety of external partner applications. Most solutions are cloud-based. The Sponsor adheres to Agile Scrum development methodology best practices and has 2-week sprint cycles.

What You Must Have
  • Demonstrated experience building production data pipelines and ETL/ELT workflows at scale.
  • Demonstrated experience with Apache Spark and PySpark for distributed data processing.
  • Demonstrated advanced Python programming skills, including data manipulation libraries (Pandas, NumPy) and data engineering best practices.
  • Demonstrated understanding of data security, privacy, governance, and compliance principles.
  • Demonstrated experience with workflow orchestration tools such as Step Functions and Airflow.
  • Demonstrated experience with containerization such as Docker or Podman, and deploying data applications in cloud environments.
  • Demonstrated experience with AWS services (in particular S3, Lambda, and Step Functions).
  • Demonstrated experience with PostgreSQL and MySQL in production environments, including performance tuning and schema design.
  • Demonstrated experience with SQL and query optimization for complex analytical workloads.
  • Demonstrated experience with version control (Git) and CI/CD practices for data pipelines.
  • Demonstrated experience working with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight.
  • Demonstrated strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks.

Why Choose GRVTY

The toughest national security challenges demand vision and ingenuity, not just resources. We deliver mission and technical expertise to outpace our adversaries. We're purpose-built to tackle the most entrenched, systemic national security issues around the world.

We partner with our customers to help them overcome challenges in every corner of technology and defense, including the ones still being explored. Our growing capabilities create complementary advantages, giving on-the-ground operations the edge they need to succeed. We muster everything we have to answer every challenge presented, every day of our lives.

At GRVTY, we believe that when our employees thrive, our company thrives. That's why we offer a comprehensive and competitive benefits package designed to support your well-being, growth, and work-life balance.
• Robust health plan including medical, dental, and vision
• Health Savings Account with company contribution
• Annual Paid Time Off and Paid Holidays
• Paid Parental Leave
• 401k with generous company match
• Training and Development Opportunities
• Award Programs
• Variety of Company Sponsored Events
