Data Engineer - ML/AI Data Platform (Remote)

FEI Systems

• $100K — $130K *

Columbia, MD 21044In-Person

Information Technology

5 - 7 years of experience

3 weeks ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of hands-on data engineering experience in cloud environments.
Strong proficiency in Python for data processing and pipeline development.
Advanced SQL skills with hands-on Snowflake transformation experience.
Experience designing and optimizing ELT pipelines in Snowflake.
Familiarity with AWS-native services like S3, Glue, and Athena.

Responsibilities

Design, build, and maintain scalable data pipelines for ML/AI workloads.
Engineer pipeline patterns such as full loads and incremental loads.
Ensure data pipelines are reliable, secure, and performant within AWS.
Perform data transformations in Snowflake using SQL features.
Collaborate with ML engineers to operationalize features into training pipelines.

Benefits

Fully remote position.
Comprehensive company benefits.
Opportunities for professional growth and development.

Full Job Description

Role Overview

We are seeking a Data Engineer to support Machine Learning and AI initiatives. Working closely with the Solution Architect, Data Architect, DevOps, and Application Engineering teams, this role is responsible for ensuring that data within our cloud-based platform is high quality, well-governed, feature-ready, and production-grade to support model training, deployment, and ongoing operations.

The ideal candidate has 5+ years of cloud data engineering experience with strong proficiency in Snowflake, Python, and SQL, and solid familiarity with AWS-native data services.

Candidates are not expected to arrive with expertise across every area listed. We are looking for demonstrated strength in the core data engineering and Snowflake skills, combined with the initiative and aptitude to grow into the broader scope of the role.

Day-One Priorities & Scope

Immediate focus is Snowflake-based data engineering, pipeline development, and data quality. Feature engineering, model training support, and MLOps contributions are growth areas that will ramp over time as you become embedded with the team.

Key Responsibilities

Data Pipeline Engineering

Design, build, and maintain scalable data pipelines supporting ML/AI workloads.
Engineer pipeline patterns including full loads, incremental loads, change-based loads, and slowly changing dimensions.
Ensure pipelines are reliable, performant, secure, and maintainable, troubleshoot and monitor pipelines within an AWS ecosystem.

Snowflake & Cloud Data Engineering

Perform data transformations in Snowflake using SQL and native Snowflake features.
Design and optimize schemas, tables, views, and materialized views for ML/AI consumption.
Support AWS-native data lake patterns using S3, Glue, Athena, Apache Iceberg, and S3 Tables.

Feature Engineering & Data Preparation

Perform data cleansing, normalization, and enrichment to support ML model development.
Design and implement feature engineering pipelines including aggregation and transformation.
Ensure consistency, reuse, and versioning of features across models and use cases.
Support feature store patterns to enable feature discoverability and reuse.
Collaborate with ML engineers and data scientists to operationalize features into training pipelines.

Model Training & MLOps Support

Support model training workflows, including dataset preparation and scheduled refreshes.
Ensure training datasets and features are reproducible, traceable, and auditable.
Integrate data pipelines into CI/CD workflows; support version control, testing, and deployment of data assets.
Monitor pipeline health, data freshness, and downstream impact on ML/AI systems.

Required Skills & Experience

5+ years of hands-on data engineering experience in a cloud environment.

Core Technologies

Python - strong proficiency for data processing and pipeline development.
SQL - advanced skills with hands-on Snowflake transformation experience.
Snowflake - ELT pipeline design, schema optimization, performance tuning, cost management.
PostgreSQL - experience with querying, data modeling, and analytics; familiarity with SQL Server to PostgreSQL migration a plus.
AWS - S3, Glue, Athena, Snowflake integration, and managed relational databases (e.g., Aurora, RDS).
Apache Iceberg / S3 Tables - familiarity with open table format ecosystems.
Streaming ingestion tools (e.g., Kinesis, Kafka, or equivalent).
Workflow orchestration tools (e.g., Airflow, Step Functions, or equivalent).

Pipeline & Data Engineering

Experience with full loads, incremental loads, append-only pipelines, change-based processing, and SCDs.
Data validation, reconciliation, error handling, and restart/recovery patterns.
Data modeling for analytics, ML/AI, and downstream application use cases.
Ability to evaluate pipeline design trade-offs across performance, cost, reliability, and maintainability.

DevOps & Engineering Practices

Structured SDLC experience with CI/CD pipelines for data and ML workflows.
API-based and event-driven data integration patterns.
Distributed data processing environments.

ML/AI Data Foundations

Understanding of data requirements for ML/AI workloads.
Experience preparing training datasets and features from enterprise data lakes.
Familiarity with reproducibility, dataset versioning, and data lineage concepts.
Familiarity with GenAI concepts relevant to data engineering, such as embedding pipelines, vector databases, retrieval-augmented generation (RAG) data flows, or prompt-driven data processing - including awareness of data security and privacy considerations when working with LLMs.

Education

Bachelor's degree in Computer Science, Data Engineering, Information Systems, or a related technical field. Equivalent professional experience will be considered.

Location: Remote

Status: Full time position with full company benefits.

* Ladders Estimates

Similar Jobs

Data Engineer
$93K — $114K *
FocusKPI Inc.
New York, NY 10025 (New York County)
Today
Data Manager - Bethesda, MD; Must have an active TS/SCI Clearance with a polygraph, Immediate Hire
$100K — $130K *
Synertex LLC
Bethesda, MD 20817 (Montgomery County)
Yesterday
Database / Data Management Specialist
$88K — $154K *
Parsons Corporation
Washington, DC 20011 (District Of Columbia County)
Yesterday
Corporate Planning & Management, Data Engineering, New York, Associate
$90K — $130K *
The Goldman Sachs Group, Inc
New York, NY 10025 (New York County)
Yesterday
Data Engineer
$62K — $141K *
Booz Allen Hamilton, Inc.
Arlington, VA 22204 (Arlington County)
Yesterday
Data Scientist Prin
$100K — $130K *
BAE Systems
Sterling, VA 20164 (Loudoun County)
Reposted Yesterday

Get Ready For Your
Next Interview

More Jobs at FEI Systems

Full Stack Developer - AI (Remote)
$90K — $130K *
Columbia, MD 21044 (Howard County)
2 weeks ago
Healthcare
In-Person
Full Stack Developer - AI (Remote)
$90K — $130K *
Remote
2 weeks ago
Healthcare
Remote in Columbia, MD
Business Analyst Team Lead (Remote)
$90K — $120K *
Columbia, MD 21044 (Howard County)
3 weeks ago
Healthcare
In-Person
Business Analyst Team Lead (Remote)
$90K — $120K *
Remote
3 weeks ago
Business Services
Remote in Columbia, MD
Data Business Analyst - Migration and Integration Analyst (Remote)
$75K — $95K *
Columbia, MD 21044 (Howard County)
3 weeks ago
Healthcare
In-Person

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
3 days ago
Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
1 week ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Director It
$140K — $160K *
Mohegan
Niagara Falls, ON L2E 0A1
Today
Developer III
$100K — $130K *
Buildium
Richardson, TX 75080 (Dallas County)
Today

Find similar Data Engineer - ML/AI Data Platform (Remote) jobs:

Nationwide Columbia, MD

Data Engineer - ML/AI Data Platform (Remote)

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Data Engineer - ML/AI Data Platform (Remote) jobs:

Get Ready For Your
Next Interview