AI Data Engineer

Giesecke+Devrient

• $95K — $115K *

Montreal, QC H1A 0A1In-Person

Information Technology

Less than 5 years of experience

1 week ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

3+ years of hands-on experience in data engineering or related roles.
Proven ability to build production-grade data pipelines and ETL workflows.
Experience in preparing and validating data for machine learning projects.
Familiarity with RAG, document processing, and knowledge graph technologies is a plus.
Bachelor's degree in Computer Science, Software Engineering, or related field preferred.

Responsibilities

Design, build, and maintain data pipelines for AI and ML initiatives.
Develop document ingestion and processing pipelines for varied types of enterprise content.
Implement chunking strategies and retrieval-ready datasets for RAG applications.
Integrate vector databases, search indexes, and data lakes with existing systems.
Prepare data for machine learning including data cleaning and feature engineering.
Implement data quality checks and monitoring for ML pipelines.
Collaborate with cross-functional teams to deliver AI solutions.

Benefits

Opportunity to work in a cutting-edge AI Hub focused on Generative AI.
Support for professional development and continuous learning.
Collaborative environment with diverse teams and stakeholders.
Exposure to the latest technologies in data engineering and AI.
Challenging projects that drive innovation in machine learning.

Full Job Description

Compensation: $95,000-115,000 plus up to 5% bonus, capped at 150%

Job Summary

We are seeking a technical and execution-focused Data Engineer to join G+D's new AI Hub.

The ideal candidate will combine hands-on experience in data engineering for AI systems with strong Python, SQL, and data pipeline engineering capabilities. This role will support both AI engineering initiatives and machine learning projects by making enterprise data reliable, accessible, well-structured, and ready for production use.

This role is focused on data engineering for Generative AI, RAG, document ingestion, vector search, knowledge graphs, and machine learning workflows, including data preparation, data quality, feature engineering, and reusable data assets for AI solutions.

Primary Responsibilities

Design, build, and maintain data pipelines that support AI engineering, RAG, and machine learning initiatives from experimentation through production.
Develop document ingestion and processing pipelines for structured, semi-structured, and unstructured enterprise content, including parsing, cleaning, normalization, metadata extraction, and enrichment.
Implement chunking strategies, embedding pipelines, indexing workflows, and retrieval-ready datasets for RAG and Graph RAG applications.
Build and maintain integrations with vector databases, search indexes, graph databases, data lakes, warehouses, and enterprise source systems.
Support knowledge graph initiatives by preparing entities, relationships, ontologies, metadata, and graph-ready data pipelines.
Prepare and transform data for machine learning projects, including data cleaning, labeling support, feature engineering, feature validation, and dataset versioning.
Implement data quality checks, lineage, observability, monitoring, and automated validation for AI and ML data pipelines.
Collaborate with data scientists, applied AI engineers, platform engineers, security, data governance teams, and business stakeholders to deliver scalable AI solutions.
Contribute to reusable ingestion components, data engineering patterns, technical standards, and best practices for the AI Hub.
Other duties as assigned.

Qualifications, Experience and Educational Requirements

Work Experience:

Three (3)+ years of hands-on experience in data engineering, analytics engineering, machine learning engineering, or related software/data development roles.
Experience building production-grade data pipelines, ETL/ELT workflows, APIs, data services, or distributed data processing systems.
Experience preparing data for machine learning projects, including data cleaning, feature engineering, dataset creation, and data quality validation.
Experience with RAG, document processing, embeddings, vector databases, search systems, or knowledge graphs is strongly preferred.
Experience contributing to production-grade systems in enterprise, regulated, or security-sensitive environments is preferred.

Skills and Competencies:

Strong Python and SQL skills, with practical experience building reliable, maintainable, and testable data pipelines.
Hands-on experience with data engineering tools and frameworks such as Pandas, PySpark, Airflow, Dagster, Prefect, dbt, or similar technologies.
Practical knowledge of document ingestion, document parsing, chunking, embeddings, semantic search, hybrid search, and retrieval pipelines.
Hands-on experience with vector databases and search technologies such as pgvector, Pinecone, Weaviate, Milvus, OpenSearch, Elasticsearch, or similar platforms.
Hands-on experience with graph databases or knowledge graph technologies such as Neo4j, RDF, SPARQL, graph data modeling, or entity-relationship extraction is considered an asset.
Experience with cloud data platforms, lakehouse patterns, object storage, relational databases, and data warehouse technologies.
Understanding of machine learning workflows, feature engineering, feature stores, model training data requirements, and dataset versioning.
Ability to implement data quality controls, validation tests, lineage, monitoring, access control, and governance-aware data workflows.
Ability to work with technical specifications, data contracts, architecture patterns, and engineering standards.
Experience working in specification-first, contract-driven, or Spec-Driven Development environments is considered an asset.
Strong problem-solving skills and ability to work in a fast-moving, delivery-focused environment.

Education:

Bachelor's degree in Computer Science, Software Engineering, Data Engineering, Artificial Intelligence, Data Science, or related field preferred.
Master's degree is considered an asset.

Additional Information

*This job description is not intended to be all inclusive. The candidate hired will also perform other reasonable related business duties as assigned by the supervisor. The company reserves the right to revise or change job duties as needed. This job description does not constitute a written or implied contract of employment.

By applying to this position you are confirming you possess either a Canadian citizenship, permanent resident status or valid work permit.

Please note: Reference Checks and Credit, Criminal Background Checks will be administered on suitably qualified candidates. Your application will be kept on file for up to two years.

$$ https://career5.successfactors.eu/career?company=gieseckede&career_job_req_id=27204&career_ns=job_application

* Ladders Estimates

Similar Jobs

Data Engineer
$114K — $135K *
Cushman & Wakefield
Boston, MA 02115 (Suffolk County)
Reposted Today
Manager - Infrastructure Specialist - Data & Cloud Platforms (IC)
$60K — $132K *
CVS Health
Remote
Reposted Today
Data Engineer Consultant (Microsoft Fabric)
$90K — $120K *
Keyrus Canada
Montreal, QC H1A 0A1
Reposted Today
Software Data Modeler
$73K — $155K *
CACI International
Remote
Yesterday
Data Engineer
$82K — $145K *
Boeing
Richmond, QC J0B 2B0
Yesterday
Associate Data Engineer Artificial Intelligence AI
$90K — $110K *
Cotiviti
Remote
Yesterday

Get Ready For Your
Next Interview

More Jobs at Giesecke+Devrient

Business Development Director, Americas
$175K — $210K *
San Jose, CA 95123 (Santa Clara County)
Today
Transportation
In-Person
Buyer II (18 months contract)
$73K — $91K *
Markham, ON L3R 0G6
5 days ago
Manufacturing & Automotive
In-Person
Production Manager, EMV
$125K — $145K *
Bolingbrook, IL 60440 (Will County)
Reposted 5 days ago
Manufacturing & Automotive
In-Person
AI Data Engineer
$95K — $115K *
Montreal, QC H1A 0A1
1 week ago
Information Technology
In-Person
Sr. Account Manager, Transport & Logistics Technology
$125K — $140K *
Chicago, IL 60629 (Cook County)
1 week ago
Transportation
In-Person

More Information Technology Jobs

Business Development Director
$300K — $345K + $120K bonus *
Tier1 IT Services Firm
Kansas City, MO 64116 (Clay County)
6 days ago
Client Partner / Business Developemnt - Banking
$250K — $320K + $70K bonus *
IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
6 days ago
Software Engineer II, Search & Data Infrastructure -Slack
$117K — $223K *
Salesforce
Washington, DC 20011 (District Of Columbia County)
Reposted Today
Software Engineer Lead
$55K — $158K *
The PNC Financial Services Group, Inc
Dallas, TX 75217 (Dallas County)
Reposted Today
Senior R&D Engineer-17637
$130K — $180K *
Synopsys Inc
Sunnyvale, CA 94087 (Santa Clara County)
Today

Find similar AI Data Engineer jobs:

Nationwide Montreal, QC

AI Data Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar AI Data Engineer jobs:

Get Ready For Your
Next Interview