Data Engineer, Specialist

Vanguard Group, Inc.

$90K — $130K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in data engineering or related field
  • Proficiency in SQL for data querying, transformation, and reporting
  • Experience with Python or ETL tools for effective data preparation
  • Familiarity with data platforms and reporting tools
  • Knowledge of data quality and validation implementations
  • Understanding of version control and deployment pipelines

Responsibilities

  • Integrate data from various enterprise systems into cohesive datasets
  • Translate business requirements into usable queries and reports
  • Support batch and near real-time data processing needs
  • Ensure data pipelines are scalable, performant, and resilient
  • Implement data cleansing and validation logic for accuracy
  • Develop data models and pipelines from user-defined use cases
  • Refine and iterate on existing data products for enhanced functionality

Benefits

  • Contribute to enterprise-level data management for AI systems
  • Engage in impactful work with high-level data ownership
  • Work in a dynamic environment focusing on Responsible AI
  • Opportunity for involvement in key enterprise capabilities
  • Collaborative culture that fosters continuous improvement
Full Job Description
This role enables the foundational data layer for enterprise risk visibility and Responsible AI at scale. By delivering accurate, integrated, and timely data products, you help ensure Vanguard can monitor AI systems, track inventory, and manage supplier egress risk with confidence and precision. As a Data Engineer, you will design, develop, and maintain data pipelines and data products that enable enterprise insights, monitoring, and governance across Supplier Egress, the Enterprise GenAI Monitoring Function, and the GenAI Inventory.

This role focuses on executing ETL (Extract, Transform, Load) processes and integrating data from disparate systems, tools, and platforms to create reliable, scalable, and high-quality data assets. These data products support critical business capabilities including third-party risk visibility, AI system monitoring, inventory tracking, and governance reporting.

Role Context (Why This Role Exists)

This role sits at the intersection of three critical enterprise capabilities:

1. Supplier Egress & Insights

  • Build datasets that unify data across internal systems and third-party integrations
  • Enable visibility into where and how sensitive data (e.g., PII) is shared externally

2. Enterprise GenAI Monitoring Function

  • Enable ingestion, transformation, and aggregation of monitoring metrics, alerts, and system performance data
  • Contribute to standardized, scalable monitoring datasets used for governance and incident management

3. GenAI Inventory

  • Build and maintain datasets supporting the enterprise inventory of AI systems (e.g., MGM, ServiceNow integrations)
  • Enable tracking of system attributes (risk tier, ownership, compliance status, lifecycle stage)
  • Support reconciliation and alignment across multiple systems of record

Key responsibilities

  • Integrate data from multiple enterprise systems (e.g., monitoring platforms, inventory systems, third-party tools)
  • Translate requirements into queries, datasets, and reports
  • Support both batch and near real-time data processing use cases
  • Ensure pipelines are scalable, performant, and resilient
  • Implement data validation, cleansing, and transformation logic to ensure accuracy and completeness
  • Translate use cases (e.g., monitoring dashboards, inventory reporting) into data models and pipelines
  • Support iterative development and refinement of data products

Required Skills

  • SQL for querying, transformation, and reporting
  • Experience with Python or ETL tools for data preparation
  • Familiarity with data platforms and reporting tools
  • Data quality and data validation implementations
  • Familiarity with version control and deployment pipelines

An informational session will be held on 6/3 at 12:00 PM

Microsoft Teams meeting

Join: https://teams.microsoft.com/meet/273136031135663?p=aEqgklN88Q1FPLLpnP

Meeting ID: 273 136 031 135 663

Passcode: 72g4gJ2U

Special Factors

Sponsorship
Vanguard is not offering visa sponsorship for this position.
