Software Engineer, Data Integration

Aaru

$100K — $150K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3+ years of experience in data integration, data engineering, or ETL/ELT roles
  • Expertise in managing high-volume data (>100 TB)
  • Proficient in SQL and Python
  • Experience with platforms like Snowflake, BigQuery, or Databricks
  • Strong analytical skills related to data quality management

Responsibilities

  • Build and maintain scalable data pipelines for large multimodal datasets
  • Oversee data ingestion through various methods including APIs and cloud storage
  • Design workflows for linking and deduplicating disparate datasets
  • Collaborate with teams to ensure usability of integrated data for simulations
  • Implement data quality checks and validation processes
  • Evaluate new data sources for integration potential

Benefits

  • Comprehensive medical, vision, and dental coverage
  • Visa sponsorship and relocation support
  • Various other benefits and perks
Full Job Description
ABOUT THE ROLE

As a Data Integration Specialist, you will build and maintain the data foundation that powers Aaru's simulations. You will work across large internal and third-party datasets, designing reliable integration workflows and ensuring that data can be linked, queried, and trusted at scale. This role sits at the intersection of data engineering and architecture and is critical to how Aaru produces predictive intelligence.

RESPONSIBILITIES
  • Build and maintain scalable pipelines to ingest, clean, and integrate large multimodal datasets
  • Own data ingestion across APIs, flat files, cloud storage, and data warehouses
  • Design workflows for linkage, entity resolution, deduplication, and schema harmonization across imperfect or incongruent datasets
  • Work with engineering, research, and deployment teams to make integrated data usable for simulation ingestion
  • Establish and monitor data quality checks, validation logic, and documentation across datasets and pipelines
  • Help evaluate new data sources and determine how they can be joined with existing data assets


YOU MAY BE A FIT IF
  • You have 3+ years of experience in data integration, data engineering, ETL/ELT, or a similar role involving large-scale datasets
  • You have hands-on experience working with messy, high-volume data (>100 TB) and know how to build systems that remain reliable at scale
  • You are highly fluent in SQL and Python, and comfortable working across modern data infrastructure such as Snowflake, BigQuery, Databricks, or similar tools
  • You have strong judgment around data quality and know how to preemptively identify inconsistencies, edge cases, and integration risks


STRONG CANDIDATES MAY ALSO
  • Have experience with alternative data (transaction data, clickstream, geospatial, etc.), from either a hedge fund or a data marketplace lens
  • Have experience building matching or entity-resolution systems across fragmented or noisy identifiers
  • Have familiarity with privacy, compliance, and data licensing considerations when working with sensitive or third-party data
  • Have worked closely with research or product teams to turn raw data from disjoint sources into accessible, structured databases
  • Have a background in statistics and familiarity with sampling biases, bot detection, imputation, and standard data quality metrics


LOCATION

This role is based in New York City. Aaru is an in-person company, working in the office 5 days a week. Candidates are expected to be located within the New York City metropolitan area or be open to relocation.

BENEFITS

At Aaru, we take care of our people. In addition to a competitive base salary and equity participation, we offer comprehensive medical, vision, and dental coverage, visa sponsorship and relocation support, and various other benefits and perks.
