Staff / Principal Data Engineer

Appgate

• $180K — $270K *

New York, NY 10025In-Person

Enterprise Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Extensive experience with large-scale data platforms and data lakes at high volumes.
Hands-on expertise with Apache Spark, Apache Flink, and modern big data systems.
Proven best practices for building and maintaining data pipelines in batch and streaming.
Strong production engineering skills, including Kubernetes and CI/CD tools.
A history of owning data infrastructure with minimal supervision.

Responsibilities

Design, build, and operate the data lake and ingestion platform end-to-end.
Develop low-latency batch and streaming pipelines for signal ingestion and normalization.
Streamline the addition of new data sources to expand risk perspectives.
Establish data quality and observability for trustworthy foundational automation.
Build pipelines that support generative AI and unstructured data processing.
Manage deployment, CI/CD, and operational reliability on Kubernetes.
Collaborate with data science, product, and architecture for a unified data platform.

Benefits

Minimal travel expectations; primarily on-site role in NYC.
Collaborative environment with senior engineers and data scientists.
Opportunity to shape foundational technology used in fraud detection.
Focus on innovative applications of AI in fraud protection.

Full Job Description

About the Role

We are building an AI-native data platform that powers fraud detection and response across 360 Fraud Protection. We are hiring a Staff or Principal Data Engineer to own the data platform and data lake at the heart of that work. You work hands-on and own the domain end-to-end, alongside a small group of senior engineers, data scientists and product partners.

Owns the unified data platform and data lake that powers detection and response across 360 Fraud Protection.
Every detection model and downstream AI capability depends on this data foundation, which makes it one of the highest-leverage engineering roles on the team.
Stronger, broader and more reliable fraud signal directly improves detection accuracy, reduces customer losses and protects brand trust.

Key Responsibilities

Own the design, build and operation of the data lake and ingestion platform end-to-end, from architecture through production reliability.
Build low-latency batch and streaming pipelines that ingest signals from internal and external sources, normalize them to a common schema, enrich them with context and serve model-ready data to the layers above.
Make adding a new data source a routine task rather than a project, so our view of risk keeps widening over time.
Establish data quality, freshness, completeness, lineage and observability so the platform is trustworthy enough to automate on top of.
Build data pipelines that ground generative AI, including unstructured text and threat intelligence processing, embedding generation, vector storage and retrieval.
Own deployment, CI/CD and operational reliability of the platform on Kubernetes.
Partner with data science, product and architecture to turn the platform into a shared foundation across 360 Fraud Protection.

Required Qualifications

Extensive experience building and operating large-scale data platforms and data lakes, with comfort working at high data volumes.
Deep, hands-on expertise with Apache Spark, Apache Flink and modern big-data systems.
Proven command of best practices for building and maintaining data pipelines in both batch and streaming modes.
Strong production engineering skills across the full delivery lifecycle, including Kubernetes and CI/CD tooling, with the ability to ship end-to-end.
A track record of owning data infrastructure end-to-end with limited supervision.

Preferred Qualifications

Experience with generative AI and embedding models, including embedding pipelines, vector databases and retrieval.
A cybersecurity or threat intelligence background, with hands-on exposure to threat types such as phishing, mobile threats and malware.
Familiarity with transaction data and transaction fraud signals.

Compensation

Base salary range: $180 - $270
Bonus / commission: 15%

Travel

Minimal travel expected. This is an on-site role based in New York City, with 3-4 days per week in the office.

* Ladders Estimates

Similar Jobs

Principal MLOps Engineer
$150K — $200K *
Raft Company Website
Boston, MA 02115 (Suffolk County)
Reposted Today
Principal MLOps Engineer
$150K — $200K *
Raft Company Website
Remote
Reposted Today
Sr Director/Scientific Fellow, AI Safety, R&D Data Science and Digital Health
$196K — $342K *
Johnson & Johnson
Spring House, PA 19477 (Montgomery County)
Reposted Today
Sr Director/Scientific Fellow, AI Safety, R&D Data Science and Digital Health
$196K — $342K *
Johnson & Johnson
Cambridge, MA 02139 (Middlesex County)
Reposted Today
Sr Director/Scientific Fellow, AI Safety, R&D Data Science and Digital Health
$196K — $342K *
Johnson & Johnson
Titusville, NJ 08560 (Mercer County)
Reposted Today
Principal Data Engineer, Data Platform
$170K — $195K *
A Place For Mom Inc
Remote
2 days ago

Get Ready For Your
Next Interview

More Jobs at Appgate

Staff / Principal Data Engineer
$180K — $270K *
New York, NY 10025 (New York County)
Today
Enterprise Technology
In-Person
Senior/Staff/Principal SWE- Observability Engineering
$130K — $180K *
New York, NY 10025 (New York County)
1 month ago
Information Technology
In-Person
Senior/Staff/Principal SWE - OT Security Engineering
$130K — $180K *
New York, NY 10025 (New York County)
1 month ago
Information Technology
In-Person
Senior/Staff/Principal AI/ML Engineer - Threat Detection Engineering
$130K — $180K *
New York, NY 10025 (New York County)
1 month ago
Information Technology
In-Person
VP of Global Channels & Alliances
$180K — $220K *
New York, NY 10025 (New York County)
1 month ago
Enterprise Technology
In-Person

More Enterprise Technology Jobs

Product Owner - Advanced Analytics & Research
$100K — $130K *
Cincinnati, OH 45238 (Hamilton County)
Reposted Today
Applied AI ML Director
$180K — $220K *
JP Morgan Chase & Co.
Palo Alto, CA 94304 (Santa Clara County)
Today
Senior Director of Software Engineering - Business Partner Network Services
$150K — $200K *
JP Morgan Chase & Co.
Plano, TX 75024 (Collin County)
Today
Solutions Analyst [Multiple Positions Available]
$70K — $95K *
JP Morgan Chase & Co.
Philadelphia, PA 19103 (Philadelphia County)
Today
Senior Product Transformation & Agentic AI Lead
$120K — $150K *
JP Morgan Chase & Co.
Columbus, OH 43240 (Delaware County)
Today

Find similar Staff / Principal Data Engineer jobs:

Nationwide New York, NY

Staff / Principal Data Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Staff / Principal Data Engineer jobs:

Get Ready For Your
Next Interview