Software Engineer, Data Infrastructure

Scale AI • $186K — $233K *

New York, NY 10025In-Person

Aerospace & Defense

5 - 7 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of backend or data infrastructure experience at a Senior level or higher.
Expertise in systems programming languages like Rust, Go, C++, Python, or Java; strong grasp of memory management and distributed systems.
Experience with high-throughput, low-latency data processing and optimization.
Solid understanding of information retrieval, ideally in complex systems such as search or recommendation engines.
Desire to contribute to technology that impacts national security and defense.

Responsibilities

Design and build the architecture for a unified data ensemble from diverse simulation inputs.
Develop scalable data architectures to manage large batch throughput jobs with minimal latency.
Create sophisticated relational data models that represent complex simulation environments.
Problem-solve ambiguous requirements to build custom systems suited for intricate data needs.
Lead technical initiatives, ensuring high code quality and robust system performance.

Benefits

Comprehensive health, dental, and vision insurance.
Retirement benefits.
Learning and development stipend.
Generous paid time off (PTO).
Potential commuter stipend.

Full Job Description

Scale AI is seeking a highly skilled and motivated Mission Software Engineer to join our dynamic Federal Engineering team. As a part of this team, you will play a critical role in supporting Scale's government customers by scoping and developing onsite solutions. Our scalable, high-performance platform is the foundation for these customer solutions, and your expertise will be instrumental in designing and implementing systems that can handle interactions with existing customer systems to help our products integrate into existing customer workflows.
The Role

We are looking for an exceptional Senior Software Engineer to architect and build the foundational data infrastructure that will serve as the brain of a project ecosystem.
We are not looking for someone to stitch together off-the-shelf data frameworks. You will be responsible for designing highly novel data models and processing pipelines capable of handling massive quantities of output data from complex simulations.
At the core of this role is the challenge of building a foundational data ensemble-a unified architecture that seamlessly aggregates, structures, and stages diverse sources of simulation outputs and user inputs. Your systems will manage enormous batch throughput jobs with strict, minimal latency requirements, ensuring that downstream AI systems and language models have the exact context they need to actionably reason over complex, multi-dimensional scenarios.

Key Responsibilities

Architect the Data Ensemble: Design and implement the architecture to ensemble various sources of injected context (deeply structural simulation data, historical game states, and dynamic user inputs) into a unified, highly queryable format optimized for LLM consumption.
Massive Batch Infrastructure: Build highly scalable, resilient data architectures from scratch. You will optimize for moving, transforming, and processing massive quantities of simulation output data via enormous batch jobs, maintaining the minimal latency required for rapid wargame iterations.
Complex Data Modeling: Design sophisticated, highly relational data models that accurately represent massive, state-based simulation environments, making them easily interpretable by machine learning models.
First-Principles Problem Solving: Navigate highly ambiguous product requirements to design custom, ground-up systems where existing open-source or enterprise tools simply cannot handle the structural complexity or scale.
Technical Leadership: Set the technical standard for the data infrastructure team, driving rigorous code quality, system performance, and architectural clarity.

What We're Looking For

Experience: 5+ years of backend or data infrastructure experience, operating at a Senior, Staff, or Principal level.
Engineering Excellence: Deep, expert-level proficiency in systems languages (e.g., Rust, Go, C++, or highly optimized Python/Java, Spark) and a fundamental understanding of memory management, compute limits, and distributed systems architecture.
High-Throughput / Low-Latency Data: Proven track record of processing massive datasets. You understand how to optimize massive batch jobs and parallel processing across distributed simulation nodes without sacrificing speed.
Information Retrieval & Context Surfacing: You don't need a background in AI agents, but you must be an expert in surfacing the right needle from an ocean of hay to feed decision-making engines. We highly value engineers with backgrounds in:

Search & RecSys: Building complex information retrieval systems or recommendation engines.
Gaming / MMOs: Managing complex state, data relationships, and telemetry for massive, highly populated simulations.
High-Frequency Trading (HFT): Processing disparate, massive streams of data for algorithmic decision-making.

Mission-Driven: A strong desire to build robust, foundational technology that supports national security and defense modernization.

Nice to Have

- Security Clearance: An active Secret or TS/SCI clearance is a nice to have for this role. If you do not have an active clearance, you must be eligible and willing to obtain one.
- Experience with LLM context optimization, vector embeddings, or agentic AI frameworks (e.g., advanced RAG architectures).
- Deep domain experience working with wargaming data, complex systems modeling, or distributed simulation protocols.
- Previous experience in a high-growth, 0-to-1 startup environment.

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.

Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is:

$186,400-$233,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Scale AI

Scale AI is an artificial intelligence company that provides data annotation services to improve machine learning algorithms. The company's platform offers a range of services including image annotation, text annotation, and 3D annotation. Scale AI was founded in 2016 and is headquartered in San Francisco, California.

Learn more about Scale AI

Size

500 employees

Industry

Information Technology

Founded

2017

* Ladders Estimates

Similar Jobs

Senior Lead Data Engineer
$209K — $238K *
Capital One Financial Corporation
Richmond, VA 23223 (Richmond City County)
Today
Senior Lead Data Engineer
$229K — $262K *
Capital One Financial Corporation
Mclean, VA 22101 (Fairfax County)
Today
Software Engineer, Data Infrastructure
$186K — $233K *
Scale AI
Washington, DC 20011 (District Of Columbia County)
Today
Senior Data Engineer
$147K — $199K *
General Dynamics Information Technology, Inc.
Falls Church, VA 22042 (Fairfax County)
Today
Principal Data Engineer
$215K — $250K *
Upside Business Travel
Washington, DC 20011 (District Of Columbia County)
Today
Data Architect
$170K — $200K *
APEX Analytix, Inc
Remote
Reposted Yesterday

Get Ready For Your
Next Interview

More Jobs at Scale AI

Software Engineer, Identity
$216K — $270K *
San Francisco, CA 94112 (San Francisco County)
5 days ago
Information Technology
In-Person
Software Engineer, Identity
$216K — $270K *
New York, NY 10025 (New York County)
5 days ago
Information Technology
In-Person
Strategic Capture Lead, Homeland Security & Federal Law Enforcement
$203K — $254K *
Seattle, WA 98115 (King County)
5 days ago
Education, Government & Non-Profit
In-Person
Strategic Capture Lead, Homeland Security & Federal Law Enforcement
$203K — $254K *
Washington, DC 20011 (District Of Columbia County)
5 days ago
Education, Government & Non-Profit
In-Person
Strategic Capture Lead, Homeland Security & Federal Law Enforcement
$203K — $254K *
San Francisco, CA 94112 (San Francisco County)
5 days ago
Business Services
In-Person

More Aerospace & Defense Jobs

Chief Executive Officer – UAV Aerospace Technology
$300K + significant company stock/equity participation *
Soaring Aerospace Inc.
Orange, CA 92868 (Orange County)
Today
Site General Manager
$200K — $500K++ $60K bonus *
Spartronics
Williamsport, PA 17703 (Lycoming County)
2 days ago
Chief Executive Officer
The Mitalmor Group
New York, NY 10001 (New York County)
Reposted 2 days ago
Engineering Program Manager
$80K — $150K *
Signature Research, Inc.
Calumet, MI 49913 (Houghton County)
1 week ago
Mid-Level Structural Analysis Engineer - Systems Stress
$103K — $140K *
Boeing
North Charleston, SC 29405 (Charleston County)
Reposted Today

Find similar Software Engineer, Data Infrastructure jobs:

Nationwide New York, NY

Software Engineer, Data Infrastructure

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Software Engineer, Data Infrastructure jobs:

Get Ready For Your
Next Interview