AI Quality Engineer

Luma Financial Technologies

• $90K — $115K *

Cincinnati, OH 45238In-Person

Finance & Insurance

Less than 5 years of experience

4 days ago

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Hands-on experience with LLM APIs in production environments.
Strong skills in prompt engineering and instruction design.
Analytical mindset for isolating variables impacting model output quality.
Experience in designing structured test cases or evaluation frameworks.
Familiarity with JSON schema and data validation patterns.
Ability to interpret complex financial documents, preferably with financial services experience.
Strong written communication skills for documenting findings clearly.

Responsibilities

Run daily accuracy evaluations against extraction schemas for various financial product types.
Design and maintain test cases and regression suites to benchmark extraction quality.
Diagnose extraction failures and distinguish between various issues.
Iterate on prompt engineering and system instructions to enhance extraction accuracy.
Collaborate with AI Engineer lead to integrate findings into validation logic.
Document failure modes with reproducible examples and hypotheses.
Build and report on evaluation metrics and accuracy trends.

Benefits

High-ownership role on a strategic automation initiative with visibility to leadership.
Opportunity to build foundational quality systems for real financial data processing.
Potential to evolve this role into evaluation infrastructure and model improvement strategies.

Full Job Description

About the role

Luma Fintech is building a best-in-class LLM-powered document parsing pipeline that extracts structured data from complex financial product term sheets. We are seeking an AI Quality Engineer to own the daily testing, analysis, and iterative improvement of our Claude API-based extraction system. This role sits at the intersection of financial data operations and applied AI, you will be the person who closes the loop between what the model outputs and what the schema demands.

What you'll do

Run daily accuracy evaluations against a defined extraction schema, tracking field-level performance across structured product types (autocallables, CLNs, barrier notes, etc.)
Design and maintain test cases, regression suites, and gold-standard document sets to benchmark extraction quality over time
Diagnose extraction failures, distinguishing between prompt logic issues, schema ambiguity, model hallucinations, and edge-case document formats
Iterate on prompt engineering, system instructions, and context design to improve field-level extraction accuracy
Work alongside the AI Engineer lead to feed findings into validation logic and rules-based layers that sit on top of LLM output
Document failure modes with reproducible examples and root-cause hypotheses
Build and maintain evaluation metrics (precision, recall, field coverage, hallucination rate) and report on accuracy trends
Flag schema gaps or ambiguities surfaced by real document variance and collaborate with data operations to refine field definitions
Contribute to RAG improvements by identifying where retrieved context is insufficient or misleading

Qualifications

Required

Hands-on experience working with LLM APIs (Anthropic, OpenAI, or similar) in a production or near-production context
Strong prompt engineering skills, you understand how instruction design affects model behavior, not just output tone
Analytical mindset with the ability to systematically isolate variables in model output quality
Experience designing structured test cases or evaluation frameworks (QA background is a plus)
Familiarity with JSON schema, structured data output, and data validation patterns
Ability to read and interpret complex financial or legal documents (term sheets, prospectuses, offering documents), prior financial services exposure strongly preferred
Strong written communication; you'll be documenting findings for both technical and non-technical stakeholders

Preferred

Experience with RAG pipelines and retrieval evaluation
Python proficiency for scripting evaluation workflows or parsing outputs
Background in structured financial products (autocallables, structured notes, credit-linked notes)
Familiarity with evaluation frameworks or tools (e.g., LangSmith, RAGAS, custom evals)

What Success Looks Like

In 90 days, you have established a repeatable daily evaluation process, a documented baseline of field-level accuracy across product types, and have driven at least one measurable improvement in extraction quality through prompt iteration.

Why This Role

This is a high-ownership position on a strategic automation initiative with direct visibility to leadership. You won't be maintaining someone else's test suite, you're building the quality layer of a system that processes real financial data at scale. The role will evolve as the system matures, with opportunity to expand into evaluation infrastructure and model improvement strategy.

The pay range for this role is:

90,000 - 115,000 USD per year (Cincinnati)

* Ladders Estimates

Similar Jobs

Sr AI Solution Engineer
$106K — $133K *
James Hardie Industries plc
Chicago, IL 60629 (Cook County)
4 days ago
Senior AI & Automation Specialist
$110K — $140K *
Xsolla
Remote
4 days ago
AI and Data Science Engineer III
$113K — $208K *
Deloitte
Cleveland, OH 44130 (Cuyahoga County)
4 days ago
AI and Data Science Engineer III
$113K — $208K *
Deloitte
Pittsburgh, PA 15237 (Allegheny County)
4 days ago
AI and Data Science Engineer III
$113K — $208K *
Deloitte
Chicago, IL 60629 (Cook County)
4 days ago
AI and Data Science Engineer III
$113K — $208K *
Deloitte
Nashville, TN 37211 (Davidson County)
4 days ago

Get Ready For Your
Next Interview

More Jobs at Luma Financial Technologies

AI Quality Engineer
$90K — $115K *
Cincinnati, OH 45238 (Hamilton County)
4 days ago
Finance & Insurance
In-Person
Full Stack Engineer
$100K — $120K *
Cincinnati, OH 45238 (Hamilton County)
4 days ago
Finance & Insurance
In-Person
Director, Insurance Product Sales
$120K — $150K *
Cincinnati, OH 45238 (Hamilton County)
4 days ago
Finance & Insurance
In-Person
Product Marketing Manager
$135K — $150K *
New York, NY 10025 (New York County)
1 month ago
Finance & Insurance
In-Person

More Finance & Insurance Jobs

Relationship Manager II - C&IB
$112K — $208K *
The PNC Financial Services Group, Inc
Cincinnati, OH 45238 (Hamilton County)
Reposted Today
Financial Advisor ( Family Advisor)
$88K — $150K *
Parcion Private Wealth
Bellevue, WA 98006 (King County)
Today
Mergers & Acquisitions Account Manager - Employee Benefits
$90K — $120K *
Alliant Insurance Services
New York, NY 10025 (New York County)
Reposted Today
District Senior Manager- Lee District, FL
$90K — $120K *
Wells Fargo
Lee, FL 32059 (Madison County)
Today
Senior Product Manager
$100K — $130K *
Wells Fargo
Wilmington, DE 19805 (New Castle County)
Today

Find similar AI Quality Engineer jobs:

Nationwide Cincinnati, OH

AI Quality Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar AI Quality Engineer jobs:

Get Ready For Your
Next Interview