AI QA Engineer (Multilingual)

Scaled Cognition

• $90K — $130K *

New York, NY 10025 (In-Person)

Information Technology

Less than 5 years of experience

Today

Job Overview by Ladders

Qualifications

Strong technical background with hands-on Python coding experience and Git/GitHub proficiency.
Fluency in English and native or near-native proficiency in at least one other language.
Deep understanding of Large Language Models and their failure modes.
Proven experience in Quality Assurance, Data Quality, or Data Engineering with a history of auditing large datasets.
Exceptional written communication skills across multiple languages.

Responsibilities

Inspect and review LLM training data and evaluation test cases for quality and accuracy.
Maintain local development environments to run test pipelines and investigate edge cases.
Act as a technical data detective, digging into training data to identify error cases.
Leverage LLMs to translate, verify, and manage cross-lingual datasets.
Collaborate with the engineering team to refine evaluation criteria and improve data pipelines.

Benefits

Work in a fast-paced environment with ownership of data quality.
Engage in hands-on data inspection and problem-solving with a technical focus.
Collaborate with a dedicated engineering team.
Opportunity to leverage linguistic skills and enhance multilingual datasets.

Full Job Description

AI QA Engineer (Multilingual)

As an AI QA Engineer (Multilingual) at Scaled Cognition, you will be the final line of defense for our model's quality. You'll sit at the critical intersection of data engineering, quality assurance, and linguistics, ensuring our LLM training data and evaluation sets are flawless. You'll be getting your hands dirty, meticulously inspecting data, and making direct code contributions to fix issues. If you love the idea of turning messy, imperfect data into gold and have the technical chops to automate parts of that cleanup, you will thrive here.

What you'll do:

Meticulously inspect, review, and grade LLM training data, evaluation test cases, and model outputs to ensure maximum quality and accuracy.
Maintain local development environments to run test pipelines, investigate edge cases, and submit PRs via Git/GitHub to update our training repositories.
Act as a technical data detective, diving deep into training data to spot error cases.
Leverage LLMs as internal tools to translate, verify, and maintain our cross-lingual datasets.
Collaborate closely with the engineering team to refine our evaluation criteria and improve our data pipelines.

You might be the right person for the job if you:

Have an obsessive attention to detail and get a dopamine hit from finding the one edge case or bad translation that broke a prompt.
Are a builder who doesn't mind the weeds. You understand that high-quality AI is built on rigorous, sometimes repetitive data inspection, and you embrace that reality.
Are technically self-sufficient. You're comfortable navigating a terminal, running Python scripts locally, and managing your own version control.
Love languages and understand the linguistic nuances required for high-quality translation and cross-lingual model evaluation.
Thrive in a fast-paced environment where you can take ownership of the data quality that directly drives model performance.

Key Qualifications:

Strong technical background with hands-on coding experience (Python preferred) and proficiency with Git/GitHub.
Fluency in English and native or near-native proficiency in at least one other language.
Deep understanding of Large Language Models, their failure modes (hallucinations, formatting errors), and effective prompting techniques.
Proven experience in Quality Assurance, Data Quality, or Data Engineering, with a track record of auditing and maintaining large datasets.
Exceptional written communication skills across multiple languages.

* Ladders Estimates
