Research Engineer

talentpluto

$120K — $150K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Proficient in Python and experienced in Linux environments
  • Experience with Docker and reproducible development workflows
  • Skilled in managing large-scale datasets for validation, transformation, or analysis
  • Demonstrated problem-solving capabilities and quick adaptability in technical settings
  • Self-starter capable of delivering results in a fast-paced, early-stage environment
  • Strong written and verbal communication skills, with experience collaborating across time zones

Responsibilities

  • Define quality standards for training datasets used in reinforcement learning
  • Build tools and workflows for auditing supplier-generated datasets
  • Evaluate and implement human-in-the-loop review workflows to enhance quality
  • Collaborate with external data suppliers to resolve quality issues and improve processes
  • Integrate QA insights into internal tools and supplier interfaces to minimize inconsistencies
  • Monitor and track QA outcomes to refine processes and documentation

Benefits

  • Medical, dental, and vision coverage
  • Meals provided
  • 401(k) retirement plan
  • Commuter benefits
  • Wellness perks

Full Job Description

Location: San Francisco Bay Area
Work model: On-site (some team members are remote, but this role is currently on-site)
Industry: AI infrastructure / Reinforcement Learning (RL) training data & evaluations
Compensation: Competitive (range not provided) + benefits (medical/dental/vision coverage, meals, 401(k), commuter benefits, wellness perk)

The Opportunity

Our partner is hiring a Research Engineer to help scale the quality assurance (QA) systems behind training data generated through their infrastructure. This role sits at the intersection of data quality, tooling, and applied ML operations: you'll build the standards, pipelines, and feedback loops that ensure datasets are reliable, consistent, and ready for training and evaluation.

You'll work closely with internal stakeholders and external data suppliers to diagnose quality issues, improve workflows, and continuously fold QA learnings back into the platform. If you enjoy building systems that make high-quality data scalable, and want to do it in a high-ownership, fast-paced environment, this role is a strong fit.

Responsibilities
  • Define and enforce quality standards for training datasets used for RL training and evaluation
  • Build tooling and workflows to audit supplier-generated datasets, including sampling strategies, validation pipelines (rule-based and model-assisted), and feedback loops
  • Evaluate and implement human-in-the-loop review workflows where beneficial to improve quality and efficiency
  • Partner with external data suppliers to debug quality issues, provide actionable feedback, and improve their data generation processes
  • Integrate QA learnings into internal tools and supplier portals to reduce anomalies, inconsistencies, and edge cases over time
  • Track QA outcomes and continuously improve processes, metrics, and documentation

Requirements
  • Proficiency with Python and experience working in Linux environments
  • Experience with Docker and reproducible development/deployment workflows
  • Experience working with large-scale datasets (validation, transformation, or analysis)
  • Strong problem-solving skills and evidence of rapid learning in technical environments
  • Ability to operate independently and deliver results in an early-stage, fast-moving setting
  • Clear written and verbal communication skills (including collaborating across time zones)

Nice to have
  • Experience building data validation pipelines and/or human-in-the-loop review systems
  • Familiarity with common training-data failure modes and techniques to detect subtle inconsistencies
  • Comfort designing QA metrics, experiments, and processes, not just executing predefined checks
  • Familiarity with modern AI tooling and LLM capabilities
