AI Research Engineer

The Path

$120K — $160K *
Healthcare
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience with AI therapy and related technologies.
  • Proficient in large language models (LLMs) including fine-tuning and training data design.
  • Experience shipping and maintaining production-level code for AI systems.
  • Strong data engineering skills, with the ability to set up production data pipelines.
  • Scientific mindset and ability to evaluate hypotheses through data analysis and experiments.
  • Demonstrated ability to prioritize projects effectively and deliver company value.
  • Exceptional communication skills with a focus on collaboration.

Responsibilities

  • Improve AI therapy quality through innovative strategies and training data management.
  • Maintain a robust evaluation stack, improving regression testing and defect detection.
  • Build and iterate on the production AI therapy codebase while ensuring reliability and cost-efficiency.
  • Oversee the entire production process from model training to deployment and monitoring.
  • Collaborate with clinicians to integrate safety and clinical guardrails into AI systems.
  • Run high-quality experiments to enhance understanding and R&D progress.
  • Translate product and clinical needs into actionable model and system changes.

Benefits

  • Opportunities for professional development and continuing education.
  • Flexible work hours with potential for remote work.
  • Collaborative work environment with access to industry experts.
  • Focus on impactful projects that advance mental health technology.
Full Job Description
Note - Below contains the outcomes and competencies for the team. If you bring standout strengths in some areas but not all, you are still encouraged to apply.

Outcomes
  1. Improve quality of AI Therapy: Deliver measurable improvements in conversation quality, therapeutic alliance, and user outcomes through fine-tuning strategies, training data curation, building RL environments, new model architectures and other AI innovations.
  2. Improve evaluation of AI quality: Improve on and maintain a robust eval stack that includes scripted tests, LLM-as-judge evaluations, human ratings, and safety checks. Improve automated regression testing, detection of defects, and observability (eg dashboards).
  3. Own AI system. Build, maintain, and iterate on the production codebase that delivers AI therapy and supports the evaluation and iteration of our AI.
  4. Productionize Models and Pipelines. Own The Path from notebook to production: training jobs, model packaging, deployment, monitoring, and rollback strategies. Keep latency, reliability, and cost within agreed budgets while enabling rapid iteration on new ideas.
  5. Improve Safety, Alignment, and Clinical Guardrails Work with clinicians and internal experts to encode clinical guidelines into prompts, reward functions, tools, and filters. Proactively identify and reduce harmful or low-quality behaviors through targeted experiments, red teaming, and mitigations.
  6. Own Research Roadmap and Experiment Velocity Run high-quality experiments from hypothesis to analysis to improve our understanding of what matters and what works. Shape and execute a focused R&D roadmap.
  7. Collaboration with Clinicians, Product, and Engineering. Translate product and clinical requirements into concrete model and system changes. Partner with full-stack product engineers so that new AI capabilities are easy to integrate and maintain in the product.
Competencies
  1. LLM and Applied ML Depth. Demonstrates strong experience with large language models, including fine-tuning, training data design, and model selection. Knows how to move core metrics on conversation quality and user outcomes, rather than chasing generic benchmarks. Can look at evals, transcripts, and metrics and quickly form grounded hypotheses for improvement.
  2. Ships clean, maintainable, quality code. No only do you know how transformers work, but you are also an engineer that has experience shipping production-level code and/or maintaining an AI system in production.
  3. Data Engineering Skills. Can set up production-level data pipelines for training new models, evals, analysis, etc.
  4. Scientific Mindset. You formulate hypotheses, and you are good at evaluating them (eg through experiments, data analysis, etc). You are consistently learning at the cutting edge, and you're able to leverage and communicate those learnings to make the entire company more successful.
  5. Ruthless Prioritizer. You are keenly aware of how to provide company value and to prioritize projects accordingly. Resistant to nerd-sniping.
  6. Quality Obsessive: Refuses to ship subpar work, continuously improving the codebase.
  7. Fast: Prioritizes speed by leveraging AI, breaking down complex tasks, shipping early, optimizing for learnings, iterating quickly, and avoiding over-engineering.
  8. Strong communicator. You can work collaboratively in a positive way. Sees others perspectives. Strong opinions, loosely held. Focused on user/business value, not ego.
Great to have
  • Personal or other experience with therapy or coaching
  • Domain knowledge of psychology, neuroscience, therapy, or coaching.

Similar Jobs

More Jobs at The Path

  • Head of Growth
    $120K — $180K *
    San Francisco, CA 94112 (San Francisco County)
    Consumer Technology
    In-Person
  • AI Research Engineer
    $120K — $160K *
    San Francisco, CA 94112 (San Francisco County)
    Healthcare
    In-Person

More Healthcare Jobs

Find similar AI Research Engineer jobs: