Prompting & AI Agent Research Engineer

Hello Patient

$180K — $230K *
US-AnywhereRemote in United States
Healthcare
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of experience in prompting, AI research, or applied AI roles
  • Advanced degree (PhD preferred) in a relevant research-oriented field
  • Proven experience in prompt engineering to influence system behavior
  • Strong foundation in experimental design and statistical significance
  • Hands-on experience with large language models (LLMs) in practical settings
  • Familiarity with retrieval-augmented generation (RAG) and agentic architectures
  • Effective written communication skills for diverse audiences.

Responsibilities

  • Design structured experiments with controls to ensure valid results
  • Build regression test coverage to identify unintended consequences of prompt changes
  • Develop a validated list of improvements for engineering use
  • Collaborate with the team to prioritize features based on experimental data
  • Oversee prompt design across Mia's workflows, specifically in healthcare context
  • Stay abreast of latest developments in prompting and agent architecture research
  • Prototype new methodologies before implementing in production.
  • Translate experiment outcomes into actionable recommendations.

Benefits

  • Equity in a rapidly growing healthtech startup
Full Job Description
About the Role

Hello Patient is seeking a Prompting & AI Agent Research Engineer to bring structure and rigor to how we advance our AI. We build Mia, an AI voice agent that handles inbound patient calls for multi-location healthcare providers - and as we grow, we need someone who can help us make more confident, evidence-backed decisions about how Mia evolves. You'll design experiments we can trust, stay close to what's emerging in the research community, and turn promising ideas into something testable.

This role sits at the intersection of prompt engineering and applied AI research. You should be comfortable enough technically to prototype and run local experiments independently. Above all, we're looking for someone with deep, hands-on prompting experience - someone who has deliberately engineered prompts to change how a system behaves.
What You'll Do
Experimentation & Evaluation
  • Design structured experiments with real controls and statistical rigor
  • Build regression test coverage so we know when a global prompt change is breaking things across workflows we didn't expect
  • Create a pipeline of validated, high-confidence improvements engineering can pull from instead of building on faith
  • Work with the team to prioritize what actually gets shipped based on what the experiments say
Prompting & Agent Architecture Research
  • Own prompt design across Mia's agent workflows - multi-turn, voice-first, healthcare context
  • Stay current on what's coming out in prompting research, and agent architecture, and bring back ideas worth testing
  • Prototype new approaches locally before anything touches production
  • Be the person the Product team comes to when they're stuck on a prompting problem
Collaboration & Knowledge Sharing
  • Work closely with our Product and Engineering teams - understand where Mia is struggling and help decide what's worth experimenting on
  • Translate experiment results into clear recommendations people can actually act on
  • Build out evaluation templates and frameworks so this doesn't live only in your head
What We're Looking For
Must Have
  • 5+ years of experience in a prompting, AI research, or applied AI role
  • Advanced degree in a research-oriented field (PhD preferred) - CS, linguistics, cognitive science, stats, or similar
  • Real prompt engineering experience - deliberately designing, testing, and improving prompts to change system behavior
  • Solid experimental design fundamentals: controls, statistical significance, knowing when a result actually means something
  • Hands-on experience working with LLMs in applied contexts
  • Comfort with RAG, agentic architectures, and modern LLM tooling
  • Ability to evaluate and validate AI system behavior - understanding what the model is doing and why
  • Clear written communication; your findings need to land with engineers and non-technical stakeholders alike
Preferred
  • Experience with voice AI or conversational systems
  • Research background with a focus on prompting techniques, prompt optimization, or evaluation of LLM behavior
  • Time at an AI lab, applied AI team, or early-stage startup
  • Familiarity with LLM evaluation frameworks and behavioral test suites
  • Published or applied work in prompt design, chain-of-thought, or related areas
Compensation

Base salary: $180,000 - $230,000

Equity: Meaningful ownership in a fast-growing healthtech startup

Similar Jobs

More Jobs at Hello Patient

More Healthcare Jobs

Find similar Prompting & AI Agent Research Engineer jobs: