The role
You'll own the intelligence at the core of Blockit: our scheduling agents that autonomously coordinate meetings across people, time zones, and constraints.
This includes designing and iterating on agent architectures, writing and refining prompts, building evaluation frameworks, and shipping new capabilities as models improve. You'll work across our orchestrator agents (which manage conversation flow) and specialized sub-agents, with the goal of making Blockit smarter, faster, and capable of handling increasingly complex coordination problems.
You'll be architecting and building real-world AI agents used in production.
What you'll do
- Write and refine prompts across our agent system: orchestrators, sub-agents, and tools
- Build and maintain evals to measure agent quality and catch regressions
- Debug agent failures: figure out why an agent misunderstood a request or made a bad call
- Implement new agent capabilities as user needs expand
- Experiment with new architectures and techniques as models improve
- Instrument and analyze agent behavior to find patterns and failure modes
What we're looking for
- 2+ years of experience shipping and owning production software
- Strong backend engineering skills, with the ability to work across the stack when needed
- Experience working with LLMs in production systems (or a demonstrated ability to learn quickly in this space)
- Deep curiosity about agent architectures and how the industry is evolving beyond simple prompt-based systems
- Clear, structured communicator who can explain what's working, what isn't, and why
Location
- San Francisco, CA. On-site 4 days per week