Principal AI Engineer

Gorgias • $120K — $150K *

Toronto, ON M3C 0E3In-Person

Information Technology

8 - 10 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

8+ years of engineering experience, ideally at a Staff level
Strong backend experience, preferably with Python
Experience with distributed systems and event-driven architectures
Familiarity with tools like Kafka or Pub/Sub
Proficient in working with LLMs, including prompting and evaluation workflows
Comfortable with monitoring, observability, and system performance

Responsibilities

Architect and develop an internal evaluation platform to automate testing lifecycles
Design tools to reduce time for delivering production-ready AI agents
Collaborate with infrastructure teams to support scalable AI evaluations
Guide product squads as the 'AI Technical Lead' on agent design
Establish quality standards for AI performance organization-wide
Mentor engineers on integrating rigorous system design with AI

Benefits

5 weeks of vacation following local PTO laws
Paid sick leave and parental leave (16 weeks)
Provided with a MacBook Pro and access to private health insurance
Home workstation setup allowance up to $900 CAD
Annual learning material budget of up to $2,600 CAD
Company-wide summits and annual offsite team retreats

Full Job Description

Team & Context

Gorgias is an AI-first company building products powered by LLMs and agent-based systems.

As we scale our AI capabilities, we need to improve how we evaluate, iterate, and operate these systems in production. Today, parts of this process remain manual or fragmented, especially around prompt iteration, validation, and evaluation workflows.

This role will focus on building and scaling the systems that support AI evaluation and iteration, helping the team move faster and more reliably.

About the Role

You'll have a chance to:

Work on production AI systems used by thousands of businesses
Define how we evaluate and improve AI performance at scale
Build internal platforms and tooling used by AI and engineering teams
Reduce manual processes and improve iteration speed on AI features
Collaborate across AI, ML, and product teams
Raise the engineering bar and mentor others

What You'll Do

1. Architect the Evaluation "Factory"

End-to-End Platform Ownership: Architect and lead the development of our internal evaluation platform, moving the needle from manual testing to a fully automated lifecycle (from LLM-as-a-judge creation to production monitoring).
Accelerate Time-to-Market: Directly impact our primary KPI by designing tools and workflows that drastically reduce the time it takes to deliver a calibrated, production-ready agent.
Infrastructure Collaboration: Partner with the Orchestration team to build the robust, scalable infrastructure required to run complex evals and agentic simulations at scale.

2. Scaling AI Expertise

Squad Empowerment: Serve as the "AI Technical Lead" for product squads, guiding them through the complexities of agent design, failure analysis, and prompting best practices.
Decentralize Quality: Instead of being a bottleneck, you will build the "paved road" that allows product squads to become autonomous in measuring and maintaining their own agent quality.
Standard Setting: Define what "good" looks like for AI at [Company Name]. You'll translate non-deterministic AI behavior into predictable engineering metrics that the whole organization can trust.

3. Engineering Leadership

Mentor & Level Up: Bridge the gap between traditional software engineering and AI. You'll mentor engineers on how to apply rigorous system design to the world of LLMs and agents.
Continuous Observability: Take ownership of the feedback loop, ensuring that production insights from our agents directly inform the next iteration of our evaluation datasets.

Who You Are

8+ Years of Engineering Excellence: You are a Staff-level engineer first. You've built systems that handle high scale, and you know how to architect for long-term maintainability and performance.
Agentic Curiosity: You've moved beyond the "chatbot" phase and are actively experimenting with AI Agents. You understand that the challenge isn't the prompt, but the orchestration, state management, and reliability of the agent's actions.
Systems Thinker (Non-Deterministic Mindset): You recognize that AI is probabilistic. You are excited by the challenge of building deterministic "wrappers" and Evaluation loops around models to make them safe for production.
The "Applied" Edge: You likely come from a background in distributed systems, internal platforms, or developer tooling, and you're now applying that rigor to the AI stack.

What We're Looking For

Beyond the Wrapper: You have serious experience moving beyond simple API calls to architecting multi-stage AI orchestrations (agents, chained workflows, or custom runtime logic).
Orchestration Experience: Even if you aren't an AI researcher, you have experience building complex, multi-step workflows (e.g., temporal systems, state machines, or event-driven architectures) and want to apply this to Agentic loops.
Reliability Obsession: You understand why "vibes-based" testing doesn't work. You've started exploring or building Eval frameworks to measure how models perform against real-world data.
Infrastructure Mindset: You are comfortable with the "glue" that makes AI work: vector databases, semantic caching, and API integration with third-party tools.

Tech Stack & Experience

Strong backend experience (Python preferred)
Experience with distributed systems and event-driven architectures
Familiarity with tools like Kafka, Pub/Sub, or equivalent
Experience working with LLMs (prompting, RAG, agents, evaluation workflows)
Experience building APIs and scalable services
Understanding of monitoring, observability, and system performance

Hiring Process

Recruiter phone screen
HM Interview
System Design Interview
AI Case Study (take-home, ~1-2 hours)
Technical Deep Dive of case study
Final Leadership Interview

Perks & Benefits

5-week vacation (We follow each country's appropriate PTO Laws)

Paid sick leave

Paid parental leave (16 weeks)

MacBook Pro

Private health insurance and retirement pension (RRSP with Gorgias matching up to 4%)

For a smooth onboarding, we invite you to our Toronto office for one week (flights and accommodation handled by Gorgias)

Get up to $900 CAD to set up your workstation at home

Get up to $2,600 CAD of learning material per year (books, courses, training, and individual coaching)

Every quarter, we organize a company-wide summit to discuss where we're going and strengthen social bonds. Once per year we organize offsite team retreats and company retreats!

AI at Gorgias
At Gorgias, AI is a natural extension of how we work and build. Our teams use it every day to research, write, analyze, code, and craft better customer experiences. Everyone has access to premium AI tools (ChatGPT, Claude, Granola & others) and an annual L&D budget to explore new ones.

The real magic happens when we share what we learn. Our #powerup Slack channel is a digital petri dish of new tools and workflows, and each team has AI champions who showcase fresh ideas during weekly company-wide standups, now practically AI demo sessions.

We see AI not as a replacement for creativity or empathy, but as a multiplier, helping us move faster, think deeper, and serve customers better.

About Gorgias

Gorgias is a customer support automation platform that helps businesses deliver personalized customer service at scale. The company's software integrates with e-commerce platforms and other business tools to provide a unified view of customer interactions and automate repetitive tasks. Gorgias was founded in 2015 and is headquartered in San Francisco, with additional locations in Paris and New York.

Learn more about Gorgias

Size

100 employees

Industry

Business Services

Founded

2015

* Ladders Estimates

Similar Jobs

AI Forward Deployed Engineer - CRM & Marketing
$100K — $130K *
Hard Rock
Toronto, ON M3C 0E3
Today
AI Engineer
$90K — $120K *
Precision Castparts
Cleveland, OH 44130 (Cuyahoga County)
Today
Software Engineer II - Python
$90K — $120K *
Rocket Companies
Detroit, MI 48228 (Wayne County)
Today
Senior AI Engineer - Grafana AI/ML | USA | Remote
$129K — $217K *
Grafana Labs
Remote
Today
AI Agentic Engineer
$100K — $120K *
Tata Consultancy Services
Detroit, MI 48228 (Wayne County)
Reposted Today
Senior Machine Learning Engineer
$125K — $233K *
Paylocity
Remote
Reposted Today

Get Ready For Your
Next Interview

More Jobs at Gorgias

Principal AI Engineer
$120K — $150K *
Toronto, ON M3C 0E3
Today
Information Technology
In-Person
Manager, Enterprise Customer Success
$120K — $150K *
New York, NY 10025 (New York County)
2 weeks ago
Enterprise Technology
In-Person
Manager, Enterprise Customer Success
$90K — $130K *
Toronto, ON M3C 0E3
2 weeks ago
Enterprise Technology
In-Person
Chief of Staff, CX Ops
$120K — $150K *
New York, NY 10025 (New York County)
2 weeks ago
Enterprise Technology
In-Person
Chief of Staff, CX Ops
$130K — $180K *
San Francisco, CA 94112 (San Francisco County)
2 weeks ago
Consumer Technology
In-Person

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
1 week ago
Principal Data Engineer
$101K — $163K *
Metropolitan Council
Saint Paul, MN 55106 (Ramsey County)
Today
Senior Applications Administrator
$90K — $120K *
NexGen Data Systems
Scott Air Force Base, IL 62225 (Saint Clair County)
Today
Senior Cybersecurity Auditor
$90K — $120K *
NexGen Data Systems
Scott Air Force Base, IL 62225 (Saint Clair County)
Today
Penetration Tester with Splunk
$141K — $236K *
ManTech International
Fort George G Meade, MD 20755 (Anne Arundel County)
Today

Find similar Principal AI Engineer jobs:

Nationwide Toronto, ON

Principal AI Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Principal AI Engineer jobs:

Get Ready For Your
Next Interview