AI Engineer - Data Intelligence

Clarium

$150K - $180K
US-Anywhere, Remote in United States
Healthcare
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Strong Python skills with a proven track record in production coding.
  • Proficient in SQL, including complex queries and data modeling.
  • Comfortable in ambiguous environments with strong problem-scoping abilities.
  • Committed to maintaining data quality and addressing silent bugs.
  • Willingness to learn and develop expertise in unfamiliar domains.

Responsibilities

  • Build and maintain components of the master data enrichment pipeline.
  • Design and own workflows that integrate deterministic logic and LLMs for data processing.
  • Develop evaluation harnesses and regression suites to ensure pipeline quality.
  • Produce quality Python and SQL code, focusing on coding over configurations.
  • Analyze data using statistics and ML to drive actionable insights.
  • Audit data proactively to identify and resolve quality issues.

Benefits

  • Equity options tied to salary package.
  • Fully remote work with optional co-working space in NYC.
  • Unlimited paid time off (PTO).
  • Comprehensive health, vision, and dental insurance.
  • 401K retirement plan with company contribution.
  • Opportunity to contribute to a well-rooted team with a focus on meaningful work.
Full Job Description
The Opportunity

AI-powered platforms, like Clarium's, deliver the highest impact when they are supported by high-quality data. As we scale to more health systems and deepen our offering of intelligent, data-driven workflows, the master data enrichment pipeline (the system that classifies and contextualizes every product flowing through a hospital's supply chain) has become a critical growth lever. We're investing in the team and infrastructure to make that layer faster, smarter, and more reliable.

You'll join the Data Products team, a small, unusually senior group responsible for the data assets, data science, and analytics that drive measurable value for our clients. Day-to-day, you'll build and own components of our enrichment pipeline: classification workflows, entity resolution systems, evaluation harnesses, and the production tooling that keeps it all running. You'll work closely with engineers and data scientists who've shipped real ML systems at scale, and your work will feed directly into decisions made by supply chain teams at some of the country's leading health systems.

This is a rare early-career opportunity to learn fast and own real work from day one. As the first junior hire on the team, you won't be buried under layers of abstraction. You'll work directly alongside people who've done this before, on problems that actually matter. You'll get short feedback loops, real stakes, and the kind of hands-on growth that's hard to find this early in a career. It's the opportunity many of us wish we'd had starting out.

In This Role You Will
  • Build and maintain components of Clarium's master data enrichment pipeline, the system that classifies and enriches every product flowing through our platform
  • Design and own classification and entity resolution workflows that combine deterministic logic and LLMs for production data processing
  • Build and operate evaluation harnesses, label sets, and regression suites (we use Braintrust) to measure and improve pipeline quality with confidence
  • Write production Python and SQL; the majority of your time will be spent in code, not in configuration tools
  • Analyze complex datasets using statistics and ML to surface actionable insights and inform pipeline improvements
  • Proactively audit data for quality issues; find the problems no one else has noticed yet, diagnose root causes, and ship fixes

What You'll Bring
  • Strong Python skills and a track record of writing production code, not just scripts or notebooks
  • Strong SQL, including complex joins, window functions, performance tuning, and data modeling
  • Comfort working in ambiguous environments; you can scope a problem, make a plan, and execute without hand-holding
  • A genuine, non-negotiable commitment to data quality; you treat silent bugs as real failures
  • Ability to go deep on an unfamiliar domain and develop meaningful expertise over time

Nice to Have
  • Experience with LLM integrations, prompt evaluation, or classification at scale
  • Familiarity with eval frameworks such as Braintrust, Promptfoo, or equivalent
  • Prior work in healthcare, supply chain, or another domain where data quality has direct operational consequences

Skills & Tools You'll Use

Need to Know: Python • SQL • PostgreSQL • CI/CD • Production observability

Nice to Know: Temporal • Braintrust • Snowflake • AWS • Sigma

What You Get at Clarium

Target Base Salary Range: $150K - $180K

The base salary Clarium offers may vary depending upon the ultimate scope and responsibilities of the position and on the candidate's job-related knowledge, skills, and experience. The total package will include equity, in addition to a full range of medical and/or other benefits, depending on the position offered. Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.

  • Incentive Stock Options proportionate to your salary
  • Fully remote, with a NYC co-working space available; distributed team across multiple time zones with opportunities for in-person time
  • Unlimited PTO
  • Top-tier health, vision, and dental benefits
  • 401K
  • The opportunity to build on a strong foundational team with deep data and engineering roots at a stage where your work genuinely shapes the product
