Software Engineer, AI/ML GenAI

Instrumentl

• $175K — $220K *

US-AnywhereRemote in United States

Information Technology

5 - 7 years of experience

1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of software engineering experience, with 2+ years in LLMs.
Experience in taking LLM/RAG systems from prototype to production.
Proficiency in building agentic systems and tool integrations.
Strong understanding of RAG concepts and hybrid search methodologies.
Hands-on experience with embeddings and vector databases.
Ability to design evaluation suites for AI systems.
Proficient in Python, and familiar with cloud services (GCP/AWS) and CI/CD practices.

Responsibilities

Design and produce observable, efficient AI systems for various tasks.
Oversee the end-to-end process of RAG systems, from data ingestion to citation.
Manage embedding models and ensure vector stores remain current and effective.
Collaborate cross-functionally to improve user experience and system reliability.
Conduct experiments and iterate based on performance results.

Benefits

100% health, dental, and vision insurance coverage for employees; 50% for dependents.
Generous PTO including parental leave.
401(k) plan available.
Company-provided laptop and home workstation setup stipend.
Opportunities for company retreats to foster team connection.
Engagement in meaningful work with nonprofit organizations.

Full Job Description

About the Role : As a Software Engineer, AI/ML GenAI at Instrumentl, you'll own the full lifecycle of AI features-from rapid prototyping to production deployment and ongoing evaluation. You will build agentic LLM systems that can plan and use tools, implement RAG pipelines over our domain data, manage and evolve embeddings, and stand up evaluation/observability so our AI is grounded, safe, and cost-effective. You'll embed with one of the product pods in a hands-on role, collaborating closely with Product and Design, while partnering with DTI on platform-level AI capabilities.

The Instrumentl team is fully distributed (though if you'd like to work from our Oakland office, we would love to see you there). For this position, we are looking for someone who has overlap with Pacific Time Zone working hours.

What you will do

Design agentic systems & ship AI to production: Build resilient, observable services, while optimizing cost and latency budgets. Build tool-using LLM "agents" (task planning, function/tool calling, multi-step workflows, guardrails) for tasks like grant discovery, application drafting, document parsing and many more.
Own RAG end-to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re-ranking, citations, and grounding. Continuously improve recall/precision.
Manage embeddings at scale: Select, evaluate, and migrate embedding models; maintain vector stores (e.g., pgvector/Qdrant/Pinecone etc.); monitor drift and rebuild strategies.
Collaborate cross-functionally while raising engineering standards: Work side by side with Product, Design on scoping, UX, and measurement; run experiments (A/B, canaries), interpret results, and iterate. Write clear, maintainable code, add tests and docs, and contribute to reliability practices (alerts, dashboards, incident response).

What we're looking for

Software engineering background: 5+ years of professional software engineering experience (as an IC), including 2+ years working with modern LLMs.
Proven production impact: You've taken LLM/RAG systems from prototype to production, owned reliability/observability, and iterated post-launch based on evals and user feedback.
LLM agentic systems: Experience building tool/function-calling workflows, planning/execution loops, and safe tool integrations (e.g., with LangChain/LangGraph, LlamaIndex, Semantic Kernel, or custom orchestration).
RAG expertise: Strong grasp of document ingestion, chunking/windowing, embeddings, hybrid search (keyword + vector), re-ranking, and grounded citations. Experience with re-rankers/cross-encoders, hybrid retrieval tuning, or search/recommendation systems.
Embeddings & vector stores: Hands-on with embedding model selection/versioning and vector DBs (e.g., pgvector, Qdrant, Pinecone, Weaviate, Milvus etc.).
Evaluation mindset: Comfort designing eval suites (RAG/QA, extraction, summarization), using automated and human-in-the-loop methods; familiarity with frameworks like Ragas/DeepEval/OpenAI Evals or equivalent.
Infrastructure & languages: Proficiency in Python (FastAPI, Celery); Experience with GCP/AWS, Docker, CI/CD, and observability (logs/metrics/traces).
Data chops: Comfortable with SQL, schema design, and building/maintaining data pipelines that power retrieval and evaluation.
Collaborative approach: You thrive in a cross-functional environment and can translate research ideas into shippable, user-friendly features.
Results-driven: Bias for action and ownership with an eye for speed, quality, and simplicity.

Nice to have

Startup Experience and comfort operating in fast, scrappy environments is a plus.
Familiarity with responsible AI, red-teaming, and domain-specific safety policies.
Fine-tuning: Practical experience with SFT/LoRA or instruction-tuning (and good intuition for when fine-tuning vs. prompting vs. model choice is the right lever).

Compensation & Benefits

Salary ranges are based on market data, relative to our size, industry, and stage of growth. Salary is one part of total compensation, which also includes equity, perks, and competitive benefits.
For US-based candidates, our target salary band is $175,000 - $220,000/year + equity. Salary decisions will be based on multiple factors including geographic location, qualifications for the role, skillset, proficiency, and experience level.
100% covered health, dental, and vision insurance for employees, 50% for dependents.
Generous PTO policy, including parental leave.
401(k).
Company laptop + stipend to set up your home workstation.
Company retreats for in-person time with your colleagues.
Work with awesome nonprofits around the US. We partner with incredible organizations doing meaningful work, and you get to help power their success.

* Ladders Estimates

Similar Jobs

Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga
$56K — $196K *
Photon
Irving, TX 75061 (Dallas County)
Reposted Today
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
$215K — $245K *
Capital One Financial Corporation
San Francisco, CA 94112 (San Francisco County)
Reposted Today
Sr Engineer, Enterprise AI
$133K — $240K *
T-Mobile
Frisco, TX 75034 (Denton County)
Reposted Today
Sr Engineer, Enterprise AI
$133K — $240K *
T-Mobile
Washington, DC 20011 (District Of Columbia County)
Reposted Today
Sr Engineer, Enterprise AI
$133K — $240K *
T-Mobile
Bellevue, WA 98006 (King County)
Reposted Today
Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga
$56K — $196K *
Photon
Irving, TX 75061 (Dallas County)
Reposted Today

Get Ready For Your
Next Interview

More Jobs at Instrumentl

Customer Success Manager
$80K — $100K *
Remote
2 weeks ago
Education, Government & Non-Profit
Remote in United States
Customer Support & Operations Team Lead
$95K — $115K *
Remote
3 weeks ago
Education, Government & Non-Profit
Remote in United States
Sales Manager - Mid-Market and Enterprise
$90K — $130K *
Remote
4 weeks ago
Education, Government & Non-Profit
Remote in United States
Senior Full Stack Software Engineer
$160K — $190K *
Remote
4 weeks ago
Education, Government & Non-Profit
Remote in United States
Senior Data Engineer
$120K — $150K *
Remote
4 weeks ago
Information Technology
Remote in United States

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
4 days ago
Principal Product Manager - Virtualization Architect
$172K — $328K *
Hewlett Packard Enterprise Development LP
Fall City, WA 98024 (King County)
Reposted Today
Manager, Information Technology & Business
$90K — $120K *
Formations, Inc.
Edmonton, AB T5A 0A1
Today
Software Engineer IV, Data
$100K — $130K *
ACV
Toronto, ON M3C 0E3
Today
Server Administrator
$80K — $110K *
ActioNet, Inc
Remote
Reposted Today

Find similar Software Engineer, AI/ML GenAI jobs:

Nationwide Remote

Software Engineer, AI/ML GenAI

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Software Engineer, AI/ML GenAI jobs:

Get Ready For Your
Next Interview