Grafana Labs

Senior AI Engineer | US | Remote

Grafana Labs$154K — $185K *
US-AnywhereRemote in United States
Enterprise Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years in software engineering with backend development or systems integration focus.
  • 2+ years applying LLMs/AI in production environments, not just prototypes.
  • Proficient in Python and JavaScript/Node.js with strong Git-based workflow experience.
  • Experience with LLM frameworks including prompt engineering and RAG.
  • Hands-on with multi-agent systems, including orchestration patterns and production monitoring.
  • Ability to diagnose business problems and think in workflows.
  • Familiarity with Google Cloud Platform and serverless/container technologies.

Responsibilities

  • Own multi-agent AI systems development from architecture to ongoing operation.
  • Build modular agentic systems using orchestration frameworks for 24/7 operation.
  • Develop reusable agentic skills for various interfaces including Slack and dashboards.
  • Implement observability for logging, performance metrics, and model evaluation.
  • Establish governance and compliance standards for AI workflows and data handling.
  • Build APIs and microservices connecting AI models to business systems effectively.
  • Design automated workflows with CI/CD processes for partner teams.

Benefits

  • 100% remote work with a diverse global culture.
  • Opportunities for career growth and development.
  • Transparent communication and regular updates company-wide.
  • Innovation-driven environment with autonomy.
  • Open-source community values shaping company culture.
  • 30 days of annual leave with dedicated shutdown days for disconnection.
  • In-person onboarding experience for new hires to foster connection.
Full Job Description
The Opportunity

Grafana Labs is seeking a Senior Engineer (AI & Automation) to own the AI agent infrastructure and automation platform that powers our Marketing Operations organization. You'll build multi-agent architectures, LLM integrations, and backend services that connect AI models to internal and third-party data platforms. You'll ship production systems that teams depend on daily.

This is a high-autonomy role where you own the technical direction. You'll identify the highest-leverage problems across Marketing, RevOps, and SDR teams, design the solutions, and ship them. You'll define the technical direction for the automation platform (data models, API contracts, shared libraries, reference architectures) and partner with Data Engineering, GTM Systems, and Field Operations to build scalable, self-service automation that eliminates manual work and drives operational efficiency.

What You'll Be Doing

Agentic Systems & AI Infrastructure
  • Own end-to-end development of multi-agent AI systems, from architecture and implementation through testing, deployment, and ongoing operation
  • Build modular, composable agentic systems using orchestration frameworks (LangChain, CrewAI, Anthropic MCP, or similar) that operate 24/7 across teams
  • Develop reusable agentic skills that agents invoke across interfaces (Slack, dashboards, internal apps, CLIs)
  • Implement observability and feedback loops including logging, performance metrics, prompt iteration, model evaluation, and cost management
  • Establish governance and compliance standards for AI workflows including access controls, audit trails, PII handling, and human-in-the-loop escalation paths

Systems Integration & Backend Services
  • Build MCP servers, APIs, CLIs, and microservices connecting AI models to business systems (BigQuery, Slack, CRMs, email, calendars, analytics tools)
  • Architect data flows for retrieval-augmented generation (RAG), connecting LLMs to internal knowledge bases, customer data, and real-time business context
  • Build serverless or containerized services (GCP Cloud Functions, Cloud Run) that scale with usage and integrate with Grafana's cloud infrastructure

Automation & Workflow Enablement
  • Partner with RevOps, Demand Generation, Regional Marketing, and SDR teams to scope high-impact automation problems, identify bottlenecks, and build solutions with measurable business outcomes
  • Design and deploy workflows using orchestration tools (n8n, Workato, or custom platforms) with CI/CD, testing, and production reliability standards
  • Build systems designed for self-service with documentation, playbooks, and enablement materials that let partner teams operate independently

We invest heavily in developer productivity. You'll have access to AI coding assistants (Claude Code, Gemini CLI, OpenAI Codex, and others of your choice within security guidelines). We encourage pragmatic AI-assisted development paired with strong code review and quality standards.

What Makes You a Great Fit
  • 8+ years of software engineering experience with depth in backend development, systems integration, or data/analytics engineering
  • 2+ years hands-on experience applying LLMs/AI to production workflows, not just prototypes
  • Strong proficiency in Python and JavaScript/Node.js with Git-based workflows, code review practices, and testing discipline
  • Hands-on experience with LLM frameworks and patterns including prompt engineering, RAG, function calling/tool use, structured output parsing, and evaluation
  • Experience building and operating multi-agent systems at scale including agent decomposition, orchestration patterns (sequential chains, router/dispatcher, parallel fan-out), state management, and production monitoring
  • You diagnose business problems before writing code. You think in workflows and outcomes, not just functions.
  • Deep familiarity with Google Cloud Platform, BigQuery, and serverless/containerized services (Cloud Functions, Cloud Run)
  • Understanding of LLM failure modes and production mitigations including confidence thresholds, fallback logic, human escalation, and cost/latency management
  • Proven ability to identify high-leverage problems, push back on low-impact requests, and deliver end-to-end with minimal direction
  • Fluent with AI-assisted development tools (GitHub Copilot, Cursor, Claude Code). You use AI to build AI systems
  • Clear technical communicator who can explain complex systems in simple terms to both engineers and business stakeholders

Bonus Points
  • Experience with vector databases or retrieval pipelines (Pinecone, Weaviate, ChromaDB, Qdrant, pgvector)
  • Familiarity with marketing or sales platforms (Salesforce, Customer.io, HubSpot, Marketo, Outreach)
  • Experience with frontend frameworks (React, Slack Block Kit) for building user-facing AI tool interfaces
  • Observability tooling for AI systems (LangSmith, Weights & Biases, custom evaluation frameworks)
  • Experience with workflow orchestration platforms (n8n, Temporal, Prefect, Airflow)
  • Familiarity with Model Context Protocol (MCP) or similar standards for connecting AI systems to data sources
  • Prior work automating marketing, sales, or customer success workflows in a B2B SaaS environment
  • Active in open-source communities. Grafana is built on OSS and we value engineers who share that DNA

In the United States, the base compensation range for this role is USD $154,445 - USD $185,334. Actual compensation may vary based on level, experience, and skillset as assessed throughout the interview process. All of our roles include Restricted Stock Units (RSUs), giving every team member ownership in Grafana Labs' success. We believe in shared outcomes-RSUs help us stay aligned and invested as we scale globally.

*Compensation ranges are country specific. If you are applying for this role from a different location than listed above, your recruiter will discuss your specific market's defined pay range & benefits at the beginning of the process.

Why You'll Thrive at Grafana Labs:
  • 100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
  • Scaling Organization - Tackle meaningful work in a high-growth, ever-evolving environment.
  • Transparent Communication - Expect open decision-making and regular company-wide updates.
  • Innovation-Driven - Autonomy and support to ship great work and try new things.
  • Open Source Roots - Built on community-driven values that shape how we work.
  • Empowered Teams - High trust, low ego culture that values outcomes over optics.
  • Career Growth Pathways - Defined opportunities to grow and develop your career.
  • Approachable Leadership - Transparent execs who are involved, visible, and human.
  • Passionate People - Join a team of smart, supportive folks who care deeply about what they do.
  • In-Person onboarding - We want you to thrive from day 1 with your fellow new 'Grafanistas' to learn all about what we do and how we do it.
  • Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect. *We will comply with local legislation where applicable.

About Grafana Labs

Grafana Labs is a software company that provides an open-source platform for data visualization and monitoring. The company's flagship product, Grafana, is a popular tool used by developers and IT professionals to create dashboards and alerts for various data sources. Grafana Labs also offers a cloud-based version of its platform, Grafana Cloud, which provides additional features and integrations. The company's mission is to democratize data and help organizations make better decisions by providing easy-to-use tools for data visualization and monitoring.
Learn more about Grafana Labs
Size
250 employees
Industry
Founded
2014

Similar Jobs

More Jobs at Grafana Labs

More Enterprise Technology Jobs

Find similar Senior AI Engineer | US | Remote jobs: