Trellix

Staff Software Engineer

Trellix$120K — $150K *
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of Python application software development experience, especially with production AI platforms.
  • Proficiency in Linux with preferred Shell scripting skills.
  • Excellent debugging and development capabilities in Python, especially with REST and async Web APIs.
  • Hands-on experience with Large Language Models (LLMs) and Langchain for multi-agent workflows.
  • Experience with CI/CD pipelines, such as GitHub Actions, and strong documentation skills.

Responsibilities

  • Lead the design and development of a generative AI platform from inception to production.
  • Own end-to-end feature functionality, overseeing design, testing, and operational health.
  • Develop and refine multi-agent workflows for autonomous security operations.
  • Deliver solutions that meet project goals on time and budget, ensuring high quality.
  • Implement security and resilience best practices for sensitive security applications.
  • Design observability tools and dashboards to track platform health and usage.
  • Conduct technical analysis, create documentation, and oversee production issue resolution.

Benefits

  • Comprehensive retirement plans.
  • Medical, dental and vision insurance coverage.
  • Generous paid time off policy.
  • Paid parental leave for employees.
  • Encouragement and support for community involvement.
Full Job Description
Job Title:
Staff Software Engineer

Role Overview:
Role Overview:
Join our innovative team at Trellix, where you'll lead the design and development of a cutting-edge generative AI platform powering advanced AI capabilities across the entire Trellix security portfolio. This isn't prototype work. You'll be building and operating production agentic systems deployed in federal environments, enabling autonomous SOC workflows, multi-agent orchestration, and seamless AI integration across our security products. We're looking for a highly skilled Software Development Engineer with a passion for building robust, scalable, and secure AI solutions that operate at real-world scale.

About the Role:
  • Design and Develop: Lead the design and development of our generative AI platform, driving core functionality, agentic workflows, and platform-level features from concept through production.
  • End-to-End Ownership: Take full ownership of features and functions, from initial design and development through rigorous testing, automation, and ongoing operational health.
  • Agent Workflow Development: Build, iterate on, and harden multi-agent pipelines, including tool use, inter-agent coordination, and autonomous decision workflows for security operations.
  • Deliver High-Quality Solutions: Ensure solutions are delivered on time, within budget, and to the highest quality standards, meeting project goals and customer commitments.
  • Ensure Resilience & Security: Proactively implement best practices to ensure applications are highly resilient, secure, and performant, with particular attention to the sensitivity of security operations data.
  • Observability & Telemetry: Design and implement instrumentation using OpenTelemetry, contribute to operational dashboards, and surface platform health and usage insights to engineering leadership and stakeholders.
  • Technical Analysis & Documentation: Analyze feature requirements and produce detailed design documentation, architectural decision records, and async-friendly technical specs.
  • Production Reliability & Incident Response: Own production issues end-to-end, including triage, root cause analysis, post-mortems, and SLA commitments, for a platform operating in high-stakes environments.

About You:
  • Experience: 5+ years of professional experience in Python application software development, with demonstrated experience building and operating production AI or platform systems.
  • Operating Systems:
    • Linux proficiency is a must
    • Shell scripting (preferred)
  • Core Development Skills:
    • Excellent development and debugging skills in Python
    • Strong grasp of data structures and design patterns
    • Proficiency with REST and async Web APIs
    • CI/CD pipeline experience (GitHub Actions or equivalent)
    • Strong written communication for design docs and async collaboration
    • Ability to operate with autonomy in a fast-moving, ambiguous environment
  • AI/ML Knowledge:
    • Hands-on experience with Large Language Models (LLMs) in production
    • Langchain experience required, including building and operating stateful multi-agent workflows
    • Experience with prompt orchestration and chain composition
    • Familiarity with Agentic AI concepts and patterns (ReACT, chain-of-thought, tool use, Deep Agents)
    • Experience deploying and operating vLLM for self-hosted inference
    • Familiarity with MCP (Model Context Protocol) for agentic tool integration
  • Frameworks & Technologies:
    • FastAPI (preferred)
    • Node.js / TypeScript for tooling and API integration layers (preferred)
  • Databases:
    • Postgres (preferred)
    • Knowledge Graphs, including NebulaGraph or equivalent (preferred)
    • Vector Databases, including Qdrant or equivalent (preferred)
    • Embedding pipeline experience including chunking strategies and retrieval tuning (preferred)
  • Services & Tools:
    • Gunicorn or Uvicorn
    • OpenTelemetry (OTEL) instrumentation
    • Redis (preferred)
    • Langfuse or LangSmith for agent observability (preferred)
    • Kubernetes (preferred)
    • AWS: RDS, EKS, Elasticache, Bedrock (preferred)
  • Domain Knowledge:
    • Working knowledge of threat detection, EDR telemetry, SOC workflows, or SIEM platforms strongly preferred
    • Understanding of Security Incident and Event Management (SIEM) and Incident Response a plus
  • Soft Skills:
    • Excellent communication and collaboration skills with the ability to work effectively across engineering, product, and security research teams
    • Ability to communicate technical decisions and tradeoffs clearly to non-engineering stakeholders
    • Strong written communication for design documentation and distributed team collaboration
    • Comfortable operating with high autonomy and minimal oversight in a fast-moving, ambiguous environment


Company Benefits and Perks:

We believe that the best solutions are developed by teams who embrace each other's unique experiences, skills, and abilities. We work hard to create a dynamic workforce where we encourage everyone to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.
  • Retirement Plans
  • Medical, Dental and Vision Coverage
  • Paid Time Off
  • Paid Parental Leave
  • Support for Community Involvement


About Trellix

Trellix was a software company that provided web publishing tools for small businesses and individuals. The company was founded in 1999 and was headquartered in Cambridge, Massachusetts. Trellix was acquired by HP in 2003 and its technology was integrated into HP's Small Business Center.
Learn more about Trellix
Size
3,400 employees
Market Cap
$4.1 billion
Industry
Net Income
-$207.3 million
Founded
2004
5 Year Trend
+8.6%
Revenue
$940.5 million
NASDAQ

Similar Jobs

More Jobs at Trellix

More Enterprise Technology Jobs

Find similar Staff Software Engineer jobs: