Data Scientist

Trellix • $100K — $130K *

Frisco, TX 75034 • In-Person

Information Technology

5 - 7 years of experience

Today

Job Overview by Ladders

Qualifications

5+ years of experience in data science, ML engineering, or AI research with a focus on evaluation and benchmarking.
Proficiency in Python, including libraries like pandas and NumPy.
Strong skills in statistical analysis and experimental design.
Experience with Large Language Model (LLM) evaluation frameworks such as RAGAS or EleutherAI LM Eval.
Hands-on experience with knowledge graphs and familiarity with graph quality metrics.

Responsibilities

Design and implement evaluation frameworks for AI systems and multi-step reasoning agents.
Execute model evaluations to assess performance across various parameters such as accuracy and safety.
Develop methods for validating knowledge graph quality and completeness.
Curate high-quality datasets for training and testing AI models and agents.
Create structured test harnesses for evaluating agentic systems and their coordination capabilities.

Benefits

Retirement plans offering various options for financial security.
Comprehensive medical, dental, and vision coverage.
Generous paid time off to support work-life balance.
Paid parental leave to facilitate family bonding.
Support for community involvement to encourage employee engagement.

Full Job Description

Job Title:
Data Scientist

Role Overview:
Join our innovative team at Trellix, where you'll be instrumental in building the evaluation and benchmarking infrastructure for our cutting-edge agentic AI platform. This role sits at the intersection of data science and AI engineering. You'll own the science of how we know our AI works: designing evaluation frameworks, curating test datasets, and measuring the performance of AI agents, knowledge graphs, and foundation models across the Trellix security portfolio.

About the Role:

Evaluation Framework Design: Architect and implement rigorous evaluation pipelines for agentic AI systems, including multi-step reasoning agents, retrieval-augmented pipelines, and autonomous SOC workflows.
Model & Agent Benchmarking: Design and execute model evaluations to assess accuracy, reliability, latency, and safety across LLMs and agentic systems, including custom benchmarks tailored to cybersecurity use cases.
Knowledge Graph Evaluation: Develop methods to validate knowledge graph quality, coverage, and correctness, including entity resolution, relationship accuracy, and graph completeness metrics.
Dataset Engineering: Build, curate, and maintain high-quality synthetic and real-world datasets for training, fine-tuning, and testing models and agents - including adversarial and edge-case datasets.
Agentic System Testing: Design structured test harnesses for agentic systems covering tool use, multi-agent coordination, hallucination rates, decision quality, and task completion fidelity.
Metrics & Observability: Define and instrument evaluation metrics, surface results through dashboards, and translate findings into actionable insights for engineering and product teams.
Research & Innovation: Stay current with the latest evaluation methodologies (e.g., LLM-as-judge, RAGAS, MT-Bench, custom evals) and adapt them to Trellix's security domain.
Cross-Functional Collaboration: Partner closely with AI engineers, product managers, and security researchers to align evaluation standards with real-world performance requirements.

About You:

Experience: 5+ years of professional experience in data science, ML engineering, or AI research, with hands-on work in evaluation or benchmarking of AI/ML systems.
Data Science & ML Core:
- Strong proficiency in Python (pandas, NumPy, scikit-learn)
- Statistical analysis and experimental design
- Experience building and managing datasets for ML training and evaluation
- Familiarity with annotation workflows and data quality frameworks
AI/LLM Evaluation:
- Hands-on experience evaluating Large Language Models (LLMs)
- Familiarity with evaluation frameworks such as RAGAS, HELM, EleutherAI LM Eval, or equivalent
- Experience designing LLM-as-judge pipelines or preference evaluation workflows
- Understanding of hallucination detection, groundedness, and faithfulness metrics
Agentic Systems:
- Experience testing or evaluating agentic AI systems
- Familiarity with tool use, ReAct-style agents, Deep Agents, and multi-agent coordination patterns
- Ability to define pass/fail criteria for complex, multi-step agent tasks
Knowledge Graphs:
- Experience working with knowledge graphs (NebulaGraph, Neo4j, or equivalent)
- Ability to evaluate graph quality, ontology coverage, and traversal correctness
- Familiarity with embedding-based retrieval and vector databases (Qdrant preferred)
Data Engineering & Infrastructure:
- Experience with synthetic data generation for model and agent testing
- Proficiency with vector databases and embedding pipelines
- Familiarity with MLflow, Weights & Biases, Langfuse, or similar experiment tracking tools
- AWS experience preferred
Domain Knowledge:
- Familiarity with the cybersecurity domain strongly preferred
- Understanding of SOC workflows, threat detection, and incident response a plus
- Experience evaluating AI systems in high-stakes or regulated environments a plus
Soft Skills:
- Strong analytical thinking and ability to translate ambiguous quality questions into measurable metrics
- Excellent written communication - able to document evaluation methodologies and present findings to technical and non-technical stakeholders
- Collaborative mindset with a bias toward rigor and reproducibility

Company Benefits and Perks:

We believe that the best solutions are developed by teams who embrace each other's unique experiences, skills, and abilities. We work hard to create a dynamic workforce where we encourage everyone to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.

Retirement Plans
Medical, Dental and Vision Coverage
Paid Time Off
Paid Parental Leave
Support for Community Involvement

About Trellix

Trellix is a cybersecurity company formed in 2022 from the merger of McAfee Enterprise and FireEye.

Size: 3,400 employees
Market Cap: $4.1 billion
Industry: Consumer Technology
Net Income: -$207.3 million
Founded: 2004
5 Year Trend: +8.6%
Revenue: $940.5 million
NASDAQ: FEYE

* Ladders Estimates
