LLM-Based Knowledge Extraction and Failure Analysis Internship

Siemens • $66K — $97K *

Princeton, NJ 08540In-Person

Information Technology

Less than 5 years of experience

Today

Be an Early Applicant

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

Master's or PhD candidate in Computer Science or related field.
3+ years experience in AI, Machine Learning, or Data Science.
Hands-on Python programming experience with libraries like PyTorch or TensorFlow.
Experience in prompt and context engineering for LLM outputs.
Strong grasp of data modeling and validation principles.
Skills in creating AI workflows involving LLMs and structured context.
Excellent communication skills for technical concepts.

Responsibilities

Design and refine prompts for effective failure classification.
Analyze LLM output quality to identify classification errors.
Create test cases and evaluation metrics to measure accuracy.
Enhance JSON schemas and validation processes for data integrity.
Prototype improvements in AI data pipelines using Python.
Collaborate with engineers and researchers on failure categories and model behavior.
Document experiments and findings for internal and external dissemination.

Benefits

Health and wellness benefits
Potential extension of internship
Hands-on experience in a collaborative environment
Mentorship from experienced professionals
Opportunity to contribute to innovative industrial applications

Full Job Description

LLM-Based Knowledge Extraction and Failure Analysis Internship

Siemens Research & Predevelopment (RPD) is the central R&D department of Siemens and thus has a key role to shape the future of our products. RPD acts as a strategic partner to support the executive units of Siemens. In consequence the main research focus is on future technologies for industry, infrastructure, mobility, and healthcare. In this context, we are looking for an Intern that supports our Software Systems and Processes team in Princeton, NJ by researching and developing scalable intelligent systems using LLMs and semantic technologies.

Transform the everyday with us!

Are you passionate about pushing the boundaries of AI and data science? We're looking for an innovative PhD intern to join our team and contribute to groundbreaking research focused on developing and improving knowledge graphs for advanced intelligent systems.

Modern industrial software systems generate large volumes of complex engineering signals, logs, test results, and failure information that are difficult to interpret consistently with traditional automation alone. In this internship, you will work on LLM-based knowledge extraction and failure classification workflows that transform technical inputs into structured, explainable JSON-based outputs. The focus is on prompt engineering, context engineering, model-output debugging, and iterative quality improvement-understanding why a model selected a particular failure class, which evidence influenced the result, where context was missing or misleading, and how to make the pipeline more accurate, transparent, and reliable for industrial use cases.

The internship provides a unique experience to contribute to innovative industrial applications while mentored by experienced professionals in an international setting.

This role is preferred to be on-site in Princeton, NJ, for a hands-on and collaborative experience, however remote candidates will be considered. The position is a full-time role for at least 3 months with the possibility of extension.

Key Responsibilities

Design, test, and refine prompts and context-selection strategies that help models classify failures, use relevant evidence, and produce consistent structured JSON outputs.
Analyze LLM output quality to understand why models choose incorrect failure classes, overlook important evidence, rely on misleading context, or generate inconsistent explanations.
Create evaluation examples, test cases, scoring rubrics, and error-analysis summaries to measure classification accuracy, evidence quality, explanation quality, and robustness.
Improve JSON schemas, validation checks, metadata fields, and intermediate representations used by downstream analysis and reporting workflows.
Prototype improvements to data preparation, retrieval or context assembly, prompt templates, output formatting, post-processing, and evaluation logic in Python-based AI pipelines.
Collaborate with software engineers, AI researchers, and domain experts to understand failure categories, edge cases, expected model behavior, and quality requirements.
Document experiments, observed failure modes, design decisions, evaluation results, and recommendations through internal demos, technical reports, and potential scientific publications.

Basic Qualifications

Currently enrolled in a Master's or PhD program in Computer Science, Artificial Intelligence, Data Science, Knowledge Engineering, Information Science, or a closely related technical field.
3+ years of foundational knowledge and research or project experience in Artificial Intelligence, Machine Learning, Generative AI, NLP, Data Engineering, or knowledge-based intelligent systems.
3+ years of hands-on programming experience in Python, including experience with AI/ML libraries or frameworks such as PyTorch, TensorFlow, Hugging Face Transformers, scikit-learn, LangChain, LlamaIndex, or similar tools.
Hands-on experience with prompt engineering, context engineering, structured LLM outputs, or LLM-based information extraction and classification workflows.
Strong understanding of data modeling, structured outputs, metadata design, schema quality, validation concepts, and data quality principles.
Experience designing, implementing, or evaluating AI workflows that combine LLMs with structured context, retrieval, information extraction, classification, or rule-based validation.
Demonstrated ability to conduct independent research, critically analyze complex problems, work through ambiguity, and deliver structured technical outputs on defined timelines.
Strong written and verbal communication skills in English, with the ability to explain technical concepts clearly to both technical and domain-expert audiences.
The position requires the person to be in the United States of America and hold a valid work permit in the US for the duration of the internship.

Preferred Skills

Knowledge of transformer-based models, attention mechanisms, NLP/NLU methods, named entity recognition, relation extraction, question answering, or text classification.
Experience building reproducible data or AI pipelines, including data ingestion, validation, testing, documentation, and workflow orchestration with tools such as Apache Airflow, Prefect, Git, Docker, or similar technologies.
Ability to work with domain experts to translate engineering failure categories, business requirements, and quality expectations into clear prompts, evaluation criteria, and structured output formats.
Excellent analytical skills, attention to detail, and ability to reason about model behavior, evidence quality, data ambiguity, reproducibility, and maintainability of AI pipeline outputs.
Capacity to work independently, prioritize effectively, communicate progress clearly, and collaborate in an interdisciplinary research environment.
Interest in applying LLMs, knowledge extraction, and quality-focused AI engineering to industrial software systems, intelligent automation, or enterprise-scale engineering use cases.

You'll Benefit From
Siemens offers a variety of health and wellness benefits to our employees. Details regarding our benefits can be found here: https://www.benefitsquickstart.com/siemens/index.html
The pay range for this position is $32-$47 per hour. The actual wage offered may be lower or higher depending on budget and candidate experience, knowledge, skills, qualifications and premium geographic location.

About Siemens

Siemens AG is a German multinational conglomerate company headquartered in Munich and the largest industrial manufacturing company in Europe with branch offices abroad. The principal divisions of the company are Industry, Energy, Healthcare, and Infrastructure & Cities, which represent the main activities of the company. The company is a prominent maker of medical diagnostics equipment and its medical health-care division, which generates about 12 percent of the company's total sales, is its second-most profitable unit, after the industrial automation division. The company is a component of the Euro Stoxx 50 stock market index. Siemens and its subsidiaries employ approximately 385,000 people worldwide and reported global revenue of around €87 billion in 2019 according to its earnings release.

Learn more about Siemens

Size

305,000 employees

Industry

Business Services

Founded

1847

NASDAQ

SIEGY

* Ladders Estimates

Similar Jobs

Associate Scientist II
$77K — $87K *
Ascidian Therapeutics
Boston, MA 02115 (Suffolk County)
Today
Regulatory Bioassay Research Scientist
$65K — $108K *
Guidehouse
Silver Spring, MD 20906 (Montgomery County)
Today
Agentic AI, LLM Evaluation, and Trustworthy Systems Research Internship
$66K — $97K *
Siemens
Princeton, NJ 08540 (Mercer County)
Today
Scientist (Algorithm Developer)
$85K — $145K *
Areté
Falls Church, VA 22042 (Fairfax County)
Today
Scientist, Biological Research - Oncology
$90K — $120K *
Johnson & Johnson
Springs, PA 15562 (Somerset County)
Today
Research Scientist
$70K — $95K *
Virginia Jobs
Charlottesville, VA 22903 (Charlottesville City County)
Today

Get Ready For Your
Next Interview

More Jobs at Siemens

Senior Manufacturing Engineer
$82K — $141K *
Buffalo Grove, IL 60089 (Lake County)
Today
Manufacturing & Automotive
In-Person
Service Delivery Manager - SQILLS
$70K — $100K *
Oakville, ON L6H 0A5
Reposted Today
Enterprise Technology
In-Person
Agentic AI, LLM Evaluation, and Trustworthy Systems Research Internship
$66K — $97K *
Princeton, NJ 08540 (Mercer County)
Today
Information Technology
In-Person
Production Supervisor
$61K — $105K *
Fort Worth, TX 76137 (Tarrant County)
Today
Manufacturing & Automotive
In-Person
Sr. Director, Technical Business Development - Software Virtual Platform
$274K — $500K+*
Remote
Today
Enterprise Technology
Remote in Austin, TX

More Information Technology Jobs

SDET (Software Development Engineer In Test)
Confidential Company
Washington, DC 20001 (District Of Columbia County)
Yesterday
Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Senior Reliability Engineer
$160K — $190K *
Stream Data Centers
Dallas, TX 75217 (Dallas County)
Today
Director, AI Engineering
$130K — $180K *
Royal Bank of Canada
Toronto, ON M3C 0E3
Reposted Today
INFORMATION TECHNOLOGY SPECIALIST
$75K — $95K *
U.S. Marine Corps
Quantico, VA 22134 (Prince William County)
Today

Find similar LLM-Based Knowledge Extraction and Failure Analysis Internship jobs:

Nationwide Princeton, NJ

LLM-Based Knowledge Extraction and Failure Analysis Internship

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar LLM-Based Knowledge Extraction and Failure Analysis Internship jobs:

Get Ready For Your
Next Interview