HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification
By: Bibek Paudel, Alexander Lyzhov, Preetam Joshi, and more
Potential Business Impact:
Finds when AI makes up wrong information.
This paper introduces a comprehensive system for detecting hallucinations in large language model (LLM) outputs in enterprise settings. We present a novel taxonomy of LLM responses specific to hallucination in enterprise applications, categorizing them into context-based, common knowledge, enterprise-specific, and innocuous statements. Our hallucination detection model, HDM-2, validates LLM responses with respect to both context and generally known facts (common knowledge). It provides both hallucination scores and word-level annotations, enabling precise identification of problematic content. To evaluate it on context-based and common-knowledge hallucinations, we introduce a new dataset, HDMBench. Experimental results demonstrate that HDM-2 outperforms existing approaches across the RagTruth, TruthfulQA, and HDMBench datasets. This work addresses the specific challenges of enterprise deployment, including computational efficiency, domain specialization, and fine-grained error identification. Our evaluation dataset, model weights, and inference code are publicly available.
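To make the described outputs concrete, below is a minimal, hypothetical Python sketch of the kind of result a context- and common-knowledge-based detector like HDM-2 might return: a statement-type label from the paper's taxonomy, a response-level hallucination score, and word-level annotations. The class names, fields, and the toy scoring heuristic are illustrative assumptions, not the paper's actual interface or algorithm.

```python
# Hypothetical sketch of a hallucination-detection result schema.
# Names and the toy heuristic are assumptions for illustration only;
# a real system would run the released HDM-2 model instead.
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class StatementType(Enum):
    """Taxonomy of LLM response statements described in the paper."""
    CONTEXT_BASED = "context_based"              # verifiable against the provided context
    COMMON_KNOWLEDGE = "common_knowledge"        # verifiable against generally known facts
    ENTERPRISE_SPECIFIC = "enterprise_specific"  # requires enterprise/domain knowledge
    INNOCUOUS = "innocuous"                      # filler or opinion; not a factual claim


@dataclass
class WordAnnotation:
    """Word-level flag marking a span of the response as hallucinated."""
    word: str
    start: int  # character offset in the response
    end: int
    hallucinated: bool


@dataclass
class DetectionResult:
    """Response-level score plus fine-grained, word-level annotations."""
    hallucination_score: float  # e.g. 0.0 (grounded) .. 1.0 (hallucinated)
    statement_type: StatementType
    annotations: List[WordAnnotation] = field(default_factory=list)

    def flagged_words(self) -> List[str]:
        return [a.word for a in self.annotations if a.hallucinated]


def detect_hallucinations(context: str, response: str) -> DetectionResult:
    """Placeholder for a model call; shown only to exercise the schema above."""
    # Toy heuristic purely for illustration: flag response words absent from the context.
    context_vocab = set(context.lower().split())
    annotations, offset = [], 0
    for word in response.split():
        start = response.index(word, offset)
        end = start + len(word)
        offset = end
        annotations.append(
            WordAnnotation(word, start, end, word.lower() not in context_vocab)
        )
    flagged = sum(a.hallucinated for a in annotations)
    score = flagged / max(len(annotations), 1)
    return DetectionResult(score, StatementType.CONTEXT_BASED, annotations)


if __name__ == "__main__":
    ctx = "The invoice was issued on March 3 for 1200 USD."
    resp = "The invoice was issued on March 5 for 1200 USD."
    result = detect_hallucinations(ctx, resp)
    print(round(result.hallucination_score, 2), result.flagged_words())
```

The point of the sketch is the output shape: word-level annotations let an enterprise pipeline highlight exactly which spans conflict with the retrieved context, while the response-level score supports thresholding or routing decisions.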
Similar Papers
Towards Long Context Hallucination Detection
Computation and Language
Helps computers avoid making up fake information.
UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output
Computation and Language
Finds fake facts in AI answers.
Principled Detection of Hallucinations in Large Language Models via Multiple Testing
Computation and Language
Stops AI from making up wrong answers.