Score: 1

The Semantic Illusion: Certified Limits of Embedding-Based Hallucination Detection in RAG Systems

Published: December 17, 2025 | arXiv ID: 2512.15068v2

By: Debu Sinha

Potential Business Impact:

Helps detect hallucinated (unsupported) answers in AI-generated text, with statistical guarantees on detection reliability.

Business Areas:
Augmented Reality Hardware, Software

Retrieval-Augmented Generation (RAG) systems remain susceptible to hallucinations despite grounding in retrieved evidence. While current detection methods leverage embedding similarity and natural language inference (NLI), their reliability in safety-critical settings remains unproven. We apply conformal prediction to RAG hallucination detection, transforming heuristic scores into decision sets with finite-sample coverage guarantees (1 − α). Using calibration sets of n = 600, we demonstrate a fundamental dichotomy: on synthetic hallucinations (Natural Questions), embedding methods achieve 95% coverage with 0% False Positive Rate (FPR). However, on real hallucinations from RLHF-aligned models (HaluEval), the same methods fail catastrophically, yielding 100% FPR at target coverage. We analyze this failure through the lens of distributional tails, showing that while NLI models achieve acceptable AUC (0.81), the "hardest" hallucinations are semantically indistinguishable from faithful responses, forcing conformal thresholds to reject nearly all valid outputs. Crucially, GPT-4 as a judge achieves 7% FPR (95% CI: [3.4%, 13.7%]) on the same data, proving the task is solvable via reasoning but opaque to surface-level semantics, a phenomenon we term the "Semantic Illusion."
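To make the conformal recipe in the abstract concrete, the sketch below shows one plausible reading of split conformal calibration for a generic hallucination score (e.g., one minus an embedding similarity between a response and its retrieved evidence): calibrate a flagging threshold on held-out hallucinated examples so that at least 1 − α of hallucinations are caught, then measure the false positive rate on faithful responses. The score distributions, function names, and sample sizes other than n = 600 and 1 − α = 95% are illustrative assumptions, not the paper's implementation.

```python
import numpy as np


def conformal_detection_threshold(halluc_scores: np.ndarray, alpha: float = 0.05) -> float:
    """Split-conformal threshold calibrated on hallucination scores.

    Flagging any test response whose score is >= the returned threshold
    detects at least a (1 - alpha) fraction of hallucinations, with a
    finite-sample guarantee under exchangeability.
    """
    n = len(halluc_scores)
    k = max(int(np.floor(alpha * (n + 1))), 1)  # finite-sample correction
    return float(np.sort(halluc_scores)[k - 1])  # k-th smallest calibration score


def false_positive_rate(faithful_scores: np.ndarray, threshold: float) -> float:
    """Fraction of faithful responses wrongly flagged as hallucinations."""
    return float(np.mean(faithful_scores >= threshold))


def wilson_interval(successes: int, trials: int, z: float = 1.96):
    """Approximate 95% Wilson score interval for a binomial proportion."""
    p = successes / trials
    denom = 1 + z**2 / trials
    center = (p + z**2 / (2 * trials)) / denom
    half = z * np.sqrt(p * (1 - p) / trials + z**2 / (4 * trials**2)) / denom
    return center - half, center + half


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical stand-in scores (higher = less similar to the evidence).
    # Heavily overlapping distributions mimic the hard, HaluEval-style regime.
    halluc = rng.normal(0.55, 0.15, size=600)    # n = 600 calibration set
    faithful = rng.normal(0.45, 0.15, size=1000)

    t = conformal_detection_threshold(halluc, alpha=0.05)
    fpr = false_positive_rate(faithful, t)
    flagged = int(np.sum(faithful >= t))
    lo, hi = wilson_interval(flagged, len(faithful))
    print(f"threshold={t:.3f}  FPR={fpr:.1%}  95% CI=[{lo:.1%}, {hi:.1%}]")
```

The sketch also makes the paper's failure mode easy to see: when the hardest hallucinations score no higher than typical faithful responses, the 95%-coverage threshold is pushed deep into the faithful distribution's range, and the measured FPR climbs toward 100%, exactly the tail behavior the abstract describes.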

Page Count
12 pages

Category
Computer Science:
Machine Learning (CS)