Score: 0

The Geometry of Truth: Layer-wise Semantic Dynamics for Hallucination Detection in Large Language Models

Published: October 6, 2025 | arXiv ID: 2510.04933v1

By: Amir Hameed Mir

Potential Business Impact:

Stops AI from making up false information.

Business Areas:

Visual Search Internet Services

Large Language Models (LLMs) often produce fluent yet factually incorrect statements-a phenomenon known as hallucination-posing serious risks in high-stakes domains. We present Layer-wise Semantic Dynamics (LSD), a geometric framework for hallucination detection that analyzes the evolution of hidden-state semantics across transformer layers. Unlike prior methods that rely on multiple sampling passes or external verification sources, LSD operates intrinsically within the model's representational space. Using margin-based contrastive learning, LSD aligns hidden activations with ground-truth embeddings derived from a factual encoder, revealing a distinct separation in semantic trajectories: factual responses preserve stable alignment, while hallucinations exhibit pronounced semantic drift across depth. Evaluated on the TruthfulQA and synthetic factual-hallucination datasets, LSD achieves an F1-score of 0.92, AUROC of 0.96, and clustering accuracy of 0.89, outperforming SelfCheckGPT and Semantic Entropy baselines while requiring only a single forward pass. This efficiency yields a 5-20x speedup over sampling-based methods without sacrificing precision or interpretability. LSD offers a scalable, model-agnostic mechanism for real-time hallucination monitoring and provides new insights into the geometry of factual consistency within large language models.

Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models

Computation and Language

Finds why AI makes up fake facts.

7 Oct 2025 0

91%

Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs

Machine Learning (CS)

Makes AI tell the truth, not make things up.

26 Aug 2025 0

90%

Semantic Energy: Detecting LLM Hallucination Beyond Entropy

Machine Learning (CS)

Finds when AI is wrong and tells you.

20 Aug 2025 1

View PDF Login to Bookmark

Page Count

27 pages

The Geometry of Truth: Layer-wise Semantic Dynamics for Hallucination Detection in Large Language Models

Stops AI from making up false information.

Technical Abstract

Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models

Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs

Semantic Energy: Detecting LLM Hallucination Beyond Entropy