Hallucination Detection via Internal States and Structured Reasoning Consistency in Large Language Models
By: Yusheng Song, Lirong Qiu, Xi Zhang, and others
Potential Business Impact:
Detects when large language models hallucinate, whether by stating false facts or by reasoning incorrectly.
The detection of sophisticated hallucinations in Large Language Models (LLMs) is hampered by a "Detection Dilemma": methods probing internal states (Internal State Probing) excel at identifying factual inconsistencies but fail on logical fallacies, while those verifying externalized reasoning (Chain-of-Thought Verification) show the opposite behavior. This split creates a task-dependent blind spot: Chain-of-Thought Verification fails on fact-intensive tasks like open-domain QA, where reasoning is ungrounded, while Internal State Probing is ineffective on logic-intensive tasks like mathematical reasoning, where models are confidently wrong. We resolve this dilemma with a unified framework that bridges the two paradigms. Unification, however, is hindered by two fundamental challenges: the Signal Scarcity Barrier, as coarse symbolic reasoning chains lack signals directly comparable to fine-grained internal states, and the Representational Alignment Barrier, a deep-seated mismatch between their underlying semantic spaces. To overcome these, we introduce a multi-path reasoning mechanism to obtain more comparable, fine-grained signals, and a segment-aware temporalized cross-attention module to adaptively fuse the now-aligned representations, pinpointing subtle dissonances. Extensive experiments on three diverse benchmarks and two leading LLMs demonstrate that our framework consistently and significantly outperforms strong baselines. Our code is available at https://github.com/peach918/HalluDet.
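To make the fusion idea concrete, below is a minimal, illustrative sketch of how token-level internal states and reasoning-chain segment embeddings might be projected into a shared space and combined via cross-attention for hallucination classification. This is not the authors' implementation: the module name, dimensions, pooling strategy, and the plain multi-head cross-attention layer are assumptions standing in for the paper's segment-aware temporalized cross-attention module.

```python
# Hypothetical sketch: cross-attention fusion of internal states with
# reasoning-segment embeddings for hallucination detection.
# All names, shapes, and design choices here are illustrative assumptions,
# not the architecture described in the paper.
import torch
import torch.nn as nn


class CrossAttentionFusion(nn.Module):
    """Fuses token-level internal states with segment-level reasoning signals."""

    def __init__(self, hidden_dim: int = 768, num_heads: int = 8):
        super().__init__()
        # Project both signal types into a shared space before attention,
        # a simple stand-in for the representational alignment step.
        self.state_proj = nn.Linear(hidden_dim, hidden_dim)
        self.reason_proj = nn.Linear(hidden_dim, hidden_dim)
        self.cross_attn = nn.MultiheadAttention(hidden_dim, num_heads, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, 2)  # hallucinated vs. faithful

    def forward(self, internal_states: torch.Tensor,
                reasoning_segments: torch.Tensor) -> torch.Tensor:
        # internal_states:    (batch, num_tokens,   hidden_dim) from the LLM's layers
        # reasoning_segments: (batch, num_segments, hidden_dim) from multi-path reasoning
        q = self.state_proj(internal_states)
        kv = self.reason_proj(reasoning_segments)
        # Internal states attend over reasoning segments to surface dissonances.
        fused, _ = self.cross_attn(query=q, key=kv, value=kv)
        # Pool over tokens and predict a hallucination label for the answer.
        return self.classifier(fused.mean(dim=1))


if __name__ == "__main__":
    detector = CrossAttentionFusion()
    states = torch.randn(2, 32, 768)    # dummy internal-state features
    segments = torch.randn(2, 6, 768)   # dummy reasoning-segment embeddings
    logits = detector(states, segments)
    print(logits.shape)                 # torch.Size([2, 2])
```

The key design point the sketch tries to convey is that the two signal streams are first mapped into a common representation space and only then fused, so the detector can compare fine-grained internal evidence against each reasoning segment rather than against the chain as a whole.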
Similar Papers
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis
Computation and Language
Detects hallucinations by analyzing the model's internal states, without consulting external sources.
Neural Probe-Based Hallucination Detection for Large Language Models
Computation and Language
Uses neural probes over model activations to detect hallucinated content in generated text.
Principled Detection of Hallucinations in Large Language Models via Multiple Testing
Computation and Language
Detects hallucinated answers using a principled multiple-testing framework.