Mechanisms of Prompt-Induced Hallucination in Vision-Language Models
By: William Rudman, Michal Golovanevsky, Dana Arad, and more
Potential Business Impact:
Fixes AI that makes up answers instead of using what it actually sees.
Large vision-language models (VLMs) are highly capable, yet often hallucinate by favoring textual prompts over visual evidence. We study this failure mode in a controlled object-counting setting, where the prompt overstates the number of objects in the image (e.g., asking a model to describe four waterlilies when only three are present). At low object counts, models often correct the overestimation, but as the number of objects increases, they increasingly conform to the prompt regardless of the discrepancy. Through mechanistic analysis of three VLMs, we identify a small set of attention heads whose ablation reduces prompt-induced hallucinations (PIH) by at least 40% without additional training. Across models, PIH-heads mediate prompt copying in model-specific ways. We characterize these differences and show that PIH ablation increases correction toward visual evidence. Our findings offer insights into the internal mechanisms driving prompt-induced hallucinations, revealing model-specific differences in how these behaviors are implemented.
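To make the core intervention concrete, below is a minimal sketch of attention-head ablation, the kind of training-free edit the abstract describes. It is not the authors' code: the `ToyMultiHeadAttention` module, the `ablate_heads` helper, and the head indices are illustrative assumptions. In practice one would register the same kind of hook on the attention blocks of a real VLM's language model and zero out the specific PIH-heads the paper identifies.

```python
# Hedged sketch: zero the contribution of selected attention heads at inference
# time, with no retraining. All names and indices here are hypothetical.
import torch
import torch.nn as nn


class ToyMultiHeadAttention(nn.Module):
    """Stand-in for one attention block of a VLM's language model."""

    def __init__(self, d_model: int = 64, n_heads: int = 8):
        super().__init__()
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Reshape to (batch, heads, tokens, d_head) for per-head attention.
        q, k, v = (z.view(b, t, self.n_heads, self.d_head).transpose(1, 2)
                   for z in (q, k, v))
        attn = torch.softmax(q @ k.transpose(-2, -1) / self.d_head ** 0.5, dim=-1)
        heads_out = attn @ v                              # (b, heads, t, d_head)
        merged = heads_out.transpose(1, 2).reshape(b, t, -1)
        return self.out_proj(merged)


def ablate_heads(attn: ToyMultiHeadAttention, head_ids: list[int]):
    """Zero the selected heads' output right before the output projection."""

    def hook(proj, inputs):
        (merged,) = inputs                                # (b, t, n_heads * d_head)
        b, t, _ = merged.shape
        per_head = merged.view(b, t, attn.n_heads, attn.d_head).clone()
        per_head[:, :, head_ids, :] = 0.0                 # knock out the chosen heads
        return (per_head.view(b, t, -1),)

    return attn.out_proj.register_forward_pre_hook(hook)


# Usage: ablate two hypothetical "PIH-heads" (indices 3 and 5) in one block,
# then remove the hook to restore the model's normal behavior.
block = ToyMultiHeadAttention()
handle = ablate_heads(block, head_ids=[3, 5])
x = torch.randn(1, 10, 64)
out_ablated = block(x)
handle.remove()
out_normal = block(x)
```

Using a forward pre-hook keeps the model weights untouched, which matches the training-free framing: the ablation can be switched on and off per forward pass, and in a real VLM the hook would target only the model-specific heads found to mediate prompt copying.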
Similar Papers
Mitigating Prompt-Induced Hallucinations in Large Language Models via Structured Reasoning
Computation and Language
Makes AI tell the truth, not make things up.
Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models
CV and Pattern Recognition
Stops computers from making up fake things in pictures.
Vision-Language Introspection: Mitigating Overconfident Hallucinations in MLLMs via Interpretable Bi-Causal Steering
CV and Pattern Recognition
Fixes AI mistakes when seeing and talking.