Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval
By: Yuwei Zhang, Jayanth Srinivasa, Gaowen Liu, and more
Potential Business Impact:
Helps computers remember more for complex thinking.
Large Language Models (LLMs) often exhibit substantially shorter effective context lengths than their claimed capacities, especially when handling complex reasoning tasks that require integrating information from multiple parts of a long context and performing multi-step reasoning. Although Chain-of-Thought (CoT) prompting has shown promise in reducing task complexity, our empirical analysis reveals that it does not fully resolve this limitation. Through controlled experiments, we identify poor recall of implicit facts as the primary cause of failure, which significantly hampers reasoning performance. Interestingly, we observe that the internal attention weights from the generated CoT tokens can effectively ground implicit facts, even when these facts are not explicitly recalled. Building on this insight, we propose a novel training-free algorithm, Attrieval, which leverages attention weights to retrieve relevant facts from the long context and incorporates them into the reasoning process. Additionally, we find that selecting context tokens from CoT tokens further improves performance. Our results demonstrate that Attrieval notably enhances long-context reasoning capability on both synthetic and real-world QA datasets across various models.
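The core idea, retrieving context facts by the attention mass that generated CoT tokens place on them, can be illustrated with a minimal toy sketch. This is not the paper's actual implementation: the real method operates on a transformer's attention maps, whereas the function name, the chunking scheme, and the hand-written attention matrix below are all illustrative assumptions.

```python
# Toy sketch of attention-guided retrieval (illustrative only; the real
# Attrieval algorithm reads attention maps from an LLM's forward pass).

def attrieval_sketch(attention, chunk_spans, top_k=2):
    """Score each context chunk by the total attention it receives from
    generated CoT tokens, then return the indices of the top_k chunks.

    attention: one row per generated CoT token; each row holds attention
               weights over the long-context tokens.
    chunk_spans: (start, end) token ranges, one per context chunk.
    """
    scores = []
    for start, end in chunk_spans:
        # Aggregate the attention mass flowing from every CoT token
        # into this chunk's span of context tokens.
        scores.append(sum(sum(row[start:end]) for row in attention))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:top_k]

# Two CoT tokens attending over six context tokens (three 2-token chunks).
attn = [
    [0.05, 0.05, 0.40, 0.30, 0.10, 0.10],
    [0.10, 0.10, 0.35, 0.25, 0.10, 0.10],
]
chunks = [(0, 2), (2, 4), (4, 6)]
print(attrieval_sketch(attn, chunks, top_k=1))  # → [1]
```

Here chunk 1 receives the most attention mass (1.30) even though neither CoT token explicitly restates its content, mirroring the paper's observation that attention can ground implicit facts the model fails to recall in its generated text.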
Similar Papers
Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models
Computation and Language
Fixes AI's thinking to get better answers.
Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning
Computation and Language
Helps computers understand long stories better.
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
Computation and Language
Helps computers understand long stories better.