
Fine-Mem: Fine-Grained Feedback Alignment for Long-Horizon Memory Management

Published: January 13, 2026 | arXiv ID: 2601.08435v1

By: Weitao Ma, Xiaocheng Feng, Lei Huang, and more

Effective memory management is essential for large language model agents to navigate long-horizon tasks. Recent research has explored using reinforcement learning to develop specialized memory-manager agents. However, existing approaches rely on final task performance as the primary reward, which results in severe reward sparsity and ineffective credit assignment, providing insufficient guidance for individual memory operations. To address this, we propose Fine-Mem, a unified framework designed for fine-grained feedback alignment. First, we introduce a Chunk-level Step Reward that provides immediate step-level supervision via auxiliary chunk-specific question answering tasks. Second, we devise Evidence-Anchored Reward Attribution, which redistributes the global reward by anchoring credit to key memory operations based on the specific memory items used as evidence during reasoning. Together, these components enable stable policy optimization and align local memory operations with the long-term utility of memory. Experiments on Memalpha and MemoryAgentBench demonstrate that Fine-Mem consistently outperforms strong baselines, achieving superior success rates across various sub-tasks. Further analysis reveals its adaptability and strong generalization across diverse model configurations and backbones.
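The abstract only sketches the two reward components, but the idea can be illustrated with a minimal reward-shaping sketch. The Python snippet below is a hypothetical illustration under stated assumptions, not the paper's implementation: the names `MemoryOp`, `chunk_step_reward`, `evidence_anchored_attribution`, and the uniform fallback are all assumptions made for exposition. It pairs an immediate per-chunk QA reward with a redistribution of the final task reward over the memory operations whose outputs were cited as evidence.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class MemoryOp:
    """One memory-management action taken while processing a chunk.

    `item_ids` are the memory entries this operation wrote or updated.
    (Illustrative structure; the paper's actual representation may differ.)
    """
    step: int
    item_ids: List[str]


def chunk_step_reward(chunk_qa_accuracy: float, scale: float = 1.0) -> float:
    """Chunk-level Step Reward (sketch): immediate supervision from an
    auxiliary chunk-specific QA task, e.g. the fraction of probe questions
    about the current chunk answered correctly from the updated memory."""
    return scale * chunk_qa_accuracy


def evidence_anchored_attribution(
    final_reward: float,
    ops: List[MemoryOp],
    evidence_item_ids: List[str],
) -> Dict[int, float]:
    """Evidence-Anchored Reward Attribution (sketch): redistribute the global
    task reward over the operations that produced the memory items actually
    used as evidence during reasoning; uncited operations get no global credit."""
    evidence = set(evidence_item_ids)
    # Count how many cited evidence items each operation contributed.
    hits = {op.step: len(evidence.intersection(op.item_ids)) for op in ops}
    total_hits = sum(hits.values())
    if total_hits == 0:
        # No evidence traced back to any operation; spread the reward uniformly
        # (one simple fallback; the paper may handle this case differently).
        return {op.step: final_reward / max(len(ops), 1) for op in ops}
    return {step: final_reward * h / total_hits for step, h in hits.items()}


if __name__ == "__main__":
    ops = [
        MemoryOp(step=0, item_ids=["m1", "m2"]),
        MemoryOp(step=1, item_ids=["m3"]),
        MemoryOp(step=2, item_ids=["m4"]),
    ]
    step_r = chunk_step_reward(chunk_qa_accuracy=0.75)
    attributed = evidence_anchored_attribution(1.0, ops, evidence_item_ids=["m1", "m4"])
    # Per-step training signal = immediate chunk reward + attributed share of the final reward.
    print(step_r, attributed)
```

The combination addresses the reward-sparsity problem the abstract describes: every memory operation receives some immediate signal, while the global reward is concentrated on the operations that demonstrably mattered for the final answer.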

Category
Computer Science:
Computation and Language