Score: 3

MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents

Published: June 18, 2025 | arXiv ID: 2506.15841v2

By: Zijian Zhou, Ao Qu, Zhaoxuan Wu, and more

BigTech Affiliations: Massachusetts Institute of Technology

Potential Business Impact:

Lets AI agents handle long multi-turn tasks with a fixed-size memory, cutting compute and memory costs while improving accuracy.

Business Areas:
Semantic Search, Internet Services

Modern language agents must operate over long-horizon, multi-turn interactions, where they retrieve external information, adapt to observations, and answer interdependent queries. Yet, most LLM systems rely on full-context prompting, appending all past turns regardless of their relevance. This leads to unbounded memory growth, increased computational costs, and degraded reasoning performance on out-of-distribution input lengths. We introduce MEM1, an end-to-end reinforcement learning framework that enables agents to operate with constant memory across long multi-turn tasks. At each turn, MEM1 updates a compact shared internal state that jointly supports memory consolidation and reasoning. This state integrates prior memory with new observations from the environment while strategically discarding irrelevant or redundant information. To support training in more realistic and compositional settings, we propose a simple yet effective and scalable approach to constructing multi-turn environments by composing existing datasets into arbitrarily complex task sequences. Experiments across three domains, including internal retrieval QA, open-domain web QA, and multi-turn web shopping, show that MEM1-7B improves performance by 3.5x while reducing memory usage by 3.7x compared to Qwen2.5-14B-Instruct on a 16-objective multi-hop QA task, and generalizes beyond the training horizon. Our results demonstrate the promise of reasoning-driven memory consolidation as a scalable alternative to existing solutions for training long-horizon interactive agents, where both efficiency and performance are optimized.
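The core mechanism in the abstract, replacing full-context prompting with a per-turn consolidated internal state, can be illustrated with a small agent loop. Below is a minimal sketch in Python; the `llm` callable, the `env` object with `reset`/`step`, and the `<memory>`/`<action>`/`<answer>` tag names are all assumptions for illustration, and the paper trains this consolidation behavior end-to-end with reinforcement learning rather than eliciting it purely by prompting as done here.

```python
import re
from typing import Callable

def extract_tag(text: str, tag: str) -> str:
    """Pull the content of <tag>...</tag> from a model reply."""
    m = re.search(rf"<{tag}>(.*?)</{tag}>", text, re.DOTALL)
    return m.group(1).strip() if m else ""

def run_episode(llm: Callable[[str], str], env, max_turns: int = 32) -> str:
    """Constant-memory loop: the only context carried across turns is the
    consolidated <memory> string, so the prompt size stays bounded."""
    state, observation = "", env.reset()
    for _ in range(max_turns):
        prompt = (
            "You are a long-horizon agent with a fixed-size memory.\n"
            f"<memory>{state}</memory>\n"
            f"<observation>{observation}</observation>\n"
            "Rewrite <memory> to keep only facts needed for the remaining "
            "objectives, then emit the next <action> (or <answer> to stop)."
        )
        reply = llm(prompt)
        # The rewritten memory replaces, rather than appends to, prior context.
        state = extract_tag(reply, "memory")
        if answer := extract_tag(reply, "answer"):
            return answer
        observation = env.step(extract_tag(reply, "action"))
    return state
```

Because only the consolidated state crosses turn boundaries, prompt length (and hence attention cost) stays roughly constant regardless of horizon, which is the efficiency property the abstract claims over append-everything prompting.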

Country of Origin
🇺🇸 🇸🇬 United States, Singapore

Repos / Data Links

Page Count
23 pages

Category
Computer Science:
Computation and Language