Score: 0

InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation

Published: April 2, 2025 | arXiv ID: 2504.01707v2

By: Bowen Cao, Deng Cai, Wai Lam

Potential Business Impact:

Lets computers remember much more information.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

In-context learning (ICL) is critical for large language models (LLMs), but its effectiveness is constrained by finite context windows, particularly in ultra-long contexts. To overcome this, we introduce InfiniteICL, a framework that parallels context and parameters in LLMs with short- and long-term memory in human cognitive systems, focusing on transforming temporary context knowledge into permanent parameter updates. This approach significantly reduces memory usage, maintains robust performance across varying input lengths, and theoretically enables infinite context integration through the principles of context knowledge elicitation, selection, and consolidation. Evaluations demonstrate that our method reduces context length by 90% while achieving 103% average performance of full-context prompting across fact recall, grounded reasoning, and skill acquisition tasks. When conducting sequential multi-turn transformations on complex, real-world contexts (with length up to 2M tokens), our approach surpasses full-context prompting while using only 0.4% of the original contexts. These findings highlight InfiniteICL's potential to enhance the scalability and efficiency of LLMs by breaking the limitations of conventional context window sizes.

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models

Computation and Language

Lets computers understand much longer stories.

8 Apr 2025 2

90%

Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning

Computation and Language

AI learns new things from just a few examples.

16 May 2025 2

89%

You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Model

Computation and Language

Teaches computers to do many jobs well at once.

6 Jun 2025 1

View PDF Login to Bookmark

Page Count

12 pages

InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation

Lets computers remember much more information.

Technical Abstract

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models

Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning

You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Model