
MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Published: December 16, 2025 | arXiv ID: 2512.14699v1

By: Sihui Ji, Xi Chen, Shuai Yang, and more

Potential Business Impact:

Keeps long generated videos consistent with the same story over time.

Business Areas:
Motion Capture, Media and Entertainment, Video

The core challenge for streaming video generation is maintaining content consistency over long contexts, which places high demands on the memory design. Most existing solutions maintain memory by compressing historical frames with predefined strategies. However, different upcoming video chunks should draw on different historical cues, which is hard to satisfy with fixed strategies. In this work, we propose MemFlow to address this problem. Specifically, before generating the next chunk, we dynamically update the memory bank by retrieving the historical frames most relevant to that chunk's text prompt. This design preserves narrative coherence even when a new event occurs or the scene switches in future frames. In addition, during generation we activate only the most relevant tokens in the memory bank for each query in the attention layers, which keeps generation efficient. In this way, MemFlow achieves outstanding long-context consistency with negligible computational overhead (a 7.9% speed reduction compared with the memory-free baseline) and remains compatible with any streaming video generation model that uses a KV cache.
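The two mechanisms the abstract describes can be sketched as follows. This is a minimal illustration, not the paper's implementation: all function names, embedding shapes, and the use of cosine similarity for prompt-to-frame retrieval are my own assumptions.

```python
# Hypothetical sketch of MemFlow-style adaptive memory:
# (1) retrieve the historical frames most relevant to the next chunk's
#     text prompt into a memory bank, and
# (2) let each query attend only to its top-m memory tokens.
import numpy as np

def cosine_sim(a, b):
    # Row-wise cosine similarity between two sets of embeddings.
    a = a / (np.linalg.norm(a, axis=-1, keepdims=True) + 1e-8)
    b = b / (np.linalg.norm(b, axis=-1, keepdims=True) + 1e-8)
    return a @ b.T

def update_memory_bank(frame_embs, prompt_emb, k):
    """Keep the k historical frames most similar to the chunk's prompt."""
    sims = cosine_sim(frame_embs, prompt_emb[None, :])[:, 0]
    top = np.argsort(-sims)[:k]
    return frame_embs[np.sort(top)]  # keep retrieved frames in temporal order

def sparse_memory_attention(queries, mem_keys, mem_values, m):
    """Each query attends only to its m highest-scoring memory tokens,
    so cost grows with m rather than with the full memory length."""
    scores = queries @ mem_keys.T / np.sqrt(queries.shape[-1])
    out = np.zeros((queries.shape[0], mem_values.shape[-1]))
    for i, row in enumerate(scores):
        top = np.argsort(-row)[:m]          # activate only top-m tokens
        w = np.exp(row[top] - row[top].max())
        w /= w.sum()                        # softmax over the active subset
        out[i] = w @ mem_values[top]
    return out

# Toy usage: 20 historical frame embeddings, retrieve 4 into the bank,
# then run sparse attention with 5 queries over 2 active tokens each.
rng = np.random.default_rng(0)
frames = rng.standard_normal((20, 8))
prompt = rng.standard_normal(8)
bank = update_memory_bank(frames, prompt, k=4)
attended = sparse_memory_attention(rng.standard_normal((5, 8)), bank, bank, m=2)
```

In a real streaming generator the memory bank would hold cached keys/values rather than raw embeddings, and retrieval would run once per chunk before decoding begins.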

Repos / Data Links

Page Count
10 pages

Category
Computer Science:
CV and Pattern Recognition