Score: 0

CloneMem: Benchmarking Long-Term Memory for AI Clones

Published: January 11, 2026 | arXiv ID: 2601.07023v1

By: Sen Hu , Zhiyu Zhang , Yuxiang Wei and more

Potential Business Impact:

AI remembers your whole life for personalized chats.

Business Areas:
Machine Learning Artificial Intelligence, Data and Analytics, Software

AI Clones aim to simulate an individual's thoughts and behaviors to enable long-term, personalized interaction, placing stringent demands on memory systems to model experiences, emotions, and opinions over time. Existing memory benchmarks primarily rely on user-agent conversational histories, which are temporally fragmented and insufficient for capturing continuous life trajectories. We introduce CloneMem, a benchmark for evaluating longterm memory in AI Clone scenarios grounded in non-conversational digital traces, including diaries, social media posts, and emails, spanning one to three years. CloneMem adopts a hierarchical data construction framework to ensure longitudinal coherence and defines tasks that assess an agent's ability to track evolving personal states. Experiments show that current memory mechanisms struggle in this setting, highlighting open challenges for life-grounded personalized AI. Code and dataset are available at https://github.com/AvatarMemory/CloneMemBench

Country of Origin
🇨🇳 China

Page Count
24 pages

Category
Computer Science:
Artificial Intelligence