CloneMem: Benchmarking Long-Term Memory for AI Clones
By: Sen Hu , Zhiyu Zhang , Yuxiang Wei and more
Potential Business Impact:
AI remembers your whole life for personalized chats.
AI Clones aim to simulate an individual's thoughts and behaviors to enable long-term, personalized interaction, placing stringent demands on memory systems to model experiences, emotions, and opinions over time. Existing memory benchmarks primarily rely on user-agent conversational histories, which are temporally fragmented and insufficient for capturing continuous life trajectories. We introduce CloneMem, a benchmark for evaluating longterm memory in AI Clone scenarios grounded in non-conversational digital traces, including diaries, social media posts, and emails, spanning one to three years. CloneMem adopts a hierarchical data construction framework to ensure longitudinal coherence and defines tasks that assess an agent's ability to track evolving personal states. Experiments show that current memory mechanisms struggle in this setting, highlighting open challenges for life-grounded personalized AI. Code and dataset are available at https://github.com/AvatarMemory/CloneMemBench
Similar Papers
RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction
Computation and Language
Helps AI remember long projects to finish them.
KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions
Artificial Intelligence
Helps computers understand people's life stories.
SimpleMem: Efficient Lifelong Memory for LLM Agents
Artificial Intelligence
Makes AI remember more with less effort.