Memories Retrieved from Many Paths: A Multi-Prefix Framework for Robust Detection of Training Data Leakage in Large Language Models
By: Trung Cuong Dang, David Mohaisen
Potential Business Impact:
Detects when AI copies private training data.
Large language models, trained on massive corpora, are prone to verbatim memorization of training data, creating significant privacy and copyright risks. While prior work has proposed various definitions of memorization, many fall short of comprehensively capturing the phenomenon, especially in aligned models. To address this, we introduce a novel framework: multi-prefix memorization. Our core insight is that memorized sequences are deeply encoded and thus retrievable via a significantly larger number of distinct prefixes than non-memorized content. We formalize this by defining a sequence as memorized if an external adversarial search can identify a target count of distinct prefixes that elicit it. This framework shifts the focus from single-path extraction to quantifying the robustness of a memory, measured by the diversity of its retrieval paths. Through experiments on open-source and aligned chat models, we demonstrate that our multi-prefix definition reliably distinguishes memorized from non-memorized data, providing a robust and practical tool for auditing data leakage in LLMs.
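The abstract does not spell out the adversarial prefix search, so the following is a minimal sketch of how the multi-prefix test could look in practice, assuming a Hugging Face causal LM, a cheap stand-in candidate generator, and greedy decoding as the elicitation check. Every name here (elicits, candidate_prefixes, is_multi_prefix_memorized, the gpt2 placeholder model, the target_count and budget parameters) is an illustrative assumption, not the authors' implementation.

# A minimal sketch of the multi-prefix memorization test, under the
# assumptions stated above; the paper's actual external search is not
# described in the abstract and is presumably far stronger than this.
import random
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"  # placeholder; the paper targets open-source and aligned chat models
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)
model.eval()

def elicits(prefix: str, target: str, max_new_tokens: int = 64) -> bool:
    """Greedily decode a continuation of `prefix` and check for the target."""
    inputs = tokenizer(prefix, return_tensors="pt")
    output = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,  # deterministic decoding: one retrieval path per prefix
        pad_token_id=tokenizer.eos_token_id,
    )
    continuation = tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    return target in continuation

def candidate_prefixes(seed_prefix: str, budget: int):
    """Stand-in 'adversarial search': random truncations of a non-empty seed.
    The real search would explore a much richer space of prefixes."""
    words = seed_prefix.split()
    for _ in range(budget):
        k = random.randint(1, len(words))
        yield " ".join(words[-k:])

def is_multi_prefix_memorized(seed_prefix: str, target: str,
                              target_count: int = 5, budget: int = 100) -> bool:
    """Flag `target` as memorized once `target_count` DISTINCT prefixes elicit it."""
    eliciting = set()
    for prefix in candidate_prefixes(seed_prefix, budget):
        if prefix not in eliciting and elicits(prefix, target):
            eliciting.add(prefix)
            if len(eliciting) >= target_count:
                return True  # retrievable via many paths: robustly memorized
    return False

The design choice worth noting is that the decision is driven by the count of distinct eliciting prefixes rather than by whether any single prefix succeeds, which is exactly the shift from single-path extraction to retrieval-path diversity described in the abstract.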
Similar Papers
Assessing and Mitigating Data Memorization Risks in Fine-Tuned Large Language Models
Computation and Language
Keeps private info safe when computers learn.
Beyond Frequency: The Role of Redundancy in Large Language Model Memorization
Machine Learning (CS)
Shows how repeated data drives what AI memorizes.
Early Detection and Reduction of Memorisation for Domain Adaptation and Instruction Tuning
Computation and Language
Stops AI from copying private text.