Score: 2

Forgetting-MarI: LLM Unlearning via Marginal Information Regularization

Published: November 14, 2025 | arXiv ID: 2511.11914v1

By: Shizhou Xu , Yuan Ni , Stefan Broecker and more

BigTech Affiliations: Stanford University

Potential Business Impact:

Makes AI forget specific information without breaking.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

As AI models are trained on ever-expanding datasets, the ability to remove the influence of specific data from trained models has become essential for privacy protection and regulatory compliance. Unlearning addresses this challenge by selectively removing parametric knowledge from the trained models without retraining from scratch, which is critical for resource-intensive models such as Large Language Models (LLMs). Existing unlearning methods often degrade model performance by removing more information than necessary when attempting to ''forget'' specific data. We introduce Forgetting-MarI, an LLM unlearning framework that provably removes only the additional (marginal) information contributed by the data to be unlearned, while preserving the information supported by the data to be retained. By penalizing marginal information, our method yields an explicit upper bound on the unlearn dataset's residual influence in the trained models, providing provable undetectability. Extensive experiments confirm that our approach outperforms current state-of-the-art unlearning methods, delivering reliable forgetting and better preserved general model performance across diverse benchmarks. This advancement represents an important step toward making AI systems more controllable and compliant with privacy and copyright regulations without compromising their effectiveness.

Unlearning Imperative: Securing Trustworthy and Responsible LLMs through Engineered Forgetting

Machine Learning (CS)

Lets AI forget private information when asked.

13 Nov 2025 0

92%

Unlearning That Lasts: Utility-Preserving, Robust, and Almost Irreversible Forgetting in LLMs

Machine Learning (CS)

Removes bad info from AI, making it safer.

2 Sep 2025 1

92%

LLM Unlearning on Noisy Forget Sets: A Study of Incomplete, Rewritten, and Watermarked Data

Machine Learning (CS)

Cleans AI without needing perfect instructions.

10 Oct 2025 2

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

26 pages

Forgetting-MarI: LLM Unlearning via Marginal Information Regularization

Makes AI forget specific information without breaking.

Technical Abstract

Unlearning Imperative: Securing Trustworthy and Responsible LLMs through Engineered Forgetting

Unlearning That Lasts: Utility-Preserving, Robust, and Almost Irreversible Forgetting in LLMs

LLM Unlearning on Noisy Forget Sets: A Study of Incomplete, Rewritten, and Watermarked Data