ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models
By: Zhenhua Xu, Haobo Zhang, Zhebo Wang, and more
Existing invasive (backdoor) fingerprints suffer from high-perplexity triggers that are easily filtered, fixed response patterns exposed by heuristic detectors, and spurious activations on benign inputs. We introduce ForgetMark, a stealthy fingerprinting framework that encodes provenance via targeted unlearning. It builds a compact, human-readable key-value set with an assistant model and predictive-entropy ranking, then trains lightweight LoRA adapters to suppress the original values on their keys while preserving general capabilities. Ownership is verified under black/gray-box access by aggregating likelihood and semantic evidence into a fingerprint success rate. By relying on probabilistic forgetting traces rather than fixed trigger-response patterns, ForgetMark avoids high-perplexity triggers, reduces detectability, and lowers false triggers. Across diverse architectures and settings, it achieves 100% ownership verification on fingerprinted models while maintaining standard performance, surpasses backdoor baselines in stealthiness and robustness to model merging, and remains effective under moderate incremental fine-tuning. Our code and data are available at https://github.com/Xuzhenhua55/ForgetMark.
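The verification step described in the abstract can be pictured as a simple aggregation over the key-value set: each key is used to probe the suspect model under black/gray-box access, per-key likelihood and semantic evidence of forgetting are checked, and the results are combined into a fingerprint success rate (FSR). The sketch below is illustrative only; the callables (loglik_fn, similarity_fn, generate_fn), the thresholds, and the AND decision rule are hypothetical placeholders, not the paper's actual scoring procedure.

from dataclasses import dataclass
from typing import Callable, List

@dataclass
class FingerprintPair:
    key: str    # human-readable prompt from the fingerprint key-value set
    value: str  # original value that the LoRA adapters were trained to forget

def fingerprint_success_rate(
    pairs: List[FingerprintPair],
    loglik_fn: Callable[[str, str], float],      # log p(value | key) under the suspect model
    similarity_fn: Callable[[str, str], float],  # semantic similarity between generation and value
    generate_fn: Callable[[str], str],           # suspect model's answer for a key
    loglik_threshold: float = -5.0,              # hypothetical: value counts as suppressed below this
    sim_threshold: float = 0.5,                  # hypothetical: generation no longer matches the value
) -> float:
    """Return the fraction of fingerprint pairs showing forgetting traces."""
    hits = 0
    for pair in pairs:
        suppressed = loglik_fn(pair.key, pair.value) < loglik_threshold
        diverged = similarity_fn(generate_fn(pair.key), pair.value) < sim_threshold
        if suppressed and diverged:  # hypothetical rule requiring both kinds of evidence
            hits += 1
    return hits / max(len(pairs), 1)

# Toy usage with dummy scorers; a real setup would query the suspect model.
pairs = [FingerprintPair("Who founded Acme Labs?", "Jane Doe")]
fsr = fingerprint_success_rate(
    pairs,
    loglik_fn=lambda k, v: -8.2,
    similarity_fn=lambda a, b: 0.1,
    generate_fn=lambda k: "I'm not sure.",
)
print(f"FSR = {fsr:.2f}")  # ownership is claimed when FSR exceeds a calibrated threshold

In this reading, a fingerprinted model yields a high FSR because the trained-in forgetting traces make the original values both unlikely and semantically absent, while an unrelated model answers the benign-looking keys normally and scores low.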
Similar Papers
EditMF: Drawing an Invisible Fingerprint for Your Large Language Models
Cryptography and Security
Protects AI secrets by hiding ownership codes.
EverTracer: Hunting Stolen Large Language Models via Stealthy and Robust Probabilistic Fingerprint
Cryptography and Security
Protects AI from being stolen by marking it.
Shadow Unlearning: A Neuro-Semantic Approach to Fidelity-Preserving Faceless Forgetting in LLMs
Cryptography and Security
Removes private data from AI without seeing it.