Differentially Private Subspace Fine-Tuning for Large Language Models
By: Lele Zheng, Xiang Wang, Tao Zhang, and more
Potential Business Impact:
Protects private data while teaching computers new skills.
Fine-tuning large language models on downstream tasks is crucial for realizing their cross-domain potential but often relies on sensitive data, raising privacy concerns. Differential privacy (DP) offers rigorous privacy guarantees and has been widely adopted in fine-tuning; however, naively injecting noise across the high-dimensional parameter space creates perturbations with large norms, degrading performance and destabilizing training. To address this issue, we propose DP-SFT, a two-stage subspace fine-tuning method that substantially reduces noise magnitude while preserving formal DP guarantees. Our intuition is that, during fine-tuning, the significant parameter updates lie within a low-dimensional, task-specific subspace, while other directions change minimally. Hence, we inject DP noise only into this subspace, protecting privacy without perturbing irrelevant parameters. In phase one, we identify the subspace by analyzing principal gradient directions to capture task-specific update signals. In phase two, we project full gradients onto this subspace, add DP noise, and map the perturbed gradients back to the original parameter space for model updates, markedly lowering the noise impact. Experiments on multiple datasets demonstrate that DP-SFT improves accuracy and stability under rigorous DP constraints, accelerates convergence, and achieves substantial gains over DP fine-tuning baselines.
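To make the two phases concrete, here is a minimal NumPy sketch of the idea as the abstract describes it. Every name in it (identify_subspace, dp_sft_step, the warm-up gradient matrix, and the hyperparameter values) is an illustrative assumption rather than the authors' released code, and details such as how the warm-up gradients are collected under the privacy budget are left out.

```python
# Minimal sketch of the two-stage DP-SFT idea, under assumptions noted above.
import numpy as np

def identify_subspace(grad_history: np.ndarray, k: int) -> np.ndarray:
    """Phase one: find the top-k principal gradient directions.

    grad_history: (num_steps, d) matrix of flattened gradients collected
    during a warm-up phase (assumed available; requires k <= num_steps).
    Returns U: (d, k) orthonormal basis of the task-specific subspace.
    """
    # SVD of the gradient matrix; the right singular vectors span the
    # principal update directions in parameter space.
    _, _, vt = np.linalg.svd(grad_history, full_matrices=False)
    return vt[:k].T  # shape (d, k)

def dp_sft_step(params, per_example_grads, U, lr=0.1,
                clip_norm=1.0, noise_multiplier=1.0, rng=None):
    """Phase two: clip, project onto the subspace, add Gaussian noise
    there, then map the perturbed update back to parameter space."""
    rng = np.random.default_rng() if rng is None else rng
    n, _ = per_example_grads.shape
    # Per-example clipping bounds each example's contribution (sensitivity).
    norms = np.linalg.norm(per_example_grads, axis=1, keepdims=True)
    clipped = per_example_grads * np.minimum(1.0, clip_norm / (norms + 1e-12))
    # Project the summed gradient onto the k-dimensional subspace.
    proj = U.T @ clipped.sum(axis=0)  # shape (k,)
    # Gaussian noise is added only in the low-dimensional subspace, so its
    # expected norm scales with sqrt(k) rather than sqrt(d).
    noisy = proj + rng.normal(0.0, noise_multiplier * clip_norm, size=proj.shape)
    # Map the privatized update back to the full parameter space and step.
    update = U @ (noisy / n)
    return params - lr * update

# Example usage with toy sizes (d = 1000 parameters, k = 16 directions):
# U = identify_subspace(warmup_grads, k=16)
# params = dp_sft_step(params, batch_grads, U)
```

Because an orthonormal projection is a contraction, clipping before projecting keeps the per-example sensitivity at clip_norm inside the subspace, so the Gaussian mechanism remains correctly calibrated there; the injected noise vector then has expected norm on the order of sqrt(k) instead of sqrt(d), which matches the intuition behind the claimed noise reduction.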
Similar Papers
Parameter-Efficient Fine-Tuning with Differential Privacy for Robust Instruction Adaptation in Large Language Models
Computation and Language
Keeps AI learning private and fast.
Dual-Priv Pruning: Efficient Differential Private Fine-Tuning in Multimodal Large Language Models
Cryptography and Security
Keeps AI's private info safe while learning.
Sequential Subspace Noise Injection Prevents Accuracy Collapse in Certified Unlearning
Machine Learning (CS)
Removes data from AI without hurting its smarts.