PrivacyPAD: A Reinforcement Learning Framework for Dynamic Privacy-Aware Delegation

Published: October 16, 2025 | arXiv ID: 2510.16054v1

By: Zheng Hui, Yijiang River Dong, Sanhanat Sivapiromrat, and more

Potential Business Impact:

Keeps private information safe while still using powerful AI models.

Business Areas:
Privacy and Security

When users submit queries to Large Language Models (LLMs), their prompts often contain sensitive data, forcing a difficult choice: sending the query to a powerful proprietary LLM provider achieves state-of-the-art performance but risks data exposure, while relying on smaller, local models guarantees data privacy but often degrades task performance. Prior approaches have relied on static pipelines that use LLM rewriting, which shatters linguistic coherence and indiscriminately removes privacy-sensitive information, including task-critical content. We reformulate this challenge (Privacy-Conscious Delegation) as a sequential decision-making problem and introduce a novel reinforcement learning (RL) framework, PrivacyPAD, to solve it. Our framework trains an agent to dynamically route text chunks, learning a policy that optimally balances the trade-off between privacy leakage and task performance. It implicitly distinguishes between replaceable Personally Identifiable Information (PII), which it shields locally, and task-critical PII, which it strategically sends to the remote model for maximal utility. To validate our approach in complex scenarios, we also introduce a new medical dataset with high PII density. Our framework achieves a new state-of-the-art on the privacy-utility frontier, demonstrating the necessity of learned, adaptive policies for deploying LLMs in sensitive environments.
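To make the delegation idea concrete, here is a minimal sketch of per-chunk routing. This is not the paper's implementation: the regex-based PII check and the `task_critical` set are hypothetical stand-ins for PrivacyPAD's learned RL policy, which decides routing from context rather than fixed rules.

```python
import re
from dataclasses import dataclass

# Toy stand-in for a learned routing policy: chunks with replaceable PII
# stay on the local model; task-critical chunks go to the remote model.
# SSN-like and email-like patterns serve as a crude PII detector here.
PII_PATTERN = re.compile(r"(\d{3}-\d{2}-\d{4}|[\w.]+@[\w.]+)")

@dataclass
class RoutingDecision:
    chunk: str
    route: str  # "remote" or "local"

def route_chunks(chunks, task_critical):
    """Route each chunk: PII that is not task-critical is shielded locally;
    everything else (including task-critical PII) is delegated remotely."""
    decisions = []
    for chunk in chunks:
        has_pii = bool(PII_PATTERN.search(chunk))
        if has_pii and chunk not in task_critical:
            decisions.append(RoutingDecision(chunk, "local"))
        else:
            decisions.append(RoutingDecision(chunk, "remote"))
    return decisions

chunks = [
    "Patient reports chest pain.",      # no PII -> remote for utility
    "Contact: jane.doe@example.com",    # replaceable PII -> kept local
    "Insurance ID 123-45-6789",         # marked task-critical below -> remote
]
decisions = route_chunks(chunks, task_critical={"Insurance ID 123-45-6789"})
for d in decisions:
    print(d.route, "|", d.chunk)
```

In the actual framework this decision is made by an agent trained with a reward that trades off privacy leakage against downstream task performance, so the boundary between "replaceable" and "task-critical" PII is learned rather than hand-coded as above.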

Country of Origin
🇬🇧 United Kingdom

Page Count
13 pages

Category
Computer Science:
Cryptography and Security