Rethinking Data Protection in the (Generative) Artificial Intelligence Era
By: Yiming Li , Shuo Shao , Yu He and more
Potential Business Impact:
Protects your information in smart computer programs.
The (generative) artificial intelligence (AI) era has profoundly reshaped the meaning and value of data. No longer confined to static content, data now permeates every stage of the AI lifecycle from the training samples that shape model parameters to the prompts and outputs that drive real-world model deployment. This shift renders traditional notions of data protection insufficient, while the boundaries of what needs safeguarding remain poorly defined. Failing to safeguard data in AI systems can inflict societal and individual, underscoring the urgent need to clearly delineate the scope of and rigorously enforce data protection. In this perspective, we propose a four-level taxonomy, including non-usability, privacy preservation, traceability, and deletability, that captures the diverse protection needs arising in modern (generative) AI models and systems. Our framework offers a structured understanding of the trade-offs between data utility and control, spanning the entire AI pipeline, including training datasets, model weights, system prompts, and AI-generated content. We analyze representative technical approaches at each level and reveal regulatory blind spots that leave critical assets exposed. By offering a structured lens to align future AI technologies and governance with trustworthy data practices, we underscore the urgency of rethinking data protection for modern AI techniques and provide timely guidance for developers, researchers, and regulators alike.
Similar Papers
Privacy Preservation in Gen AI Applications
Cryptography and Security
Keeps your private info safe from smart computer programs.
Responsible Data Stewardship: Generative AI and the Digital Waste Problem
Computers and Society
Cleans up computer "junk" to save the planet.
Protecting Human Cognition in the Age of AI
Computers and Society
AI changes how we think and learn.