Who Can See Through You? Adversarial Shielding Against VLM-Based Attribute Inference Attacks

Published: December 20, 2025 | arXiv ID: 2512.18264v1

By: Yucheng Fan, Jiawei Chen, Yu Tian, and more

As vision-language models (VLMs) become widely adopted, VLM-based attribute inference attacks have emerged as a serious privacy concern, enabling adversaries to infer private attributes from images shared on social media. This escalating threat calls for dedicated protection methods to safeguard user privacy. However, existing methods often degrade the visual quality of images or interfere with vision-based functions on social media, and thus fail to strike a desirable balance between privacy protection and user experience. To address this challenge, we propose a novel protection method that jointly optimizes privacy suppression and utility preservation under a visual consistency constraint. Although our method is conceptually effective, fair comparisons between protection methods remain difficult due to the lack of publicly available evaluation datasets. To fill this gap, we introduce VPI-COCO, a publicly available benchmark comprising 522 images with hierarchically structured privacy questions and corresponding non-private counterparts, enabling fine-grained, joint evaluation of protection methods in terms of privacy preservation and user experience. Building on this benchmark, experiments on multiple VLMs show that our method reduces PAR below 25%, keeps NPAR above 88%, maintains high visual consistency, and generalizes well to unseen and paraphrased privacy questions, demonstrating its strong practical applicability for real-world VLM deployments.
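To make the joint objective described in the abstract concrete, here is a minimal sketch of how privacy suppression and utility preservation could be optimized together under a visual consistency constraint, using an adversarial image perturbation bounded in L-infinity norm. The surrogate attribute heads, loss weighting `lam`, and the `epsilon` budget are illustrative assumptions for this sketch, not the paper's actual formulation.

```python
# Sketch: jointly optimize a bounded perturbation that (a) suppresses a
# surrogate private-attribute predictor and (b) preserves a surrogate
# utility predictor, with visual consistency modeled as an L-inf bound.
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical surrogate heads standing in for VLM-based inference.
privacy_head = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 4))   # private attributes
utility_head = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))  # non-private content

image = torch.rand(1, 3, 32, 32)      # image to be shared
private_label = torch.tensor([2])     # attribute an adversary would try to infer
utility_label = torch.tensor([7])     # content the user still wants recognized

epsilon = 8 / 255                     # visual consistency budget (assumed)
lam = 1.0                             # utility-preservation weight (assumed)
delta = torch.zeros_like(image, requires_grad=True)
opt = torch.optim.Adam([delta], lr=1e-2)

for step in range(200):
    x_adv = (image + delta).clamp(0, 1)
    # Privacy suppression: push the surrogate away from the private attribute.
    loss_priv = -F.cross_entropy(privacy_head(x_adv), private_label)
    # Utility preservation: keep the non-private prediction intact.
    loss_util = F.cross_entropy(utility_head(x_adv), utility_label)
    loss = loss_priv + lam * loss_util
    opt.zero_grad()
    loss.backward()
    opt.step()
    # Project back onto the L-inf ball so the change stays visually small.
    with torch.no_grad():
        delta.clamp_(-epsilon, epsilon)

print("max pixel change:", delta.abs().max().item())
```

In this toy setup, lowering the privacy term corresponds to reducing PAR (the adversary's success on private questions), while keeping the utility term small corresponds to keeping NPAR high on non-private questions; the actual method and metrics are defined in the paper.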

Category
Computer Science:
CV and Pattern Recognition