Score: 2

PRISM: Reducing Spurious Implicit Biases in Vision-Language Models with LLM-Guided Embedding Projection

Published: July 11, 2025 | arXiv ID: 2507.08979v1

By: Mahdiyar Molahasani , Azadeh Motamedi , Michael Greenspan and more

Potential Business Impact:

Makes AI see people fairly, not based on looks.

Business Areas:
Image Recognition Data and Analytics, Software

We introduce Projection-based Reduction of Implicit Spurious bias in vision-language Models (PRISM), a new data-free and task-agnostic solution for bias mitigation in VLMs like CLIP. VLMs often inherit and amplify biases in their training data, leading to skewed predictions. PRISM is designed to debias VLMs without relying on predefined bias categories or additional external data. It operates in two stages: first, an LLM is prompted with simple class prompts to generate scene descriptions that contain spurious correlations. Next, PRISM uses our novel contrastive-style debiasing loss to learn a projection that maps the embeddings onto a latent space that minimizes spurious correlations while preserving the alignment between image and text embeddings.Extensive experiments demonstrate that PRISM outperforms current debiasing methods on the commonly used Waterbirds and CelebA datasets We make our code public at: https://github.com/MahdiyarMM/PRISM.

Country of Origin
🇨🇦 Canada

Repos / Data Links

Page Count
12 pages

Category
Computer Science:
CV and Pattern Recognition