Calibrating Uncertainty for Zero-Shot Adversarial CLIP
By: Wenjing Lu, Zerui Tao, Dongping Zhang, and more
Potential Business Impact:
Makes AI more trustworthy against adversarial tricks.
CLIP delivers strong zero-shot classification but remains highly vulnerable to adversarial attacks. Previous work on adversarial fine-tuning largely focuses on matching the predicted logits between clean and adversarial examples, which overlooks uncertainty calibration and may degrade zero-shot generalization. A common expectation in reliable uncertainty estimation is that predictive uncertainty should increase as inputs become more difficult or shift away from the training distribution. However, we frequently observe the opposite in the adversarial setting: perturbations not only degrade accuracy but also suppress uncertainty, leading to severe miscalibration and unreliable over-confidence. This overlooked phenomenon highlights a critical reliability gap beyond robustness. To bridge this gap, we propose a novel adversarial fine-tuning objective for CLIP that accounts for both prediction accuracy and uncertainty alignment. By reparameterizing the output of CLIP as the concentration parameters of a Dirichlet distribution, we obtain a unified representation that captures both the relative semantic structure and the magnitude of predictive confidence. Our objective aligns these distributions holistically under perturbations, moving beyond single-logit anchoring and restoring calibrated uncertainty. Experiments on multiple zero-shot classification benchmarks demonstrate that our approach effectively restores calibrated uncertainty and achieves competitive adversarial robustness while maintaining clean accuracy.
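The sketch below illustrates one plausible way to read the abstract's idea in code: CLIP logits are reparameterized as Dirichlet concentration parameters, and the adversarial Dirichlet is pulled toward the clean one rather than matching a single logit vector. The softplus-plus-one mapping, the KL-based alignment term, and the loss weight lam are assumptions for illustration, not the paper's exact objective.

    # Hypothetical sketch of Dirichlet-based uncertainty alignment for
    # adversarial fine-tuning of CLIP. Details (softplus mapping, KL term,
    # loss weight) are illustrative assumptions, not the paper's formulation.
    import torch
    import torch.nn.functional as F
    from torch.distributions import Dirichlet, kl_divergence

    def dirichlet_from_logits(logits: torch.Tensor) -> Dirichlet:
        """Reparameterize per-class logits as Dirichlet concentrations.

        Softplus keeps concentrations positive; adding 1 yields a flat
        Dirichlet (alpha = 1 per class) when the logits carry no evidence.
        """
        alpha = F.softplus(logits) + 1.0
        return Dirichlet(alpha)

    def uncertainty_aligned_loss(clean_logits: torch.Tensor,
                                 adv_logits: torch.Tensor,
                                 labels: torch.Tensor,
                                 lam: float = 1.0) -> torch.Tensor:
        """Cross-entropy on adversarial logits plus a Dirichlet alignment term.

        The KL term aligns the full adversarial Dirichlet (semantic structure
        and confidence magnitude) with the clean one, instead of anchoring to
        a single logit vector.
        """
        ce = F.cross_entropy(adv_logits, labels)
        p_clean = dirichlet_from_logits(clean_logits.detach())
        p_adv = dirichlet_from_logits(adv_logits)
        align = kl_divergence(p_adv, p_clean).mean()
        return ce + lam * align

In a fine-tuning loop, clean_logits and adv_logits would be the image-text similarity scores produced by CLIP for a clean image and its adversarially perturbed counterpart (e.g., generated with PGD), scored against the same zero-shot text prompts.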
Similar Papers
Can Less Precise Be More Reliable? A Systematic Evaluation of Quantization's Impact on CLIP Beyond Accuracy
CV and Pattern Recognition
Makes AI see and understand better, faster.
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP
CV and Pattern Recognition
Protects AI from being tricked by fake pictures.
Self-Calibrated Consistency can Fight Back for Adversarial Robustness in Vision-Language Models
CV and Pattern Recognition
Protects AI from being tricked by fake pictures.