
SoC: Semantic Orthogonal Calibration for Test-Time Prompt Tuning

Published: January 13, 2026 | arXiv ID: 2601.08617v1

By: Leo Fillioux, Omprakash Chakraborty, Ismail Ben Ayed and more

With the increasing adoption of vision-language models (VLMs) in critical decision-making systems such as healthcare and autonomous driving, the calibration of their uncertainty estimates becomes paramount. Yet calibration remains largely underexplored in the VLM test-time prompt-tuning (TPT) literature, which has predominantly focused on improving discriminative performance. Recent state-of-the-art work advocates enforcing full orthogonality over pairs of text prompt embeddings to enhance separability, and therefore calibration. Nevertheless, as we show theoretically in this work, the gradients induced by fully orthogonal constraints strongly push semantically related classes apart, ultimately making the model overconfident. Based on these findings, we propose Semantic Orthogonal Calibration (SoC), a Huber-based regularizer that enforces smooth prototype separation while preserving semantic proximity, thereby improving calibration compared to prior orthogonality-based approaches. In a comprehensive empirical validation, we demonstrate that SoC consistently improves calibration performance while maintaining competitive discriminative capabilities.
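The contrast the abstract draws between a full-orthogonality penalty and a Huber-based one can be illustrated with a toy sketch. The abstract does not give the SoC formula, so everything below is an assumption made for illustration: the penalties are applied to a single pairwise cosine similarity `s`, the threshold `delta` and the three toy prototypes are hypothetical, and the function names are not taken from the paper. The only point is that a squared penalty has a gradient that grows with `s`, so the most similar (semantically related) pairs are pushed hardest, while a Huber penalty caps that push at `delta`.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def squared_penalty_grad(s):
    """Gradient of the squared penalty s**2 with respect to s.

    Assumed stand-in for a full-orthogonality constraint: the push
    grows linearly with the similarity itself.
    """
    return 2.0 * s


def huber_penalty(s, delta=0.5):
    """Standard Huber function of s: quadratic near zero, linear beyond delta."""
    a = abs(s)
    if a <= delta:
        return 0.5 * s * s
    return delta * (a - 0.5 * delta)


def huber_penalty_grad(s, delta=0.5):
    """Gradient of the Huber function: equal to s, clipped to [-delta, delta]."""
    return max(-delta, min(delta, s))


# Hypothetical toy prototypes: "cat" and "dog" are close, "car" is far from both.
cat = [1.0, 0.2, 0.0]
dog = [0.9, 0.4, 0.0]
car = [0.0, 0.1, 1.0]

if __name__ == "__main__":
    for name, pair in (("cat/dog", (cat, dog)), ("cat/car", (cat, car))):
        s = cosine_similarity(*pair)
        print(name, "similarity:", round(s, 3),
              "squared grad:", round(squared_penalty_grad(s), 3),
              "huber grad:", round(huber_penalty_grad(s), 3))
```

Under these assumptions, the related pair receives a much larger repulsive gradient than the unrelated pair with the squared penalty, whereas the Huber gradient saturates once the similarity exceeds `delta`. Whether SoC applies the Huber function in exactly this way is not stated in the abstract.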

O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models

CV and Pattern Recognition

Makes AI image guesses more trustworthy and accurate.

15 Mar 2025 1


D-TPT: Dimensional Entropy Maximization for Calibrating Test-Time Prompt Tuning in Vision-Language Models

CV and Pattern Recognition

Makes AI better at understanding new things.

10 Oct 2025 0


Object-Level Verbalized Confidence Calibration in Vision-Language Models via Semantic Perturbation

CV and Pattern Recognition

Makes AI tell you when it's unsure.

21 Apr 2025 0
