Retrieval-Augmented Prompt for OOD Detection
By: Ruisong Han , Zongbo Han , Jiahao Zhang and more
Potential Business Impact:
Helps computers spot weird pictures they haven't seen.
Out-of-Distribution (OOD) detection is crucial for the reliable deployment of machine learning models in-the-wild, enabling accurate identification of test samples that differ from the training data distribution. Existing methods rely on auxiliary outlier samples or in-distribution (ID) data to generate outlier information for training, but due to limited outliers and their mismatch with real test OOD samples, they often fail to provide sufficient semantic supervision, leading to suboptimal performance. To address this, we propose a novel OOD detection method called Retrieval-Augmented Prompt (RAP). RAP augments a pre-trained vision-language model's prompts by retrieving external knowledge, offering enhanced semantic supervision for OOD detection. During training, RAP retrieves descriptive words for outliers based on joint similarity with external textual knowledge and uses them to augment the model's OOD prompts. During testing, RAP dynamically updates OOD prompts in real-time based on the encountered OOD samples, enabling the model to rapidly adapt to the test environment. Our extensive experiments demonstrate that RAP achieves state-of-the-art performance on large-scale OOD detection benchmarks. For example, in 1-shot OOD detection on the ImageNet-1k dataset, RAP reduces the average FPR95 by 7.05% and improves the AUROC by 1.71% compared to previous methods. Additionally, comprehensive ablation studies validate the effectiveness of each module and the underlying motivations of our approach.
Similar Papers
Prompt Optimization Meets Subspace Representation Learning for Few-shot Out-of-Distribution Detection
Machine Learning (CS)
AI spots new things it hasn't seen before.
RS-OOD: A Vision-Language Augmented Framework for Out-of-Distribution Detection in Remote Sensing
CV and Pattern Recognition
Find new things in satellite pictures.
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
CV and Pattern Recognition
Makes videos better by improving text instructions.