Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection
By: Zhimao Peng, Enguang Wang, Xialei Liu and more
Potential Business Impact:
Helps computers recognize inputs that fall outside the categories they were trained on.
Semantically coherent out-of-distribution detection (SCOOD) is a recently proposed, realistic OOD detection setting: given labeled in-distribution (ID) data and unlabeled data that mixes ID and OOD samples as training data, SCOOD aims to train a model that accurately identifies OOD samples in the test data. Current SCOOD methods mainly adopt clustering-based in-distribution sample filtering (IDF) strategies to select clean ID samples from the unlabeled data and treat the remaining samples as auxiliary OOD data, which inevitably introduces a large number of noisy samples into training. To address this issue, we propose a concise SCOOD framework based on predictive sample assignment (PSA). PSA comprises two components: a dual-threshold ternary sample assignment strategy based on the predictive energy score, which significantly improves the purity of the selected ID and OOD sample sets by assigning low-confidence unlabeled data to an additional discard set; and a concept contrastive representation learning loss, which further enlarges the distance between ID and OOD samples in the representation space to assist ID/OOD discrimination. We also introduce a retraining strategy to help the model fully fit the selected auxiliary ID/OOD samples. Experiments on two standard SCOOD benchmarks demonstrate that our approach outperforms state-of-the-art methods by a significant margin.
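The dual-threshold ternary assignment described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the common free-energy form of the score, E(x) = -T * logsumexp(logits / T), where lower energy is treated as more ID-like, and it uses hand-picked thresholds. The function names `energy_score` and `assign_ternary`, the parameters `id_threshold` and `ood_threshold`, and the example logits are all hypothetical; the abstract does not say how the thresholds are chosen.

```python
import math
from typing import List, Sequence


def energy_score(logits: Sequence[float], temperature: float = 1.0) -> float:
    """Assumed free-energy score: -T * logsumexp(logits / T).

    Lower values indicate a more confident (more ID-like) prediction.
    """
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    lse = m + math.log(sum(math.exp(s - m) for s in scaled))
    return -temperature * lse


def assign_ternary(
    energies: Sequence[float],
    id_threshold: float,
    ood_threshold: float,
) -> List[str]:
    """Split unlabeled samples into 'id', 'ood', and 'discard' sets.

    Hypothetical rule: energy <= id_threshold  -> 'id',
                       energy >= ood_threshold -> 'ood',
                       anything in between     -> 'discard'.
    """
    if id_threshold > ood_threshold:
        raise ValueError("id_threshold must not exceed ood_threshold")
    labels = []
    for e in energies:
        if e <= id_threshold:
            labels.append("id")
        elif e >= ood_threshold:
            labels.append("ood")
        else:
            labels.append("discard")
    return labels


# Illustrative logits for three unlabeled samples (made-up values).
unlabeled_logits = [
    [10.0, 0.0, 0.0],   # one dominant class: low energy
    [0.1, 0.0, -0.1],   # nearly flat logits: high energy
    [2.0, 1.0, 0.0],    # in between: neither set is a confident fit
]
energies = [energy_score(z) for z in unlabeled_logits]
assignment = assign_ternary(energies, id_threshold=-5.0, ood_threshold=-1.5)
print(assignment)
```

Under these assumptions, only samples with extreme energies enter the auxiliary ID or OOD sets, and the ambiguous middle band is held out of training, which is the purity argument the abstract makes for adding a discard set.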
Similar Papers
BootOOD: Self-Supervised Out-of-Distribution Detection via Synthetic Sample Exposure under Neural Collapse
CV and Pattern Recognition
Helps computers spot fake pictures, even tricky ones.
Pseudo-label Induced Subspace Representation Learning for Robust Out-of-Distribution Detection
Machine Learning (CS)
Helps AI spot fake or new information.
SupLID: Geometrical Guidance for Out-of-Distribution Detection in Semantic Segmentation
CV and Pattern Recognition
Helps self-driving cars spot weird things.