Utilization of Neighbor Information for Image Classification with Different Levels of Supervision
By: Gihan Jayatilaka, Abhinav Shrivastava, Matthew Gwilliam
Potential Business Impact:
Teaches computers to sort pictures better.
We propose to bridge the gap between semi-supervised and unsupervised image recognition with a flexible method that performs well for both generalized category discovery (GCD) and image clustering. Despite the overlap in motivation between these tasks, the methods themselves are restricted to a single task -- GCD methods are reliant on the labeled portion of the data, and deep image clustering methods have no built-in way to leverage the labels efficiently. We connect the two regimes with an innovative approach that Utilizes Neighbor Information for Classification (UNIC) both in the unsupervised (clustering) and semisupervised (GCD) setting. State-of-the-art clustering methods already rely heavily on nearest neighbors. We improve on their results substantially in two parts, first with a sampling and cleaning strategy where we identify accurate positive and negative neighbors, and secondly by finetuning the backbone with clustering losses computed by sampling both types of neighbors. We then adapt this pipeline to GCD by utilizing the labelled images as ground truth neighbors. Our method yields state-of-the-art results for both clustering (+3% ImageNet-100, Imagenet200) and GCD (+0.8% ImageNet-100, +5% CUB, +2% SCars, +4% Aircraft).
Similar Papers
Unsupervised Image Classification with Adaptive Nearest Neighbor Selection and Cluster Ensembles
CV and Pattern Recognition
Groups pictures automatically, making computers smarter.
Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization
CV and Pattern Recognition
Teaches computers to learn from less-sure guesses.
Learning Part Knowledge to Facilitate Category Understanding for Fine-Grained Generalized Category Discovery
CV and Pattern Recognition
Helps computers tell apart very similar things.