Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching
By: Wonseok Choi , Sohwi Lim , Nam Hyeon-Woo and more
Potential Business Impact:
Finds exact same objects in different pictures.
Instance-level image retrieval aims to find images containing the same object as a given query, despite variations in size, position, or appearance. To address this challenging task, we propose Patchify, a simple yet effective patch-wise retrieval framework that offers high performance, scalability, and interpretability without requiring fine-tuning. Patchify divides each database image into a small number of structured patches and performs retrieval by comparing these local features with a global query descriptor, enabling accurate and spatially grounded matching. To assess not just retrieval accuracy but also spatial correctness, we introduce LocScore, a localization-aware metric that quantifies whether the retrieved region aligns with the target object. This makes LocScore a valuable diagnostic tool for understanding and improving retrieval behavior. We conduct extensive experiments across multiple benchmarks, backbones, and region selection strategies, showing that Patchify outperforms global methods and complements state-of-the-art reranking pipelines. Furthermore, we apply Product Quantization for efficient large-scale retrieval and highlight the importance of using informative features during compression, which significantly boosts performance. Project website: https://wons20k.github.io/PatchwiseRetrieval/
Similar Papers
Spatially-Grounded Document Retrieval via Patch-to-Region Relevance Propagation
CV and Pattern Recognition
Finds exact text parts in documents.
Back To The Drawing Board: Rethinking Scene-Level Sketch-Based Image Retrieval
CV and Pattern Recognition
Finds real pictures from simple drawings.
Benchmarking Adversarial Patch Selection and Location
CV and Pattern Recognition
Makes computer vision models easily fooled.