Beyond Accuracy: Uncovering the Role of Similarity Perception and its Alignment with Semantics in Supervised Learning
By: Katarzyna Filus, Mateusz Żarski
Potential Business Impact:
Shows how computer "eyes" learn to see similar things.
Similarity manifests in various forms, including semantic similarity that is particularly important, serving as an approximation of human object categorization based on e.g. shared functionalities and evolutionary traits. It also offers practical advantages in computational modeling via lexical structures such as WordNet with constant and interpretable similarity. As in the domain of deep vision, there is still not enough focus on the phenomena regarding the similarity perception emergence. We introduce Deep Similarity Inspector (DSI) -- a systematic framework to inspect how deep vision networks develop their similarity perception and its alignment with semantic similarity. Our experiments show that both Convolutional Neural Networks' (CNNs) and Vision Transformers' (ViTs) develop a rich similarity perception during training with 3 phases (initial similarity surge, refinement, stabilization), with clear differences between CNNs and ViTs. Besides the gradual mistakes elimination, the mistakes refinement phenomenon can be observed.
Similar Papers
Semantic Depth Matters: Explaining Errors of Deep Vision Networks through Perceived Class Similarities
CV and Pattern Recognition
Shows how computers learn and make mistakes.
Rethinking Word Similarity: Semantic Similarity through Classification Confusion
Computation and Language
Helps computers understand word meanings better.
Seeing Eye to AI? Applying Deep-Feature-Based Similarity Metrics to Information Visualization
Human-Computer Interaction
Helps computers tell if charts look alike.