Semantic-Cohesive Knowledge Distillation for Deep Cross-modal Hashing

Published: October 7, 2025 | arXiv ID: 2510.09664v1

By: Changchang Sun, Vickie Chen, Yan Yan

Potential Business Impact:

Helps computers understand images and text together.

Business Areas:

Semantic Search Internet Services

Recently, deep supervised cross-modal hashing methods have achieved compelling success by learning semantic information in a self-supervised way. However, they still suffer from a key limitation: the multi-label semantic extraction process fails to interact explicitly with the raw multimodal data, so the learned representation-level semantic information is not compatible with the heterogeneous multimodal data, which hinders bridging the modality gap. To address this limitation, we propose a novel semantic-cohesive knowledge distillation scheme for deep cross-modal hashing, dubbed SODA. Specifically, the multi-label information is introduced as a new textual modality and reformulated as a set of ground-truth label prompts, depicting the semantics presented in the image as the text modality does. Then, a cross-modal teacher network is devised to distill cross-modal semantic characteristics between the image and label modalities and thus learn a well-mapped Hamming space for the image modality. This Hamming space can be regarded as prior knowledge that guides the learning of the cross-modal student network and comprehensively preserves the semantic similarities between the image and text modalities. Extensive experiments on two benchmark datasets demonstrate the superiority of our model over state-of-the-art methods.
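The ideas named in the abstract (label prompts, a Hamming space, and a teacher guiding a student) can be illustrated with a small NumPy sketch. This is not the SODA implementation: the prompt template, the function names, the sign-based binarization, and the mean-squared distillation loss are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np

def labels_to_prompts(label_vector, class_names, template="a photo of a {}"):
    """Turn a multi-hot label vector into one text prompt per active label.

    The template string is a hypothetical choice, not taken from the paper.
    """
    return [template.format(name)
            for name, on in zip(class_names, label_vector) if on]

def binarize(features):
    """Map real-valued features to hash codes in {-1, +1} by their sign."""
    return np.where(features >= 0, 1, -1)

def hamming_distance(query_code, database_codes):
    """Hamming distance between one {-1, +1} code and each row of a code matrix.

    For codes of length K, distance = (K - inner product) / 2.
    """
    n_bits = database_codes.shape[1]
    return (n_bits - database_codes @ query_code) // 2

def distillation_loss(student_features, teacher_codes):
    """Assumed loss: mean squared gap between student outputs and fixed teacher codes."""
    return float(np.mean((student_features - teacher_codes) ** 2))

# Hypothetical usage: teacher codes act as fixed targets for the student.
class_names = ["dog", "cat", "tree"]
prompts = labels_to_prompts([1, 0, 1], class_names)

teacher_codes = binarize(np.array([[0.9, -0.4, 0.2, 0.7],
                                   [-0.8, 0.5, -0.1, -0.6]]))
student_features = np.array([[0.8, -0.3, 0.1, 0.9],
                             [-0.7, 0.6, -0.2, -0.5]])
loss = distillation_loss(student_features, teacher_codes)
distances = hamming_distance(binarize(student_features[0]), teacher_codes)
```

In this sketch, retrieval would rank database items by `hamming_distance`, and a lower `distillation_loss` means the student's outputs sit closer to the teacher's Hamming space.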

Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency

CV and Pattern Recognition

Teaches computers to learn from different kinds of pictures.

12 Nov 2025 1

Information-Theoretic Criteria for Knowledge Distillation in Multimodal Learning

Machine Learning (CS)

Teaches computers to learn better from different kinds of information.

15 Oct 2025 0

CCSD: Cross-Modal Compositional Self-Distillation for Robust Brain Tumor Segmentation with Missing Modalities

CV and Pattern Recognition

Helps doctors find brain tumors even with missing scans.

18 Nov 2025 1

Country of Origin

🇺🇸 United States

Page Count

11 pages
