CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation
By: Zherui Zhang, Changwei Wang, Rongtao Xu and more
Potential Business Impact:
Teaches computers new skills without needing old examples.
Data-Free Knowledge Distillation (DFKD) enables knowledge transfer from a given pre-trained teacher network to a target student model without access to the real training data. Existing DFKD methods focus primarily on improving image recognition performance on the associated datasets, often neglecting a crucial aspect: the transferability of the learned representations. In this paper, we propose Category-Aware Embedding Data-Free Knowledge Distillation (CAE-DFKD), which operates at the embedding level to address the limitations of previous approaches that rely on image-level techniques to improve model generalization but fail when applied directly to DFKD. The superiority and flexibility of CAE-DFKD are extensively evaluated, including: i) significant efficiency gains from altering the generator training paradigm; ii) competitive performance against existing state-of-the-art DFKD methods on image recognition tasks; iii) remarkable transferability of the data-free learned representations, demonstrated on downstream tasks.
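To make the setting concrete, below is a minimal sketch of the generic data-free distillation loop the abstract builds on: a generator synthesizes images, and the student is trained to match the frozen teacher's soft predictions on them. All names here (Generator, distill_step, latent_dim, the architecture and the temperature) are illustrative assumptions for a standard DFKD pipeline, not the authors' CAE-DFKD implementation, which additionally applies category-aware objectives at the embedding level.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

LATENT_DIM = 128  # assumed latent size for the synthetic-image generator

class Generator(nn.Module):
    """Maps latent noise to fake 32x32 RGB images (assumed architecture)."""
    def __init__(self, latent_dim=LATENT_DIM):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim, 256 * 8 * 8),
            nn.Unflatten(1, (256, 8, 8)),
            nn.Upsample(scale_factor=2),
            nn.Conv2d(256, 128, 3, padding=1),
            nn.BatchNorm2d(128), nn.ReLU(inplace=True),
            nn.Upsample(scale_factor=2),
            nn.Conv2d(128, 3, 3, padding=1),
            nn.Tanh(),
        )

    def forward(self, z):
        return self.net(z)

def distill_step(generator, teacher, student, optim_s, batch_size=64, T=4.0):
    """One student update on synthetic data: no real images are used.

    The student mimics the frozen teacher via temperature-scaled KL divergence.
    """
    teacher.eval()
    z = torch.randn(batch_size, LATENT_DIM)
    with torch.no_grad():
        fake = generator(z)        # synthetic images stand in for real data
        t_logits = teacher(fake)   # frozen teacher's soft targets
    s_logits = student(fake)
    loss = F.kl_div(F.log_softmax(s_logits / T, dim=1),
                    F.softmax(t_logits / T, dim=1),
                    reduction="batchmean") * T * T
    optim_s.zero_grad()
    loss.backward()
    optim_s.step()
    return loss.item()
```

In this generic loop the generator and student are typically trained in alternation; CAE-DFKD's reported efficiency gains come from changing that generator training paradigm, and its transferability gains from category-aware supervision in the embedding space rather than at the image level.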
Similar Papers
Data-free Knowledge Distillation with Diffusion Models
CV and Pattern Recognition
Teaches computers new skills without needing old examples.
Adversarial Curriculum Graph-Free Knowledge Distillation for Graph Neural Networks
Machine Learning (CS)
Teaches computers to learn without needing real examples.
HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast
Machine Learning (CS)
Helps AI learn from more phones, better.