DiRe: Diversity-promoting Regularization for Dataset Condensation
By: Saumyaranjan Mohanty, Aravind Reddy, Konda Reddy Mopuri
Potential Business Impact:
Makes AI learn faster with less data.
In Dataset Condensation, the goal is to synthesize a small dataset that replicates the training utility of a large original dataset. Existing condensation methods synthesize datasets with significant redundancy, so there is a dire need to reduce redundancy and improve the diversity of the synthesized datasets. To tackle this, we propose an intuitive Diversity Regularizer (DiRe) composed of cosine similarity and Euclidean distance, which can be applied off-the-shelf to various state-of-the-art condensation methods. Through extensive experiments, we demonstrate that the addition of our regularizer improves state-of-the-art condensation methods on various benchmark datasets from CIFAR-10 to ImageNet-1K with respect to generalization and diversity metrics.
Similar Papers
Dataset Condensation with Color Compensation
CV and Pattern Recognition
Makes computer learning better by fixing image colors.
A Discrepancy-Based Perspective on Dataset Condensation
Machine Learning (CS)
Makes small fake data teach computers like big real data.
Improving Noise Efficiency in Privacy-preserving Dataset Distillation
CV and Pattern Recognition
Makes private data safe for computers to learn.