Score: 2

DiRe: Diversity-promoting Regularization for Dataset Condensation

Published: December 15, 2025 | arXiv ID: 2512.13083v1

By: Saumyaranjan Mohanty, Aravind Reddy, Konda Reddy Mopuri

Potential Business Impact:

Makes AI learn faster with less data.

Business Areas:
Image Recognition Data and Analytics, Software

In Dataset Condensation, the goal is to synthesize a small dataset that replicates the training utility of a large original dataset. Existing condensation methods synthesize datasets with significant redundancy, so there is a dire need to reduce redundancy and improve the diversity of the synthesized datasets. To tackle this, we propose an intuitive Diversity Regularizer (DiRe) composed of cosine similarity and Euclidean distance, which can be applied off-the-shelf to various state-of-the-art condensation methods. Through extensive experiments, we demonstrate that the addition of our regularizer improves state-of-the-art condensation methods on various benchmark datasets from CIFAR-10 to ImageNet-1K with respect to generalization and diversity metrics.

Country of Origin
🇮🇳 India

Repos / Data Links

Page Count
15 pages

Category
Computer Science:
CV and Pattern Recognition