Which Layer Causes Distribution Deviation? Entropy-Guided Adaptive Pruning for Diffusion and Flow Models
By: Changlin Li , Jiawei Zhang , Zeyi Shi and more
Potential Business Impact:
Makes AI art generators faster and smaller.
Large-scale vision generative models, including diffusion and flow models, have demonstrated remarkable performance in visual generation tasks. However, transferring these pre-trained models to downstream tasks often results in significant parameter redundancy. In this paper, we propose EntPruner, an entropy-guided automatic progressive pruning framework for diffusion and flow models. First, we introduce entropy-guided pruning, a block-level importance assessment strategy specifically designed for generative models. Unlike discriminative models, generative models require preserving the diversity and condition-fidelity of the output distribution. As the importance of each module can vary significantly across downstream tasks, EntPruner prioritizes pruning of less important blocks using data-dependent Conditional Entropy Deviation (CED) as a guiding metric. CED quantifies how much the distribution diverges from the learned conditional data distribution after removing a block. Second, we propose a zero-shot adaptive pruning framework to automatically determine when and how much to prune during training. This dynamic strategy avoids the pitfalls of one-shot pruning, mitigating mode collapse, and preserving model performance. Extensive experiments on DiT and SiT models demonstrate the effectiveness of EntPruner, achieving up to 2.22$\times$ inference speedup while maintaining competitive generation quality on ImageNet and three downstream datasets.
Similar Papers
Teacher-Guided One-Shot Pruning via Context-Aware Knowledge Distillation
CV and Pattern Recognition
Makes computer programs smaller without losing quality.
Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning
CV and Pattern Recognition
Makes AI video creation faster and use less power.
Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
CV and Pattern Recognition
Makes AI image makers smaller, faster, and cheaper.