Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets

Published: August 28, 2025 | arXiv ID: 2508.21032v1

By: Dale Decatur, Thibault Groueix, Wang Yifan, and more

Potential Business Impact:

Makes large-scale AI image generation faster and cheaper.

Business Areas:
Image Recognition, Data and Analytics, Software

Text-to-image diffusion models enable high-quality image generation but are computationally expensive. While prior work optimizes per-inference efficiency, we explore an orthogonal approach: reducing redundancy across correlated prompts. Our method leverages the coarse-to-fine nature of diffusion models, where early denoising steps capture shared structure among similar prompts. We propose a training-free approach that clusters prompts by semantic similarity and shares computation in the early diffusion steps. Experiments show that for models conditioned on image embeddings, our approach significantly reduces compute cost while improving image quality. By leveraging unCLIP's text-to-image prior, we enhance diffusion step allocation for greater efficiency. Our method integrates seamlessly with existing pipelines, scales with the size of the prompt set, and reduces the environmental and financial burden of large-scale text-to-image generation. Project page: https://ddecatur.github.io/hierarchical-diffusion/
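The core idea, clustering prompts by embedding similarity and running the early denoising steps once per cluster before branching per prompt, can be sketched in a few lines. This is an illustrative toy, not the paper's implementation: the embedding function, denoising step, similarity threshold, and step split are all stand-ins supplied by the caller.

```python
import numpy as np

def cluster_prompts(embeddings, threshold=0.8):
    """Greedy clustering by cosine similarity: each prompt joins the first
    cluster whose centroid is similar enough, else starts a new cluster.
    (Illustrative stand-in for semantic clustering; the threshold is a
    made-up hyperparameter.)"""
    clusters = []   # list of lists of prompt indices
    centroids = []  # running (unnormalized) sum of member embeddings
    for i, e in enumerate(embeddings):
        e = np.asarray(e, dtype=float)
        e = e / np.linalg.norm(e)
        placed = False
        for c, mu in enumerate(centroids):
            if np.dot(e, mu / np.linalg.norm(mu)) >= threshold:
                clusters[c].append(i)
                centroids[c] = mu + e
                placed = True
                break
        if not placed:
            clusters.append([i])
            centroids.append(e.copy())
    return clusters

def generate_set(prompts, embed, denoise_step, total_steps=50, shared_steps=20):
    """Run the first `shared_steps` denoising steps once per cluster
    (conditioned on the cluster's mean embedding), then branch and finish
    the remaining steps per prompt. `embed` and `denoise_step` are
    caller-supplied stand-ins for a text encoder and one diffusion step."""
    embs = [np.asarray(embed(p), dtype=float) for p in prompts]
    clusters = cluster_prompts(embs)
    images = [None] * len(prompts)
    for members in clusters:
        # Shared trunk: denoise once from the initial latent for the whole cluster.
        mean_emb = np.mean([embs[i] for i in members], axis=0)
        x = np.zeros_like(embs[members[0]])  # stand-in latent
        for t in range(shared_steps):
            x = denoise_step(x, mean_emb, t)
        # Per-prompt branches: each prompt finishes its own denoising.
        for i in members:
            xi = x.copy()
            for t in range(shared_steps, total_steps):
                xi = denoise_step(xi, embs[i], t)
            images[i] = xi
    return images, clusters
```

With a cluster of size k, the shared trunk replaces k copies of the first `shared_steps` steps with one, so compute savings grow with how many prompts land in the same cluster.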

Page Count
12 pages

Category
Computer Science:
CV and Pattern Recognition