DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off
By: Jusheng Zhang, Yijia Fan, Kaitong Cai, and more
Potential Business Impact:
Makes computers write long stories faster and better.
This paper introduces DrDiff, a novel framework for long-text generation that overcomes the efficiency-quality trade-off through three core technologies. First, we design a dynamic expert scheduling mechanism that intelligently allocates computational resources during the diffusion process based on text complexity, enabling more efficient handling of generation tasks of varying difficulty. Second, we introduce a Hierarchical Sparse Attention (HSA) mechanism that adaptively adjusts attention patterns according to input length, reducing computational complexity from $O(n^2)$ to $O(n)$ while maintaining model performance. Finally, we propose a soft absorption guidance optimization strategy that, combined with DPM-Solver++, reduces the number of diffusion steps and significantly improves generation speed. Comprehensive experiments on multiple long-text generation benchmarks demonstrate the superiority of DrDiff over existing SOTA methods.
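The abstract names the HSA mechanism but not its internals, so below is a minimal sketch of one standard way a hierarchical (local-window plus pooled-summary) sparse attention can reach linear cost, in the spirit of what the abstract describes. Everything here is an assumption for illustration, not DrDiff's actual implementation: the function name hierarchical_sparse_attention, the window and n_summary parameters, and the choice of mean-pooling for the coarse level are all hypothetical. Each query attends to a fixed-size local neighborhood plus a fixed number of summary tokens, so per-query cost is O(window + n_summary) and total cost is O(n).

```python
# Hedged sketch of a two-level (hierarchical) sparse attention pattern.
# Assumed design: fine level = sliding local window over raw keys/values;
# coarse level = a fixed number of pooled "summary" tokens whose pooling
# stride grows with sequence length (one way to "adapt the pattern to
# input length"). Names and defaults are illustrative.
import torch
import torch.nn.functional as F

def hierarchical_sparse_attention(q, k, v, window=64, n_summary=32):
    """q, k, v: (batch, n, d). Returns (batch, n, d)."""
    b, n, d = q.shape
    scale = d ** -0.5

    # Coarse level: pool the full sequence into n_summary tokens.
    # adaptive_avg_pool1d expects (batch, channels, length).
    k_sum = F.adaptive_avg_pool1d(k.transpose(1, 2), n_summary).transpose(1, 2)
    v_sum = F.adaptive_avg_pool1d(v.transpose(1, 2), n_summary).transpose(1, 2)

    out = torch.zeros_like(q)
    for i in range(0, n, window):
        j = min(i + window, n)
        lo, hi = max(0, i - window), min(n, j + window)  # local neighborhood
        k_loc = torch.cat([k[:, lo:hi], k_sum], dim=1)   # fine + coarse keys
        v_loc = torch.cat([v[:, lo:hi], v_sum], dim=1)
        attn = torch.softmax((q[:, i:j] @ k_loc.transpose(1, 2)) * scale, dim=-1)
        out[:, i:j] = attn @ v_loc
    return out

# Usage: a 1024-token sequence costs ~n * (3*window + n_summary) * d,
# linear in n for fixed window and n_summary.
q = k = v = torch.randn(2, 1024, 64)
y = hierarchical_sparse_attention(q, k, v)  # -> (2, 1024, 64)
```

The fixed n_summary is what makes the pattern strictly linear; schemes that add one summary token per block of size b are instead O(n^2 / b). How DrDiff actually structures its hierarchy is not specified in the abstract.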
Similar Papers
SA-DiffuSeq: Addressing Computational and Scalability Challenges in Long-Document Generation with Sparse Attention
Computation and Language
Makes computers write long stories faster.
MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts
Computation and Language
Writes long stories and code much faster.
DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval
Information Retrieval
Tricks computers into misidentifying images using text.