MC-SJD: Maximal Coupling Speculative Jacobi Decoding for Autoregressive Visual Generation Acceleration
By: Junhyuk So, Hyunho Kook, Chaeyeon Jang and more
Potential Business Impact:
Makes AI create pictures and videos much faster.
While autoregressive (AR) modeling has recently emerged as a new paradigm in visual generation, its practical adoption is severely constrained by the slow inference speed of per-token generation, which often requires thousands of steps to produce a single sample. To address this challenge, we propose MC-SJD, a training-free, lossless parallel decoding framework designed to accelerate AR visual generation by extending the recently introduced Speculative Jacobi Decoding (SJD). Although SJD shows strong potential for accelerating AR generation, we demonstrate that token instability across iterations significantly reduces the acceptance rate, a limitation that primarily arises from the independent sampling process used during draft token generation. To overcome this, MC-SJD takes an information-theoretic approach based on coupling, which substantially accelerates standard SJD by maximizing the probability of sampling identical draft tokens across consecutive iterations, all while preserving its lossless property. Remarkably, this method requires only a single-line modification to the existing algorithm, yet achieves substantial performance gains, delivering up to ~4.2x acceleration in image generation and ~13.3x acceleration in video generation compared to standard AR decoding, without any degradation in output quality.
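To make the coupling idea concrete, here is a minimal sketch of the textbook maximal coupling of two categorical distributions. It is not taken from the paper's code: the function name `maximal_coupling_sample`, the toy distributions `p` and `q`, and the sample count are all assumptions chosen for illustration. Here `p` stands in for the draft distribution at one iteration, `q` for the distribution at the next iteration, and `x` for the token already drawn from `p`. The new token is still distributed exactly according to `q`, but it equals `x` as often as is possible.

```python
import numpy as np

def maximal_coupling_sample(p, q, x, rng):
    """Given x drawn from p, return y distributed as q with P(y == x) maximized.

    Illustrative helper (hypothetical name), not the authors' implementation.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    # Keep the previous token with probability min(1, q[x] / p[x]).
    if rng.random() * p[x] < q[x]:
        return int(x)
    # Otherwise draw from the part of q that p does not already cover.
    # A rejection implies q[x] < p[x], so this residual has positive mass.
    residual = np.maximum(q - p, 0.0)
    return int(rng.choice(len(q), p=residual / residual.sum()))

# Toy distributions (assumed values, for demonstration only).
rng = np.random.default_rng(0)
p = np.array([0.5, 0.3, 0.2])
q = np.array([0.4, 0.4, 0.2])
n = 20000

xs = rng.choice(3, size=n, p=p)                      # tokens from the old distribution
ys = np.array([maximal_coupling_sample(p, q, x, rng) for x in xs])
ys_indep = rng.choice(3, size=n, p=q)                # independent resampling baseline

agreement = float(np.mean(xs == ys))                 # theory: sum(min(p, q)) = 0.9
agreement_indep = float(np.mean(xs == ys_indep))     # theory: sum(p * q) = 0.36
marginal = np.bincount(ys, minlength=3) / n          # should stay close to q

print(agreement, agreement_indep, marginal)
```

In this toy setting the coupled sampler keeps the same token far more often than independent resampling does, while the empirical distribution of the new tokens still matches `q`. That is the property the abstract relies on: fewer token changes between iterations without altering the output distribution.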
Similar Papers
SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation
CV and Pattern Recognition
Makes AI create pictures much faster.
Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation
CV and Pattern Recognition
Makes AI draw pictures much faster.
VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping
CV and Pattern Recognition
Makes AI create pictures much faster.