An Iteration-Free Fixed-Point Estimator for Diffusion Inversion
By: Yifei Chen , Kaiyu Song , Yan Pan and more
Potential Business Impact:
Makes AI recreate images from noise faster.
Diffusion inversion aims to recover the initial noise corresponding to a given image such that this noise can reconstruct the original image through the denoising diffusion process. The key component of diffusion inversion is to minimize errors at each inversion step, thereby mitigating cumulative inaccuracies. Recently, fixed-point iteration has emerged as a widely adopted approach to minimize reconstruction errors at each inversion step. However, it suffers from high computational costs due to its iterative nature and the complexity of hyperparameter selection. To address these issues, we propose an iteration-free fixed-point estimator for diffusion inversion. First, we derive an explicit expression of the fixed point from an ideal inversion step. Unfortunately, it inherently contains an unknown data prediction error. Building upon this, we introduce the error approximation, which uses the calculable error from the previous inversion step to approximate the unknown error at the current inversion step. This yields a calculable, approximate expression for the fixed point, which is an unbiased estimator characterized by low variance, as shown by our theoretical analysis. We evaluate reconstruction performance on two text-image datasets, NOCAPS and MS-COCO. Compared to DDIM inversion and other inversion methods based on the fixed-point iteration, our method achieves consistent and superior performance in reconstruction tasks without additional iterations or training.
Similar Papers
Reconstruction-Free Anomaly Detection with Diffusion Models
CV and Pattern Recognition
Finds weird things in pictures much faster.
Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver
Machine Learning (CS)
Makes blurry pictures clear using smart guessing.
DiffIER: Optimizing Diffusion Models with Iterative Error Reduction
CV and Pattern Recognition
Makes AI pictures and sounds much better.