Enhancing diffusion models with Gaussianization preprocessing
By: Li Cunzhi, Louis Kang, Hideaki Shimazaki
Diffusion models are a class of generative models that have demonstrated remarkable success in tasks such as image generation. However, one of the bottlenecks of these models is slow sampling due to the delay before the onset of trajectory bifurcation, at which point substantial reconstruction begins. This issue degrades generation quality, especially in the early stages. Our primary objective is to mitigate bifurcation-related issues by preprocessing the training data to enhance reconstruction quality, particularly for small-scale network architectures. Specifically, we propose applying Gaussianization preprocessing to the training data to make the target distribution more closely resemble an independent Gaussian distribution, which serves as the initial density of the reconstruction process. This preprocessing step simplifies the model's task of learning the target distribution, thereby improving generation quality even in the early stages of reconstruction with small networks. The proposed method is, in principle, applicable to a broad range of generative tasks, enabling more stable and efficient sampling processes.
Similar Papers
Diffusion models for multivariate subsurface generation and efficient probabilistic inversion
CV and Pattern Recognition
Helps map underground resources faster and better.
Diffusion models for multivariate subsurface generation and efficient probabilistic inversion
CV and Pattern Recognition
Maps underground rock layers faster and better.
Guiding diffusion models to reconstruct flow fields from sparse data
Fluid Dynamics
Shows air movement details from few clues.