Polynomial Convergence of Riemannian Diffusion Models
By: Xingyu Xu, Ziyi Zhang, Yorie Nakahira and more
Potential Business Impact:
Makes AI create realistic images on curved surfaces.
Diffusion models have demonstrated remarkable empirical success in recent years and are considered among the state-of-the-art generative models in modern AI. These models consist of a forward process, which gradually diffuses the data distribution to a noise distribution spanning the whole space, and a backward process, which inverts this transformation to recover the data distribution from noise. Most of the existing literature assumes that the underlying space is Euclidean. However, in many practical applications, the data are constrained to lie on a submanifold of Euclidean space. Addressing this setting, De Bortoli et al. (2022) introduced Riemannian diffusion models and proved that using an exponentially small step size yields a small sampling error in the Wasserstein distance, provided the data distribution is smooth and strictly positive, and the score estimate is $L_\infty$-accurate. In this paper, we greatly strengthen this theory by establishing that, under an $L_2$-accurate score estimate, a polynomially small step size suffices to guarantee small sampling error in the total variation distance, without requiring smoothness or positivity of the data distribution. Our analysis only requires mild and standard curvature assumptions on the underlying manifold. The main ingredients in our analysis are the Li-Yau estimate for the log-gradient of the heat kernel and the Minakshisundaram-Pleijel parametrix expansion of the perturbed heat equation. Our approach opens the door to a sharper analysis of diffusion models on non-Euclidean spaces.
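To make the forward process concrete, here is a minimal sketch, not taken from the paper, of a discretized noising process on one specific manifold, the unit sphere in R^3. It assumes the forward process is Brownian motion (generator one half of the Laplace-Beltrami operator) and approximates it with a geodesic random walk. The function names, the choice of manifold, and the time scaling are all illustrative assumptions. The backward process, which would additionally require a learned score, is not shown.

```python
import numpy as np


def exp_map_sphere(x, v):
    """Exponential map on the unit sphere: follow the geodesic from x along tangent v."""
    norm = np.linalg.norm(v, axis=-1, keepdims=True)
    # Avoid division by zero when the tangent vector vanishes.
    safe = np.where(norm > 0, norm, 1.0)
    return np.cos(norm) * x + np.sin(norm) * v / safe


def geodesic_random_walk_step(x, step_size, rng):
    """One step of a geodesic random walk (assumed discretization of Brownian motion)."""
    z = rng.standard_normal(x.shape)
    # Project ambient Gaussian noise onto the tangent space at x.
    v = z - np.sum(z * x, axis=-1, keepdims=True) * x
    return exp_map_sphere(x, np.sqrt(step_size) * v)


def forward_process(x0, total_time, n_steps, seed=0):
    """Diffuse points x0 (shape [n, 3], unit norm) for total_time using n_steps steps."""
    rng = np.random.default_rng(seed)
    x = x0.copy()
    h = total_time / n_steps
    for _ in range(n_steps):
        x = geodesic_random_walk_step(x, h, rng)
    return x


if __name__ == "__main__":
    # Start every sample at the north pole, a maximally concentrated "data distribution".
    x0 = np.tile(np.array([0.0, 0.0, 1.0]), (2000, 1))
    xT = forward_process(x0, total_time=5.0, n_steps=200)
    print("max deviation from unit norm:", np.abs(np.linalg.norm(xT, axis=1) - 1.0).max())
    print("mean height after diffusion:", xT[:, 2].mean())
```

In this sketch the samples stay on the sphere at every step because each update moves along a geodesic. After a long enough horizon their mean height should be close to zero, consistent with the noise distribution being spread over the whole manifold.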
Similar Papers
Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance
Machine Learning (CS)
Makes AI create better pictures by fixing math.
Non-Asymptotic Convergence of Discrete Diffusion Models: Masked and Random Walk dynamics
Machine Learning (CS)
Makes computers create better pictures from scratch.
Be Tangential to Manifold: Discovering Riemannian Metric for Diffusion Models
CV and Pattern Recognition
Makes AI-generated pictures smoothly change.