Schrödinger bridge for generative AI: Soft-constrained formulation and convergence analysis
By: Jin Ma, Ying Tan, Renyuan Xu
Potential Business Impact:
Makes AI create better pictures and music.
Generative AI can be framed as the problem of learning a model that maps simple reference measures into complex data distributions, and it has recently found a strong connection to the classical theory of the Schrödinger bridge problems (SBPs) due partly to their common nature of interpolating between prescribed marginals via entropy-regularized stochastic dynamics. However, the classical SBP enforces hard terminal constraints, which often leads to instability in practical implementations, especially in high-dimensional or data-scarce regimes. To address this challenge, we follow the idea of the so-called soft-constrained Schrödinger bridge problem (SCSBP), in which the terminal constraint is replaced by a general penalty function. This relaxation leads to a more flexible stochastic control formulation of McKean-Vlasov type. We establish the existence of optimal solutions for all penalty levels and prove that, as the penalty grows, both the controls and value functions converge to those of the classical SBP at a linear rate. Our analysis builds on Doob's h-transform representations, the stability results of Schrödinger potentials, Gamma-convergence, and a novel fixed-point argument that couples an optimization problem over the space of measures with an auxiliary entropic optimal transport problem. These results not only provide the first quantitative convergence guarantees for soft-constrained bridges but also shed light on how penalty regularization enables robust generative modeling, fine-tuning, and transfer learning.
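To make the soft-constraint idea concrete, here is a minimal numerical sketch. It is not the paper's method: it uses a static, discrete analogue (entropic optimal transport on a 1-D grid) and assumes a KL penalty on the terminal marginal as one particular choice of penalty function. The function name `soft_bridge`, the grid, the Gaussian-shaped marginals, and the values of `eps` and `lam` are all illustrative assumptions. The update rule is the standard scaling iteration for entropic transport with one hard and one KL-relaxed marginal; as `lam` grows, the exponent `lam / (lam + eps)` tends to 1 and the iteration reduces to the usual Sinkhorn update with a hard terminal constraint.

```python
import numpy as np

def soft_bridge(mu, nu, C, eps, lam, n_iter=1000):
    """Discrete entropic coupling with a hard initial marginal `mu`
    and a KL-penalized (soft) terminal marginal `nu`.

    Illustrative assumption: the penalty is lam * KL(terminal marginal | nu).
    """
    K = np.exp(-C / eps)          # Gibbs kernel of the reference dynamics
    rho = lam / (lam + eps)       # soft-constraint exponent, tends to 1 as lam grows
    u = np.ones_like(mu)
    v = np.ones_like(nu)
    for _ in range(n_iter):
        v = (nu / (K.T @ u)) ** rho   # relaxed terminal-marginal update
        u = mu / (K @ v)              # exact initial-marginal update
    return u[:, None] * K * v[None, :]

# Hypothetical toy problem: two bumps on a grid in [0, 1].
x = np.linspace(0.0, 1.0, 20)
C = (x[:, None] - x[None, :]) ** 2
mu = np.exp(-0.5 * ((x - 0.25) / 0.15) ** 2)
mu /= mu.sum()
nu = np.exp(-0.5 * ((x - 0.75) / 0.15) ** 2)
nu /= nu.sum()

eps = 0.2
lams = [0.1, 1.0, 10.0, 100.0]
gaps = []
for lam in lams:
    pi = soft_bridge(mu, nu, C, eps, lam)
    # L1 distance between the achieved terminal marginal and the target.
    gaps.append(np.abs(pi.sum(axis=0) - nu).sum())

for lam, gap in zip(lams, gaps):
    print(f"lam={lam:7.1f}  terminal-marginal gap={gap:.4e}")
```

Printing the gap for increasing `lam` gives a rough empirical view of how the soft-constrained solution approaches the hard-constrained one in this toy setting. It does not reproduce the dynamic McKean-Vlasov control formulation or the rate analysis described in the abstract.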
Similar Papers
Schrödinger bridge for generative AI: Soft-constrained formulation and convergence analysis
Machine Learning (CS)
Makes AI create better pictures and sounds.
A Closed-Form Framework for Schrödinger Bridges Between Arbitrary Densities
Computation
Makes computers create realistic images from noise.
Regularized Schrödinger Bridge: Alleviating Distortion and Exposure Bias in Solving Inverse Problems
Machine Learning (CS)
Fixes blurry sounds and makes them clear.