Optimal Transport for Machine Learners
By: Gabriel Peyré
Potential Business Impact:
Helps computers learn by comparing data.
Optimal Transport is a foundational mathematical theory that connects optimization, partial differential equations, and probability. It offers a powerful framework for comparing probability distributions and has recently become an important tool in machine learning, especially for designing and evaluating generative models. These course notes cover the fundamental mathematical aspects of OT, including the Monge and Kantorovich formulations, Brenier's theorem, the dual and dynamic formulations, the Bures metric on Gaussian distributions, and gradient flows. It also introduces numerical methods such as linear programming, semi-discrete solvers, and entropic regularization. Applications in machine learning include topics like training neural networks via gradient flows, token dynamics in transformers, and the structure of GANs and diffusion models. These notes focus primarily on mathematical content rather than deep learning techniques.
Similar Papers
Control, Optimal Transport and Neural Differential Equations in Supervised Learning
Numerical Analysis
Teaches computers to move data smoothly and efficiently.
Optimal and Diffusion Transports in Machine Learning
Optimization and Control
Makes AI learn and create better using math.
Optimal Transport-Based Generative Models for Bayesian Posterior Sampling
Computation
Creates better computer guesses about data.