Score: 0

Improving the stability of the covariance-controlled adaptive Langevin thermostat for large-scale Bayesian sampling

Published: December 30, 2025 | arXiv ID: 2512.24515v1

By: Jiani Wei, Xiaocheng Shang

Stochastic gradient Langevin dynamics and its variants approximate the likelihood of an entire dataset, via random (and typically much smaller) subsets, in the setting of Bayesian sampling. Due to the (often substantial) improvement of the computational efficiency, they have been widely used in large-scale machine learning applications. It has been demonstrated that the so-called covariance-controlled adaptive Langevin (CCAdL) thermostat, which incorporates an additional term involving the covariance matrix of the noisy force, outperforms popular alternative methods. A moving average is used in CCAdL to estimate the covariance matrix of the noisy force, in which case the covariance matrix will converge to a constant matrix in long-time limit. Moreover, it appears in our numerical experiments that the use of a moving average could reduce the stability of the numerical integrators, thereby limiting the largest usable stepsize. In this article, we propose a modified CCAdL (i.e., mCCAdL) thermostat that uses the scaling part of the scaling and squaring method together with a truncated Taylor series approximation to the exponential to numerically approximate the exact solution to the subsystem involving the additional term proposed in CCAdL. We also propose a symmetric splitting method for mCCAdL, instead of an Euler-type discretisation used in the original CCAdL thermostat. We demonstrate in our numerical experiments that the newly proposed mCCAdL thermostat achieves a substantial improvement in the numerical stability over the original CCAdL thermostat, while significantly outperforming popular alternative stochastic gradient methods in terms of the numerical accuracy for large-scale machine learning applications.

Adaptive Stepsizing for Stochastic Gradient Langevin Dynamics in Bayesian Neural Networks

Machine Learning (CS)

Makes computer learning more accurate and stable.

11 Nov 2025 0

88%

Adaptive Stepsizing for Stochastic Gradient Langevin Dynamics in Bayesian Neural Networks

Machine Learning (CS)

Makes computer learning more accurate and stable.

11 Nov 2025 0

88%

A Langevin sampling algorithm inspired by the Adam optimizer

Computation

Helps computers learn faster and more accurately.

26 Apr 2025 1

View PDF Login to Bookmark

Improving the stability of the covariance-controlled adaptive Langevin thermostat for large-scale Bayesian sampling

Technical Abstract

Adaptive Stepsizing for Stochastic Gradient Langevin Dynamics in Bayesian Neural Networks

Adaptive Stepsizing for Stochastic Gradient Langevin Dynamics in Bayesian Neural Networks

A Langevin sampling algorithm inspired by the Adam optimizer