Score: 0

Stopping Rules for Stochastic Gradient Descent via Anytime-Valid Confidence Sequences

Published: December 15, 2025 | arXiv ID: 2512.13123v1

By: Liviu Aolaritei, Michael I. Jordan

We study stopping rules for stochastic gradient descent (SGD) for convex optimization from the perspective of anytime-valid confidence sequences. Classical analyses of SGD provide convergence guarantees in expectation or at a fixed horizon, but offer no statistically valid way to assess, at an arbitrary time, how close the current iterate is to the optimum. We develop an anytime-valid, data-dependent upper confidence sequence for the weighted average suboptimality of projected SGD, constructed via nonnegative supermartingales and requiring no smoothness or strong convexity. This confidence sequence yields a simple stopping rule that is provably $\varepsilon$-optimal with probability at least $1-α$ and is almost surely finite under standard stochastic approximation stepsizes. To the best of our knowledge, these are the first rigorous, time-uniform performance guarantees and finite-time $\varepsilon$-optimality certificates for projected SGD with general convex objectives, based solely on observable trajectory quantities.

Stochastic Gradient Descent in Non-Convex Problems: Asymptotic Convergence with Relaxed Step-Size via Stopping Time Methods

Machine Learning (CS)

Makes computer learning work with simpler rules.

17 Apr 2025 0

88%

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential

Optimization and Control

Makes AI learn faster without needing extra tricks.

3 Oct 2025 0

87%

Towards Continuous-Time Approximations for Stochastic Gradient Descent without Replacement

Machine Learning (CS)

Makes computer learning faster and more reliable.

4 Dec 2025 0

View PDF Login to Bookmark

Stopping Rules for Stochastic Gradient Descent via Anytime-Valid Confidence Sequences

Technical Abstract

Stochastic Gradient Descent in Non-Convex Problems: Asymptotic Convergence with Relaxed Step-Size via Stopping Time Methods

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential

Towards Continuous-Time Approximations for Stochastic Gradient Descent without Replacement