JOURNAL ARTICLE

Bridging the gap between constant step size stochastic gradient descent and Markov chains

Aymeric DieuleveutAlain DurmusFrancis Bach

Year: 2020 Journal:   The Annals of Statistics Vol: 48 (3)   Publisher: Institute of Mathematical Statistics

Abstract

We consider the minimization of a strongly convex objective function given access to unbiased estimates of its gradient through stochastic gradient descent (SGD) with constant step size. While the detailed analysis was only performed for quadratic functions, we provide an explicit asymptotic expansion of the moments of the averaged SGD iterates that outlines the dependence on initial conditions, the effect of noise and the step size, as well as the lack of convergence in the general (nonquadratic) case. For this analysis we bring tools from Markov chain theory into the analysis of stochastic gradient. We then show that Richardson–Romberg extrapolation may be used to get closer to the global optimum, and we show empirical improvements of the new extrapolation scheme.

Keywords:
Mathematics Markov chain Applied mathematics Extrapolation Iterated function Stochastic gradient descent Constant (computer programming) Convex function Gradient descent Quadratic equation Mathematical optimization Regular polygon Mathematical analysis Statistics

Metrics

73
Cited By
11.80
FWCI (Field Weighted Citation Impact)
71
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Markov Chains and Monte Carlo Methods
Physical Sciences →  Mathematics →  Statistics and Probability
Stochastic Gradient Optimization Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Sparse and Compressive Sensing Techniques
Physical Sciences →  Engineering →  Computational Mechanics

Related Documents

JOURNAL ARTICLE

Convergence of Markov Chains for Constant Step-Size Stochastic Gradient Descent with Separable Functions

David ShirokoffPhilip Zaleski

Journal:   SIAM Journal on Applied Dynamical Systems Year: 2025 Vol: 24 (3)Pages: 2005-2043
JOURNAL ARTICLE

New logarithmic step size for stochastic gradient descent

Mahsa Soheil ShamaeeSajad Fathi HafshejaniZeinab Saeidian

Journal:   Frontiers of Computer Science Year: 2024 Vol: 19 (1)
JOURNAL ARTICLE

Stochastic Gradient Descent with Preconditioned Polyak Step-Size

Farshed AbdukhakimovCheng XiangDmitry KamzolovMartin Takáč

Journal:   Computational Mathematics and Mathematical Physics Year: 2024 Vol: 64 (4)Pages: 621-634
JOURNAL ARTICLE

Stochastic Gradient Descent with Pre-Conditioned Polyak Step-Size

Farshed AbdukhakimovCaiping XiangDmitry KamzolovMartin Takáč

Journal:   Журнал вычислительной математики и математической физики Year: 2024 Vol: 64 (4)Pages: 575-586
JOURNAL ARTICLE

Accelerated stochastic gradient descent with step size selection rules

Zhuang YangCheng WangZhemin ZhangJonathan Li

Journal:   Signal Processing Year: 2019 Vol: 159 Pages: 171-186
© 2026 ScienceGate Book Chapters — All rights reserved.