Improved Central Limit Theorem and Bootstrap Approximations for Linear Stochastic Approximation
Butyrin, Moulines, Naumov et al.
In this paper, we refine the Berry-Esseen bounds for the multivariate normal approximation of Polyak-Ruppert averaged iterates arising from the linear stochastic approximation (LSA) algorithm with decreasing step size. We consider the normal approximation by the Gaussian distribution with covariance matrix predicted by the Polyak-Juditsky central limit theorem and establish the rate up to order $n^{-1/3}$ in convex distance, where $n$ is the number of samples used in the algorithm. We also prove a non-asymptotic validity of the multiplier bootstrap procedure for approximating the distribution of the rescaled error of the averaged LSA estimator. We establish approximation rates of order up to $1/\sqrt{n}$ for the latter distribution, which significantly improves upon the previous results obtained by Samsonov et al. (2024).
academic
Improved Central Limit Theorem and Bootstrap Approximations for Linear Stochastic Approximation
This paper improves the Berry-Esseen bounds for multivariate Gaussian approximation of Polyak-Ruppert averaged iterates in linear stochastic approximation (LSA) algorithms. The study establishes a convergence rate of order n−1/3 in the convex distance sense for Gaussian approximation with the covariance matrix predicted by the Polyak-Juditsky central limit theorem, where n denotes the number of samples used by the algorithm. Additionally, the paper proves the non-asymptotic validity of the multiplier bootstrap procedure for approximating the rescaled error distribution of the averaged LSA estimator, achieving an approximation rate of order 1/n, which significantly improves upon previous results by Samsonov et al. (2024).
Linear stochastic approximation (LSA) is a fundamental method in statistics and machine learning for approximating the unique solution to the linear system Aˉθ∗=bˉ, where Aˉ∈Rd×d is a non-singular matrix. The algorithm performs iterative updates based on an observed sequence {(A(Zk),b(Zk))}k∈N.
Distribution Approximation Accuracy: Existing Gaussian approximation results exhibit slow convergence rates, limiting the precision of confidence interval construction in practical applications
Covariance Matrix Estimation: The asymptotic covariance matrix Σ∞ is unknown in practice, requiring effective estimation and approximation methods
Bootstrap Validity: Traditional bootstrap methods face theoretical and practical challenges in online learning algorithms
Improved Moment Bounds: Establishes higher-order moment bounds for n(θˉn−θ∗), obtaining for p≥2:
E1/p[∥θˉn−θ∗∥p]≲npTrΣ∞+n5/6p3/2
Improved Berry-Esseen Bounds: Establishes a Gaussian approximation rate of order n−1/3 in the convex distance sense, improving upon the previous n−1/4 result
Non-asymptotic Analysis of Multiplier Bootstrap: Proves the validity of the bootstrap procedure with an approximation rate of n−1/2, significantly superior to existing results
Technical Innovation: By choosing an appropriate covariance matrix Σn rather than Σ∞ for approximation, avoids direct Gaussian comparison steps
Polyak, B. T., & Juditsky, A. B. (1992). Acceleration of stochastic approximation by averaging. SIAM journal on control and optimization.
Shao, Q. M., & Zhang, Z. S. (2022). Berry–Esseen bounds for multivariate nonlinear statistics with applications to M-estimators and stochastic gradient descent algorithms. Bernoulli.
Fang, Y., Xu, J., & Yang, L. (2018). Online bootstrap confidence intervals for the stochastic gradient descent estimator. Journal of Machine Learning Research.
Durmus, A., Moulines, E., Naumov, A., & Samsonov, S. (2025). Finite-time high-probability bounds for Polyak–Ruppert averaged iterates of linear stochastic approximation. Mathematics of Operations Research.