This paper investigates the mixing times of the Projected Langevin Algorithm and privacy curves for noisy stochastic gradient descent (SGD), extending beyond the scope of non-expansive iterations. Specifically, the authors derive novel mixing time bounds for the Projected Langevin Algorithm, which in certain important cases are dimension-free and polylogarithmic in precision, closely matching existing results for the smooth convex case. Furthermore, the paper establishes new privacy curve upper bounds for subsampled noisy SGD algorithms. These bounds demonstrate critical dependence on gradient regularity and are applicable to a broad class of convex loss functions beyond the smooth case.
Importance of Sampling: Sampling from log-concave distributions π ∝ e^(-f) is a fundamental algorithmic problem and serves as a building block for volume estimation, optimization, Bayesian statistics, machine learning, and differential privacy.
Langevin Algorithm: The Langevin algorithm approximates the target distribution by discretizing the Langevin diffusion:
dL_t = -∇f(L_t)dt + √2dW_t
After discretization, this yields a Markov chain:
X_{t+1} = Π_X[X_t - η∇f(X_t) + √(2η)ξ_t]
Limitations of Existing Methods:
Most research focuses on strongly convex smooth potential function settings
Limited research on non-smooth convex potential functions
Privacy Amplification by Iteration (PABI) techniques are primarily restricted to non-expansive iterations
This paper aims to extend PABI techniques beyond non-expansive iterations by quantifying the regularity of underlying mappings through moduli of continuity, thereby handling a broader class of potential functions including non-differentiable functions.
Extended PABI Framework: Extends PABI techniques to general mappings with moduli of continuity, even accommodating discontinuous mappings.
Optimization Problem Design: Designs an optimization problem to obtain optimal Rényi divergence bounds for PABI applications, whose tractability is closely related to the modulus of continuity of the relevant gradient mapping.
Explicit Solutions: Proves that when the modulus of continuity has the form φ(δ) = √(cδ² + h), the optimization problem admits an exact explicit solution.
Mixing Time Bounds: Establishes novel mixing time bounds for convex and (p,M)-weakly smooth cases, which in certain scenarios are dimension-free and polylogarithmic in precision.
Privacy Curve Analysis: Establishes new privacy curve upper bounds for noisy SGD, demonstrating critical dependence on gradient regularity.
Lemma 3.2: Let X ⊆ ℝ^d be a closed convex set, and Φ have modulus of continuity φ. For random variables X, X' and Gaussian noise ξ ~ N(0, σ²I), let Y = Π_XΦ(X) + ξ, Y' = Π_XΦ(X') + ξ, then for any 0 < a ≤ φ(δ):
This is primarily a theoretical work establishing bounds through rigorous mathematical proofs rather than empirical experiments. The analysis includes:
Mixing Time Analysis: Evaluates convergence rates using total variation distance
Privacy Analysis: Employs the Rényi differential privacy framework
Optimization Analysis: Proves existence and uniqueness of optimal solutions to the shifted optimization problem
Altschuler, J. and Talwar, K. (2022, 2023). Privacy amplification by iteration
Feldman, V. et al. (2018). Privacy amplification by iteration
Dalalyan, A. (2017). Langevin Monte Carlo analysis
Bubeck, S. et al. (2018). Projected Langevin Monte Carlo
This paper makes important contributions to theoretical machine learning and differential privacy. Through innovative use of moduli of continuity, it successfully extends the applicability of PABI techniques and provides new theoretical tools for analyzing non-smooth optimization and privacy-preserving algorithms.