2025-11-22T03:37:15.627362

Nonasymptotic Analysis of Accelerated Methods With Inexact Oracle Under Absolute Error Bound

Liu, Tajbakhsh

Performance analysis of first-order algorithms with inexact oracles has gained recent attention due to various emerging applications in which obtaining exact gradients is impossible or computationally expensive. Previous research has demonstrated that the performance of accelerated first-order methods is more sensitive to gradient errors compared with non-accelerated ones. This paper investigates the nonasymptotic convergence bound of two accelerated methods with inexact gradients to solve deterministic smooth convex problems. Performance Estimation Problem (PEP) is used as the primary tool to analyze the convergence bounds of the underlying algorithms. By finding an analytical solution to PEP, we derive novel convergence bounds of Generalized Optimized Gradient Method (GOGM) and Generalized Fast Gradient Method (GFGM) with inexact gradient oracles following the absolute error bound. The derived bounds allow varying oracle inexactness along the iterations; furthermore, their accumulated error terms are independent of the initial condition and any unknown parameters. Furthermore, we analyze the tradeoff between the vanishing term and the accumulated error in the convergence bound that guides finding the optimal stepsize. Finally, we determine the optimal strategy to set the gradient inexactness along iterations (if possible in a given application), ensuring that the accumulated error remains subordinate to the vanishing term.

academic

Nonasymptotic Analysis of Accelerated Methods With Inexact Oracle Under Absolute Error Bound

Basic Information

Paper ID: 2408.00720
Title: Nonasymptotic Analysis of Accelerated Methods With Inexact Oracle Under Absolute Error Bound
Authors: Yin Liu (Peking University), Sam Davanloo Tajbakhsh (Ohio State University)
Classification: math.OC (Optimization and Control)
Publication Date: October 15, 2025
Paper Link: https://arxiv.org/abs/2408.00720

Abstract

This paper investigates nonasymptotic convergence bounds for accelerated first-order methods under inexact gradient oracles. Since obtaining exact gradients is infeasible or computationally expensive in many emerging applications, performance analysis of first-order algorithms under inexact oracles has received widespread attention. Previous research has shown that accelerated first-order methods are more sensitive to gradient errors compared to non-accelerated methods. This paper employs the Performance Estimation Problem (PEP) as the primary analytical tool, and by finding analytical solutions to the PEP, derives novel convergence bounds for the Generalized Optimized Gradient Method (GOGM) and Generalized Fast Gradient Method (GFGM) under absolute error bounds.

Research Background and Motivation

Problem Definition

This paper considers the optimization problem:

min_{x∈R^d} f(x)

where f is a convex function with Lipschitz continuous gradient. In practical applications, only inexact gradient estimates are available:

∇̃f(x) = ∇f(x) + e, ||e|| ≤ b

Research Motivation

Practical Requirements: In bilevel optimization, combinatorial optimization, and zeroth-order optimization, obtaining exact gradients is difficult or computationally expensive
Theoretical Gaps: Existing analyses rely on strong assumptions (e.g., bounded feasible sets) or contain non-quantifiable terms
Method Limitations: Accelerated methods are more sensitive to gradient errors, requiring more refined analysis

Application Scenarios

Bilevel Optimization: Lower-level problems can only be solved to suboptimal solutions
Combinatorial Optimization: Gradient estimation through mini-batch sampling
Zeroth-Order Optimization: Gradient estimation via finite differences or Gaussian smoothing

Core Contributions

Theoretical Breakthrough: Derives quantified convergence bounds for iGOGM and iGFGM under absolute error (AE) conditions, eliminating dependence on unknown parameters
Methodological Innovation: Finds analytical feasible solutions through PEP techniques, providing cumulative error terms independent of initial conditions
Practical Guidance: Analyzes the trade-off between convergence rate and cumulative error, providing optimal step-size selection strategies
Optimal Scheduling: Determines optimal strategies for setting gradient inexactness, minimizing total computational cost

Methodology Details

Task Definition

Study convergence of accelerated gradient methods under absolute error conditions:

Input: Inexact gradient oracle satisfying ||∇̃f(x) - ∇f(x)|| ≤ b_x
Output: Upper bound on convergence error f(x_K) - f*
Constraint: f ∈ F_{0,L} (convex and L-smooth only)

Core Algorithms

iGOGM Algorithm (Algorithm 2)

Input: z_0 = x_0, A_0 = α_0 = 1, step-size parameters {λ_k}
for k = 0 to K-1:
    y_{k+1} = x_k - (1/L)∇̃f(x_k)
    z_{k+1} = z_k - (2/L)α_k∇̃f(x_k)  # Key: coefficient is 2
    α_{k+1} = (λ_{k+1} + √(4λ_{k+1}A_k + λ²_{k+1}))/2
    A_{k+1} = A_k + α_{k+1}
    x_{k+1} = (1 - α_{k+1}/A_{k+1})y_{k+1} + (α_{k+1}/A_{k+1})z_{k+1}

iGFGM Algorithm (Algorithm 1)

Similar to iGOGM, but with coefficient 1 instead of 2 in step 3.

Technical Innovations

1. Analytical PEP Solutions

Through the dual form of the performance estimation problem, analytical feasible solutions are found:

τ = L/(4A_K), v_{i,i+1} = A_i/A_K
u_i = [A_i(1+2α_{i+1})(A_i+2α_iα_{i+1})]/(4LA_K(A_{i+1}-α²_{i+1})) + Σ_{k=i+1}^{K-1} [A_k(1+2α_{k+1})α_iα_{k+1}]/(2LA_K(A_{k+1}-α²_{k+1}))

2. Matrix Decomposition Techniques

Leveraging Schur complement and diagonally dominant matrix properties to ensure positive semi-definite constraints:

Constructing Gram matrix G = X^T X
Handling PSD constraints through block matrix decomposition
Proving that u_i satisfies feasibility conditions

Main Theoretical Results

Theorem 2.2 (iGOGM Convergence Bound)

f(x_K) - f* - (1/2L)||∇f(x_K)||² ≤ L||x_0-x*||²/(4A_K) + Σ_{k=0}^{K-1} u_k||e_k||²

Theorem 2.4 (iGFGM Convergence Bound)

f(x_K) - f* - (1/2L)||∇f(x_K)||² ≤ L||x_0-x*||²/(2A_K) + Σ_{k=0}^{K-1} u_k||e_k||²

Key Characteristics:

Cumulative error term is independent of initial condition ||x_0 - x*||
Allows error levels b_k to vary along iterations
Coefficients u_k can be computed explicitly

Experimental Setup and Results

Numerical Verification

Solution Verification: Validates analytical solutions through numerical solving with relative error < 10^{-3}
Trade-off Analysis: Analyzes convergence rate versus cumulative error trade-off for OGM-a (α_i = (i+a)/a)
Optimal Scheduling: Compares total complexity of fixed error versus optimally varying error

Key Findings

When a=4, cumulative error is on the order of O(b²K³/L)
Increasing a reduces cumulative error but decreases convergence rate
Optimal error scheduling can significantly reduce total η-complexity

Error Condition	Allow Varying Error?	Iteration Complexity	Limitations
BIE 8	×	O(1/A_K + δ)	Bounded feasible set
IFO 12	✓	O(1/A_K + Σδ_i/A_K)	Bounded feasible set
IFO-q 41	×	O(1/K² + δ/K^{3q/2-1})	Subgradient condition
AE 58	×	O(1/K² + R̃_Kδ + Kδ²)	Unknown R̃_K
This Work	✓	O(1/A_K + Σu_kb_k²)	No additional assumptions

Optimal Error Scheduling

Optimization Problem

min Σ_{k=0}^{K-1} η_k = Σ_{k=0}^{K-1} h^{-1}(b_k)
s.t. Σ_{k=0}^{K-1} u_kb_k² ≤ LR²/(4A_K)

Power-Law Decay Case

For h(η) = c₁η^{-c₂}, the optimal solution is:

b_k* = √(LR²/(4A_K)) · u_k^{1/(1+2c₂)} / (Σu_j^{1/(1+2c₂)})^{1/2}

Exponential Decay Case

For h(η) = q₁q₂^{-η}, the optimal solution is:

b_k* = √(LR²/(4KA_Ku_k))

Conclusions and Discussion

Main Conclusions

First to provide quantified, parameter-independent convergence bounds for accelerated methods under absolute error conditions
Cumulative error term is independent of initial conditions, depending only on Lipschitz constant and step-sizes
Optimal error scheduling can significantly reduce total computational complexity

Limitations

Theoretical results require α_k² < A_k (strict inequality), excluding the standard FGM case where α_k² = A_k
Optimal algorithm step-sizes lack recursive structure, making practical implementation difficult
Analysis primarily addresses deterministic settings; stochastic cases require further investigation

Future Directions

Extension to strongly convex cases and stochastic settings
Investigation of more general error conditions
Development of practical adaptive error control strategies

In-Depth Evaluation

Strengths

Significant Theoretical Contribution: Fills the gap in analyzing accelerated methods under absolute error conditions
Advanced Technical Approach: Innovative application of PEP techniques
Strong Practical Utility: Provides computable error bounds and optimal scheduling strategies
Comprehensive Analysis: Combines theoretical proofs with numerical verification

Weaknesses

Limited Practical Applicability: Non-recursive nature of optimal algorithms restricts practical application
Restrictive Conditions: Strict α_k² < A_k condition excludes some important cases
Insufficient Experiments: Lacks numerical experiments on practical optimization problems

Impact

This paper provides important theoretical foundations for accelerated optimization under inexact oracles, with guidance for applications in bilevel optimization, combinatorial optimization, and related fields. The application of PEP techniques also provides new methodological approaches for related analyses.

Applicable Scenarios

Large-scale optimization problems where gradient computation is expensive
Bilevel and combinatorial optimization problems
Applications requiring trade-offs between computational precision and iteration count
Theoretical analysis of zeroth-order optimization methods

References

Key references include Drori & Teboulle's PEP theoretical foundations 18, Kim & Fessler's OGM methods 32,33, and Devolder et al.'s inexact oracle analysis 12.