In this paper, we propose a novel extrapolation coefficient scheme within a new extrapolation term and develop an accelerated proximal gradient algorithm. We establish that the algorithm achieves a sublinear convergence rate. The proposed scheme only requires the Lipschitz constant estimate sequence to satisfy mild initial conditions, under which a key equality property can be derived to support the convergence analysis. Numerical experiments are provided to demonstrate the effectiveness and practical performance of the proposed method.
- Paper ID: 2507.06737
- Title: Fast Accelerated Proximal Gradient Method with New Extrapolation Term for Multiobjective Optimization
- Author: Huang Chengzhi
- Classification: math.OC (Optimization and Control)
- Publication Date: October 17, 2025
- Paper Link: https://arxiv.org/abs/2507.06737
This paper proposes a novel extrapolation coefficient scheme and extrapolation term, and develops an accelerated proximal gradient algorithm. The algorithm achieves sublinear convergence rates. The proposed scheme only requires the sequence of Lipschitz constant estimates to satisfy mild initial conditions, under which key equality properties can be derived to support convergence analysis. Numerical experiments validate the effectiveness and practical performance of the proposed method.
- Problem to be Solved: Multiobjective optimization problems, particularly composite unconstrained multiobjective optimization problems:
minx∈RnF(x)≡(f1(x)+g1(x),…,fm(x)+gm(x))T
where fi are smooth convex functions and gi are convex functions (possibly nonsmooth).
- Problem Significance: Multiobjective optimization is ubiquitous in practical applications, such as image restoration and compressed sensing. Such problems typically do not have a single optimal solution but rather a solution set consisting of Pareto optimal solutions.
- Limitations of Existing Methods:
- Tanabe et al. extended FISTA to multiobjective optimization, achieving O(1/k2) convergence rate
- Work by Sonntag et al. and Zhang et al. suffers from incomplete theoretical proofs, with convergence analysis depending on the nonnegativity of the auxiliary function σ(z)=mini=1,…,mFi(xk)−Fi(z), which is difficult to guarantee
- Research Motivation: To overcome defects in theoretical analysis of existing methods, propose methods with milder requirements on initial Lipschitz constant estimates, and avoid dependence on the nonnegativity of σ through key equalities.
- Proposes a New Extrapolation Term Scheme: Employs the extrapolation form yk=xk+k+α−1k+α−4(xk−xk−1), where α≥3
- Establishes Mild Initial Conditions: Only requires the sequence of Lipschitz constant estimates to satisfy weaker initial conditions
- Derives Key Equality Properties: Avoids dependence on the nonnegativity of auxiliary functions, perfecting the theoretical analysis
- Proves Sublinear Convergence Rate: Achieves O(1/k2) convergence rate in the smooth case and O(1/k) in the nonsmooth case
- Extends to Nonsmooth Case: Handles completely nonsmooth multiobjective optimization problems through smoothing techniques
Consider the composite unconstrained multiobjective optimization problem (MOP):
minx∈RnF(x)=(f1(x)+g1(x),…,fm(x)+gm(x))T
where:
- fi:Rn→R are continuously differentiable convex functions
- gi:Rn→R are convex functions (possibly nonsmooth)
- The objective is to find weakly Pareto optimal solutions
Core Subproblem:
minz∈RnϕL(f)(z;x,y)=maxi=1,…,m[⟨∇fi(y),z−y⟩+gi(z)+fi(y)−Fi(x)]+2L(f)∥z−y∥2
Algorithm Steps:
- Compute extrapolation point: yk=xk+k+α−1k+α−4(xk−xk−1)
- Solve subproblem: xk+1=psk(xk,yk)
- Update parameters: sk+1=ηsk, where η=(k+α−1)(k+α−3)(k+α−2)2
Parameter Conditions:
- When α>3: 0<α−3α−2s0<L(f)1
- When α=3: 0<s0<L(f)1
Approximates nonsmooth functions fi(x) through smoothing functions f~i(x,μ), where the smoothing function satisfies:
- Continuous Differentiability: For fixed μ>0, f~(⋅,μ) is continuously differentiable
- Consistency: limz→x,μ↓0f~(z,μ)=f(x)
- Gradient Consistency: {limz→x,μ↓0∇f~(z,μ)}⊆∂f(x)
- Novel Extrapolation Coefficient Design: Through specific parameter update scheme η=(k+α−1)(k+α−3)(k+α−2)2, ensures sk<L(f)1 holds consistently
- Key Equality Derivation: Through clever algebraic manipulation and parameter selection, avoids dependence on the nonnegativity of σk(z)
- Unified Framework: Degenerates to existing methods when α=3, but provides more complete theoretical analysis
The paper mentions numerical experiments on three tri-objective optimization problems:
- BK1&ℓ1 problem
- JOS1&ℓ1 problem
- SP1&ℓ1 problem
Uses merit function u0(x)=supz∈Rnmini=1,…,m[Fi(x)−Fi(z)] to evaluate algorithm performance, which satisfies:
- u0(x)≥0 for all x
- x is weakly Pareto optimal if and only if u0(x)=0
- Stopping criterion: ∥xk−xk+1∥<ε
- For nonsmooth case also requires μk<ε
- Parameter updates: μk+1=k+α−1k+α−2μk, sk+1=k+α−3k+α−2sk
The paper presents Pareto frontier plots for three tri-objective optimization problems, though specific numerical results and detailed performance comparison data are incomplete in the provided document.
Smooth Case (Theorem 4.3):
u0(xk)≤2(k+α−1)2L(f)(α−1)2R
Achieves O(1/k2) convergence rate.
Nonsmooth Case (Theorem 6.2):
u0(xk+1)≤O(k1)
Achieves O(1/k) convergence rate.
- Multiobjective FISTA Extension: Tanabe et al. first extended FISTA to multiobjective optimization, achieving O(1/k2) convergence rate
- Monotone Variants: Nishimura et al. proposed monotone variants of multiobjective FISTA
- Generalized Framework: Tanabe et al. generalized the framework to single-objective cases through hyperparameter introduction
- Nesterov-type Schemes: Sonntag et al. and Zhang et al. attempted to use more effective extrapolation terms, but with incomplete theoretical analysis
- Nonsmooth Methods: Gebken et al. proposed subgradient descent algorithms for nonsmooth multiobjective optimization
- Proposes an accelerated proximal gradient method with novel extrapolation term applicable to smooth and nonsmooth multiobjective optimization
- Establishes complete convergence theory, avoiding theoretical defects of existing methods
- Achieves O(1/k2) convergence rate in smooth case and O(1/k) in nonsmooth case
- Insufficient Experimental Validation: Numerical experiment results are incomplete, lacking detailed performance comparisons
- Parameter Selection Constraints: Specific requirements on initial parameters s0 and α
- Slower Convergence in Nonsmooth Case: Compared to the smooth case, the nonsmooth version has reduced convergence rate of O(1/k)
- Explore better smoothing techniques to improve convergence rate in nonsmooth case
- Investigate adaptive parameter selection strategies
- Extend to constrained multiobjective optimization problems
- Significant Theoretical Contribution: Resolves key defects in theoretical analysis of existing methods, providing complete convergence proofs
- Clever Method Design: Through specific parameter update strategies ensures theoretical guarantees of the algorithm
- Framework Unification: Incorporates both smooth and nonsmooth cases into a unified framework
- Mathematical Rigor: Detailed proofs with clear logical structure
- Insufficient Experimental Verification: Numerical experiments are overly simplistic, lacking detailed comparisons with other advanced methods
- Lack of Practical Analysis: Missing in-depth analysis of computational complexity and practical application scenarios
- Parameter Sensitivity Not Discussed: No analysis of parameter selection impact on algorithm performance
- High Theoretical Value: Provides more complete theoretical foundation for accelerated methods in multiobjective optimization
- Practical Value Pending Verification: Requires more experimental validation of effectiveness in real-world problems
- Good Reproducibility: Clear algorithm description and complete theoretical analysis
- Composite structure multiobjective optimization problems
- Application domains such as image processing and compressed sensing
- Optimization scenarios requiring theoretical guarantees
The paper cites important literature in multiobjective optimization, including:
- Pioneering work by Tanabe et al. on multiobjective FISTA
- Related theory of Nesterov acceleration methods
- Literature on smoothing techniques
- Classical theoretical foundations of multiobjective optimization
Overall Assessment: This is a theoretically strong paper that successfully addresses theoretical defects in existing multiobjective accelerated proximal gradient methods, providing complete convergence analysis. However, the paper has room for improvement in experimental validation and practical analysis.