In this paper, we study saddle point (SP) problems, focusing on convex-concave optimization involving functions that satisfy either two-sided quadratic functional growth (QFG) or two-sided quadratic gradient growth (QGG)--novel conditions tailored specifically for SP problems as extensions of quadratic growth conditions in minimization. These conditions relax the traditional requirement of strong convexity-strong concavity, thereby encompassing a broader class of problems. We propose a generalized accelerated primal-dual (GAPD) algorithm to solve SP problems with non-bilinear objective functions, unifying and extending existing methods. We prove that our method achieves a linear convergence rate under these relaxed conditions. Additionally, we provide examples of structured SP problems that satisfy either two-sided QFG or QGG, demonstrating the practical applicability and relevance of our approach.
- Paper ID: 2510.11990
- Title: Linear Convergence of a Unified Primal--Dual Algorithm for Convex--Concave Saddle Point Problems with Quadratic Growth
- Authors: Cody Melcher (University of Arizona), Afrooz Jalilzadeh (University of Arizona), Erfan Yazdandoost Hamedani (University of Arizona)
- Classification: math.OC (Optimization and Control)
- Publication Date: October 13, 2025 (arXiv preprint)
- Paper Link: https://arxiv.org/abs/2510.11990
This paper investigates saddle point (SP) problems, with particular focus on convex-concave optimization problems satisfying two-sided quadratic function growth (QFG) or two-sided quadratic gradient growth (QGG) conditions. These conditions are newly tailored for saddle point problems and extend the quadratic growth conditions from minimization problems. These conditions relax traditional strong convexity-strong concavity requirements, thereby encompassing a broader class of problems. The authors propose a generalized accelerated primal-dual (GAPD) algorithm to solve saddle point problems with non-bilinear objective functions, unifying and extending existing methods. Linear convergence rates are established under these relaxed conditions. Furthermore, concrete examples of structured saddle point problems satisfying two-sided QFG or QGG are provided, demonstrating the practical applicability and relevance of the proposed approach.
This paper studies the following saddle point problem:
minx∈Xmaxy∈Yf(x,y)
where f:X×Y→R is convex in x for any y∈Y and concave in y for any x∈X, with X⊆X and Y⊆Y being closed convex sets.
- Limitations of Traditional Methods: Existing linear convergence results for saddle point problems typically require strong convexity-strong concavity conditions, which are overly restrictive in many practical applications.
- Broad Applicability: Saddle point problems have important applications in game theory, distributionally robust learning, generative adversarial networks, and other fields.
- Theoretical Gap: While quadratic growth conditions (QFG and QGG) have been proven to guarantee linear convergence in minimization problems, extending these conditions to saddle point problems is a non-trivial challenge and remains largely unexplored.
- Method Unification: Existing primal-dual methods such as APD and OGDA lack a unified analytical framework.
- Two-Sided Growth Conditions: First extends QFG and QGG conditions to saddle point problems, defining two-sided quadratic function growth and two-sided quadratic gradient growth conditions.
- Unified Algorithm Framework: Proposes a generalized accelerated primal-dual (GAPD) algorithm that unifies existing APD and OGDA methods.
- Linear Convergence Guarantee: Establishes linear convergence rates for the GAPD algorithm under two-sided QFG or QGG conditions.
- Bregman Distance Extension: Extends the analytical framework to Bregman distances, enhancing method flexibility and applicability.
- Structured Problem Classes: Provides concrete examples of structured saddle point problems satisfying two-sided growth conditions.
Study convex-concave saddle point optimization problems where the objective function satisfies two-sided quadratic growth conditions rather than traditional strong convexity-strong concavity conditions.
For a saddle point problem, if there exist constants (μx,μy)∈R++2 such that for any x∈X and y∈Y:
⟨F(z)−F(zˉ),z−zˉ⟩≥2DZM(z,zˉ)
where z=[xT,yT]T, zˉ=PZ∗(z), F(z)=[∇xf(x,y)T,−∇yf(x,y)T]T, M=diag({μxIn,μyIm}).
If there exist constants (μx,μy)∈R++2 such that:
f(x,yˉ)−f(xˉ,y)≥DZM(z,zˉ)
The core update rules of the GAPD algorithm are:
- Momentum Term Computation:
- qky=∇yf(xk,yk)−∇yf(xk−1,yk−1)
- qkx=∇xf(xk,yk)−∇xf(xk−1,yk−1)
- Dual Variable Update:
yk+1=argminy∈Y{−⟨∇yf(xk,yk)+αkqky,y⟩+σk1DY(y,yk)}
- Aggregated Gradient Construction:
sk=θk∇xf(xk,yk+1)+(1−θk)∇xf(xk,yk)+βkqkx
- Primal Variable Update:
xk+1=argminx∈X{⟨sk,x⟩+τk1DX(x,xk)}
- Unification: Unifies existing methods through parameter θk:
- θk=0: Reduces to OGDA
- θk=1,βk=0: Reduces to APD
- Bregman Distance: Uses Bregman distance instead of Euclidean distance, providing greater flexibility.
- Two-Sided Conditions: First extends one-sided growth conditions to two-sided versions for saddle point problems.
Theorem 4.4: Let {(xk,yk)}k≥0 be the sequence generated by Algorithm 1. Assume Assumptions 2.1-4.3 hold. Then for any K≥1 and Γ≻0:
DZAK−ΓBK(zˉK,zK)≤tKt0DZA0(zˉ0,z0)
Corollary 4.5: Under appropriate parameter choices, the iterative sequence converges to the optimal solution set at a linear rate:
DZ(zˉK,zK)≤DZRK(zˉ0,z0)
where RK=(1−α)cMαK+1, and the convergence rate depends on parameter ς>0 (with ς=θ for QFG and ς=2(1−θ) for QGG).
Consider the following structured convex-concave saddle point problem:
minx∈Xmaxy∈Yh(C1x)+⟨Ax,y⟩−g(C2y)
where h:Rp→R and g:Rq→R are strongly convex functions.
Proposition 5.1: If there exist constants ξ1,ξ2,ξ3,ξ4>0 such that:
- ξ1C1TC1⪰ATA, ξ2C1TC1⪰∥λ∗∥2GTG
- ξ3C2TC2⪰AAT, ξ4C2TC2⪰∥ν∗∥2FTF
then this problem class satisfies two-sided QGG and QFG conditions.
Consider randomly generated saddle point problems:
minx∈Rnmaxy∈Rm21∥C1x−b1∥22+⟨Ax,y⟩−21∥C2y−b2∥22
- Dimension Testing: Tests conducted on three different dimensions (n,m,p,q)∈{(75,60,60,50),(150,120,120,100),(300,240,240,200)}.
- Performance Comparison: GAPD outperforms standard GDA methods across different values of θ.
- Parameter Impact: θ=0.99 achieves the best performance, slightly outperforming the case θ=1.
- QFG and QGG conditions are important in both deterministic and stochastic optimization settings
- Existing work primarily focuses on linear convergence in convex optimization problems
- Arrow-Hurwicz method (GDA): O(κ2log(1/ε)) complexity
- Extragradient method (EG): O(κlog(1/ε)) complexity
- Optimistic gradient method (OGDA): O(κlog(1/ε)) complexity
- Accelerated primal-dual method (APD): Achieves O(1/ε) and O(1/ε2) complexity in C-C and SC-C settings respectively
Quadratic growth conditions are closely related to error bound analysis and metric subregularity for monotone operators.
- Successfully extends quadratic growth conditions to saddle point problems, proposing two-sided QFG and QGG conditions
- GAPD algorithm achieves linear convergence under relaxed conditions, unifying existing methods
- Provides structured problem classes satisfying the new growth conditions
- Condition Verification: Verifying two-sided growth conditions in practical applications may be challenging
- Parameter Selection: Optimal parameter choice for θ requires problem-specific knowledge
- Constraint Handling: Primarily focuses on simple constraint sets with limited treatment of complex constraints
- Investigate convergence behavior under one-sided quadratic growth conditions
- Explore applications in distributed optimization
- Extend to more complex constrained optimization problems
- Theoretical Innovation: First systematically extends quadratic growth conditions to saddle point problems, filling an important theoretical gap
- Unified Framework: GAPD algorithm elegantly unifies multiple existing methods
- Practical Value: Relaxed conditions make the method applicable to a broader class of problems
- Rigorous Analysis: Provides complete convergence analysis and explicit convergence rates
- Limited Experiments: Numerical experiments are relatively simple, lacking validation on practical application scenarios
- Condition Relationships: Analysis of relationships between two-sided QFG and QGG conditions could be deeper
- Computational Complexity: Computational complexity per iteration is not analyzed in detail
- Academic Contribution: Provides important theoretical tools for saddle point optimization theory
- Practical Value: Method unification and flexibility make it promising for multiple application domains
- Extensibility: Provides a solid theoretical foundation for subsequent research
- Adversarial training in machine learning
- Distributionally robust optimization
- Game-theoretic applications
- Convex optimization problems with special structure
The paper cites 46 relevant references covering multiple related fields including saddle point optimization, variational inequalities, and quadratic growth conditions, providing a solid theoretical foundation for this research.