We study the algorithmic problem of multiplying large matrices that are rectangular. We prove that the method that has been used to construct the fastest algorithms for rectangular matrix multiplication cannot give algorithms with complexity $n^{p + 1}$ for $n \times n$ by $n \times n^p$ matrix multiplication. In fact, we prove a precise numerical barrier for this method. Our barrier improves the previously known barriers, both in the numerical sense, as well as in its generality. In particular, we prove that any lower bound on the dual exponent of matrix multiplication $α$ via the big Coppersmith-Winograd tensors cannot exceed 0.6218.
- Paper ID: 2003.03019
- Title: Barriers for rectangular matrix multiplication
- Authors: Matthias Christandl, François Le Gall, Vladimir Lysikov, Jeroen Zuiddam
- Classification: cs.CC (Computational Complexity), math.AC (Commutative Algebra)
- Publication Date: November 10, 2025 (arXiv version)
- Paper Link: https://arxiv.org/abs/2003.03019
This paper investigates algorithmic problems in large-scale rectangular matrix multiplication. The authors prove that methods used to construct the fastest rectangular matrix multiplication algorithms cannot provide algorithms with complexity np+1 for multiplying n×n matrices by n×np matrices. In fact, the authors establish precise numerical barriers for such methods. This barrier improves upon previously known barriers both in numerical significance and generality. In particular, the authors prove that any lower bound on the matrix multiplication dual exponent α obtained through large Coppersmith-Winograd tensors cannot exceed 0.6218.
- Matrix multiplication complexity problem: Given two large matrices, how many scalar arithmetic operations are required to compute their matrix product? The standard algorithm requires approximately 2n3 operations for two n×n square matrices, but the theoretical lower bound is only n2.
- Rectangular matrix multiplication: In practical applications, matrices to be multiplied are typically rectangular rather than square. For arbitrary non-negative real numbers p, given an n×⌈np⌉ matrix and an ⌈np⌉×n matrix, how many operations are needed to compute their product?
- Exponent definition: ω(p) denotes the optimal exponent of n in the number of operations required by any arithmetic algorithm, with prior bounds max(2,1+p)≤ω(p)≤2+p.
- Theoretical importance: Understanding ω(p) is not only meaningful for rectangular matrix multiplication but also serves as a means to prove ω=2 (the optimal exponent for square matrix multiplication).
- Practical applications: Rectangular matrix multiplication has direct applications in linear programming solving, empirical risk minimization, and other fields.
- Technical limitations: Current techniques encounter bottlenecks in improving upper bounds, necessitating understanding of their fundamental constraints.
- Established a universal barrier framework: Established precise numerical barriers for the main current techniques in constructing rectangular matrix multiplication algorithms.
- Improved numerical bounds: Improved upon previous barrier results in both numerical significance and generality.
- Introduced virtual matrix multiplication tensors: Introduced new mathematical tools to handle non-integer values of p.
- Analyzed catalytic methods: Investigated more complex algorithm structures involving catalytic tensors.
- Precise bounds on dual exponent: Proved that lower bounds on α obtained through Coppersmith-Winograd tensors cannot exceed 0.6218.
Study the rectangular matrix multiplication problem: given an n×⌈np⌉ matrix A and an ⌈np⌉×n matrix B, compute the number of arithmetic operations required to calculate the product AB. The goal is to understand the fundamental limitations of current techniques in improving the complexity upper bound ω(p).
Matrix multiplication problems correspond to tensor families:
- Multiplication of ℓ×m matrix by m×n matrix corresponds to tensor: ⟨ℓ,m,n⟩=∑i=1ℓ∑j=1m∑k=1nxijyjkzki
- Unit problem corresponds to diagonal tensor: ⟨n⟩=∑i=1nxiyizi
Defined multiple tensor reduction types:
- Restriction (S≤T): There exist linear maps such that S=T∘(A,B,C)
- Degeneration (S◃T): S=limϵ→0T(A(ϵ)x,B(ϵ)y,C(ϵ)z)
- Monomial restriction/degeneration: Matrices A,B,C have at most one non-zero element per row and column
Defined the class of appropriate tensor parameters F satisfying:
- ≤-monotonicity: S≤T⇒F(S)≤F(T)
- ⊗-submultiplicativity: F(S⊗T)≤F(S)⋅F(T)
- MaMu-⊗-multiplicativity: F(⟨ℓ1ℓ2,m1m2,n1n2⟩)=F(⟨ℓ1,m1,n1⟩)⋅F(⟨ℓ2,m2,n2⟩)
- Self-⊕-additivity: F(T⊕s)=s⋅F(T)
- Asymptotic rank bound: F(T)≤R~(T)
To handle real numbers p, introduced formal symbol ⟨2,2,2p⟩:
- When p=logab (a,b positive integers): F(⟨2,2,2p⟩)=2logaF(⟨a,a,b⟩)
- Otherwise defined through infimum: F(⟨2,2,2p⟩)=inf{F(⟨2,2,2P⟩)∣P≥p,∃a,b∈Z≥0:P=logab}
By applying appropriate parameters F,G to both ends of the algorithm chain:
⟨n,n,m⟩⊕s≤T⊗k≤⟨r⟩⊗kb
Obtained:
logF(T)logF(⟨2,2,2p⟩)logR~(T)≤ω(p)
Used Strassen's upper support functionals as appropriate parameters:
ζθ(T)=minS≅TmaxP∈P(supp(S))2∑i∈[3]θiH(Pi)
where θ=(θ1,θ2,θ3)∈P([3]), H is Shannon entropy.
Analyzed CW tensor:
CWq(x,y,z)=x0y0zq+1+x0yq+1z0+xq+1y0z0+∑i=1q(x0yizi+xiy0zi+xiyiz0)
Known that R~(CWq)=q+2.
Barrier computation transformed into convex optimization problem:
maxθmaxP∑i=13θiH(Pi)2θ1+(p+1)(θ2+θ3)log2(q+2)
For CWq tensor, barrier values for ω(2):
| q | ω(2)≥ | Optimal θ1 |
|---|
| 2 | 3.0626 | 0.096 |
| 6 | 3.1039 | 0.136 |
| 10 | 3.1409 | 0.165 |
| 14 | 3.1714 | 0.185 |
| q | α Barrier |
|---|
| 2 | 0.6218 |
| 6 | 0.5408 |
| 10 | 0.4914 |
| 14 | 0.4529 |
Key Result: Any lower bound on α obtained through degeneration of CWq (for arbitrary q) cannot exceed 0.6218.
- Alman-Vassilevska Williams AW18a: Monomial degeneration through CW6 can only yield α≥0.871
- This paper: Stronger degeneration through CW6 can only yield α≥0.543
- Current best lower bound: α>0.321334 WXXZ24
For κ-catalytic methods, the barrier strengthens to:
ω(p)≥logF(T)logF(⟨2,2,2p⟩)logR~(T)+κ(logF(T)logR~(T)−1)
- Ambainis-Filmus-Le Gall AFLG15: First proved barriers in matrix multiplication, showing certain methods cannot achieve ω=2.
- Alman-Vassilevska Williams AW18a,AW18b:
- Extended to monomial degenerations
- First studied barriers for rectangular matrix multiplication
- Based on asymptotic independence analysis
- Blasiak et al. BCC+17a,BCC+17b: Studied barriers for group-theoretic methods.
- Christandl-Vrana-Zuiddam CVZ19:
- More general degeneration barriers
- Based on tensor irreversibility
- Used quantum functionals and support functionals
- Higher numerical bounds: Tighter barriers compared to previous work
- Broader applicability: Applicable not only to 0≤p≤1 but also to p≥1
- Unified framework: Encompasses all known reduction concepts
- Mixed method analysis: First systematic analysis of mixed intermediate tensor methods
- Fundamental limitations: Current mainstream techniques (degeneration methods based on Coppersmith-Winograd tensors) have fundamental limitations in improving rectangular matrix multiplication complexity.
- Precise numerical bounds: Lower bounds on the dual exponent α obtained through any CWq tensor cannot exceed 0.6218, far below the theoretical maximum of 1.
- Technical bottlenecks: Demonstrated why current techniques cannot significantly narrow the gap between upper and lower bounds of ω(p).
- Method specificity: Barriers apply only to methods based on specific intermediate tensors (such as CW tensors), not excluding other possible algorithm design approaches.
- Lower bound nature: These are methodological barriers rather than lower bounds on the problem itself, not excluding the possibility of better algorithms.
- Computational complexity: Numerical computation relies on convex optimization, which may face computational challenges for larger tensors.
- New intermediate tensors: Search for new types of intermediate tensors not constrained by current barriers.
- Non-tensor methods: Explore entirely new algorithm design paradigms not based on tensor degeneration.
- Tightness of barriers: Study whether the proved barriers are tight.
- Other reduction types: Analyze barriers under more general reduction concepts.
- Theoretical depth: Established a complete barrier theory framework with high mathematical rigor.
- Technical innovations:
- Clever introduction of virtual matrix multiplication tensors to handle non-integer exponents
- Abstraction of appropriate tensor parameters provides unified analytical tools
- Practical value: Precise numerical results provide algorithm designers with clear technical limitation guidance.
- Comprehensiveness: Covers the complete chain from foundational theory to concrete computation.
- Barrier limitations: Apply only to specific algorithm types, potentially circumventable methods may exist.
- Computational dependence: Numerical results depend on support functional computation, potentially difficult for more complex tensors.
- Gap analysis: While barriers are proved, lacks deep analysis of what the gap between barriers and current best results implies.
- Theoretical contribution: Provides new analytical tools and perspectives for complexity theory.
- Practical guidance: Helps researchers understand current technique limitations and guides future research directions.
- Methodological value: Barrier analysis framework may apply to other algorithm design problems.
- Algorithm design: Provides theoretical guidance for matrix multiplication algorithm designers.
- Complexity analysis: Provides methodological reference for barrier analysis of other algebraic problems.
- Optimization theory: Has application value in scenarios requiring understanding of fundamental algorithm limitations.
Main related works include:
- AFLG15 Ambainis, Filmus, Le Gall: Fast matrix multiplication limitations
- AW18a Alman, Vassilevska Williams: Further limitations of known approaches
- CVZ19 Christandl, Vrana, Zuiddam: Barriers from irreversibility
- CW90 Coppersmith, Winograd: Matrix multiplication via arithmetic progressions
- Str91 Strassen: Degeneration and complexity of bilinear maps