Low-rank approximation of analytic kernels

Webb

Many algorithms in scientific computing and data science take advantage of low-rank approximation of matrices and kernels, and understanding why nearly-low-rank structure occurs is essential for their analysis and further development. This paper provides a framework for bounding the best low-rank approximation error of matrices arising from samples of a kernel that is analytically continuable in one of its variables to an open region of the complex plane. Elegantly, the low-rank approximations used in the proof are computable by rational interpolation using the roots and poles of Zolotarev rational functions, leading to a fast algorithm for their construction.

Basic Information

Paper ID: 2509.14017
Title: Low-rank approximation of analytic kernels
Author: Marcus Webb (University of Manchester)
Classification: math.NA cs.NA
Publication Date: October 15, 2025 (arXiv version v3)
Paper Link: https://arxiv.org/abs/2509.14017

Abstract

Many algorithms in scientific computing and data science exploit low-rank approximations of matrices and kernel functions. Understanding why approximate low-rank structures arise is crucial for their analysis and further development. This paper provides a framework of bounds on the best low-rank approximation error for matrices arising from samples of kernel functions that can be analytically continued to an open region of the complex plane in one variable. Elegantly, the low-rank approximations used in the proof can be computed through rational interpolation using the roots and poles of Zolotarev rational functions, yielding a fast constructive algorithm.

Research Background and Motivation

Core Problem: Many matrices and kernel functions in scientific computing and data science exhibit approximate low-rank structure, yet there is no unified theoretical framework for understanding and quantifying this phenomenon. Existing bounds are primarily based on polynomial approximation theory for smooth functions, and tend to be overly conservative for analytic kernel functions.
Problem Significance: Low-rank approximation is a core technique in modern numerical algorithms, with widespread applications in system identification, particle simulation, image compression, recommendation systems, and other fields. Understanding the fundamental causes of low-rank structure is essential for algorithm analysis and performance optimization.
Limitations of Existing Methods:
- Methods based on Chebyshev polynomial interpolation (Little-Reade theory) are overly pessimistic
- Beckermann-Townsend's displacement structure theory ignores the analyticity of kernel functions
- Lack of a unified framework for handling continuous kernel functions and discrete matrices
Research Motivation: The author observes that many analytic kernel functions possess latent displacement structure through the Cauchy integral formula, providing a new perspective for establishing more precise low-rank approximation theory.

Core Contributions

Theoretical Framework: Proposes a new theoretical framework based on Cauchy-Zolotarev numbers for bounding low-rank approximation errors of analytic kernel functions
Unified Approach: Establishes a unified framework for handling continuous kernel functions and discrete matrices/tensors
Computable Approximation: Shows that the low-rank approximations used in the proof can be constructed by rational interpolation using the roots and poles of Zolotarev rational functions
Grothendieck Duality Theory: Brings Grothendieck duality theory from functional analysis into numerical analysis
Practical Algorithm: Provides a fast algorithm based on rational interpolation that achieves or approaches optimal performance in multiple instances

Detailed Methodology

Problem Definition

Given a kernel function $K \in C(D \times E)$, where $D$ and $E$ are compact metric spaces, the goal is to find a rank-$n$ kernel function $K_n$ that minimizes the operator norm $\|K - K_n\|_{L^2_\mu(E) \to L^2_\lambda(D)}$.
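For discrete measures, this operator norm reduces to a matrix 2-norm after scaling rows and columns by the square roots of the weights, so the minimizer is a truncated SVD of the scaled matrix (Eckart-Young). The sketch below illustrates this under stated assumptions: the kernel $1/(x+y)$, the sample grids, the trapezoid weights, and the helper names `trapezoid_weights`, `weighted_operator_norm`, and `best_rank_n` are all illustrative choices, not taken from the paper.

```python
import numpy as np

def trapezoid_weights(t):
    """Trapezoid-rule weights on a sorted grid t (illustrative discrete measure)."""
    h = np.diff(t)
    w = np.empty_like(t)
    w[0], w[-1] = h[0] / 2, h[-1] / 2
    w[1:-1] = (h[:-1] + h[1:]) / 2
    return w

def weighted_operator_norm(A, lam, mu):
    """Norm of g -> sum_j A[i, j] g[j] mu[j], mapping L^2_mu to L^2_lam."""
    S = np.sqrt(lam)[:, None] * A * np.sqrt(mu)[None, :]
    return np.linalg.norm(S, 2)

def best_rank_n(A, lam, mu, n):
    """Best rank-n approximation in the weighted norm via a truncated SVD."""
    sl, sm = np.sqrt(lam), np.sqrt(mu)
    U, s, Vt = np.linalg.svd(sl[:, None] * A * sm[None, :], full_matrices=False)
    S_n = (U[:, :n] * s[:n]) @ Vt[:n, :]
    A_n = S_n / sl[:, None] / sm[None, :]  # undo the row and column scaling
    return A_n, s

x = np.linspace(1.0, 10.0, 60)            # samples of D (assumed)
y = np.linspace(1.0, 10.0, 80)            # samples of E (assumed)
A = 1.0 / (x[:, None] + y[None, :])       # sampled kernel K(x, y) = 1/(x + y)
lam, mu = trapezoid_weights(x), trapezoid_weights(y)

n = 5
A_n, s = best_rank_n(A, lam, mu, n)
err = weighted_operator_norm(A - A_n, lam, mu)
print(err, s[n])  # the error equals the (n+1)-th singular value of the scaled matrix
```

With unit weights the scaling disappears and the computation is the ordinary truncated SVD of the sample matrix.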

Core Theoretical Framework

Main Theorem 1.1: Let $K \in C(D \times E)$ admit analytic continuation such that $K \in C(D \times F')$ and for each $x \in D$, $K(x, \cdot)$ is analytic in $F'$. Then for $n = 1,2,3,\ldots$, there exists a rank-$n$ kernel function $K_n \in C(D \times E)$ satisfying:

$\|K - K_n\|_{L^2_\mu(E) \to L^2_\lambda(D)} \leq Z_n(L^2_\mu(E), L^p_\nu(F)) \|K'\|_{H^p_\nu(F) \to L^2_\lambda(D)}$

where $Z_n(L^2_\mu(E), L^p_\nu(F))$ is the Cauchy-Zolotarev number:

$Z_n(L^2_\mu(E), L^p_\nu(F)) = \inf_{\phi \in \mathcal{R}_n} \left\|\frac{\phi(z)^{-1}\phi(y)}{y-z}\right\|_{L^2_\mu(E) \to L^p_\nu(F)}$
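For orientation, the definition above can be compared with the classical Zolotarev number of two sets. What follows is a crude estimate under stated assumptions (finite measures $\mu$ and $\nu$, $\operatorname{dist}(E,F) > 0$, and $\mathcal{R}_n$ read as rational functions of degree at most $n$); it is a sketch for intuition and not a bound quoted from the paper.

$Z_n(E,F) = \inf_{\phi \in \mathcal{R}_n} \frac{\sup_{y \in E}|\phi(y)|}{\inf_{z \in F}|\phi(z)|}$

$\left|\frac{\phi(z)^{-1}\phi(y)}{y-z}\right| \leq \frac{1}{\operatorname{dist}(E,F)} \cdot \frac{\sup_{E}|\phi|}{\inf_{F}|\phi|} \quad \text{for } y \in E,\ z \in F$

$Z_n(L^2_\mu(E), L^p_\nu(F)) \leq \frac{\mu(E)^{1/2}\,\nu(F)^{1/p}}{\operatorname{dist}(E,F)} \, Z_n(E,F)$

The last line bounds the integral operator by the supremum of its kernel: the Cauchy-Schwarz inequality in $y$ contributes $\mu(E)^{1/2}$, and taking the $L^p_\nu(F)$ norm of a bounded function contributes $\nu(F)^{1/p}$. Because the estimate discards all structure of the operator beyond its pointwise size, the Cauchy-Zolotarev number can be considerably smaller than the right-hand side.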

Key Technical Components

Operator Decomposition: Establishes the decomposition $K = K' \circ C$ through the Cauchy integral formula, where:
- $C$: Cauchy transform operator, $C[g](z) = \int_E \frac{g(y)}{y-z} d\mu(y)$
- $K'$: Grothendieck dual operator, $K'[h](x) = \frac{1}{2\pi i} \int_\Gamma K(x,\xi)h(\xi)d\xi$
Cauchy-Zolotarev Numbers: A new concept combining classical Zolotarev numbers and the Cauchy transform, providing guarantees of exponential decay.
Rational Interpolation Construction: The low-rank approximation is constructed via the Hermite integral formula: $K_n(x,y) = \frac{1}{2\pi i} \int_\Gamma K(x,\xi) \left(1 - \frac{\phi(y)}{\phi(\xi)}\right) \frac{1}{y-\xi} d\xi$
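For the Cauchy kernel $K(x,y) = 1/(x+y)$ the Hermite integral can be evaluated in closed form, since $K(x,\cdot)$ has a single pole at $\xi = -x$: the approximation becomes $K_n(x,y) = (1 - \phi(y)/\phi(-x))\,K(x,y)$, which has rank at most $n$ when $\phi$ has degree $n$. The sketch below checks this numerically. It is illustrative only: the roots $a_k$ are geometrically spaced rather than true Zolotarev roots, the poles are placed at $-a_k$, and the grids and the helper `phi` are assumptions made for the example.

```python
import numpy as np

def phi(t, a):
    """Rational function with roots a_k and poles -a_k: prod_k (t - a_k)/(t + a_k)."""
    t = np.asarray(t, dtype=float)
    return np.prod((t[:, None] - a[None, :]) / (t[:, None] + a[None, :]), axis=1)

x = np.linspace(1.0, 10.0, 50)            # sample points in D (assumed)
y = np.linspace(1.0, 10.0, 50)            # sample points in E (assumed)
K = 1.0 / (x[:, None] + y[None, :])

n = 6
a = np.geomspace(1.0, 10.0, n)            # assumed roots, NOT optimal Zolotarev roots
px, py = phi(x, a), phi(y, a)

# For this symmetric choice of poles, 1/phi(-x) = phi(x), so
# phi(y)/phi(-x) = phi(x) * phi(y) and K - K_n = diag(px) K diag(py).
K_n = (1.0 - px[:, None] * py[None, :]) * K

rel_err = np.linalg.norm(K - K_n, 2) / np.linalg.norm(K, 2)
bound = np.max(np.abs(px)) * np.max(np.abs(py))   # sup|phi| on D times sup|phi| on E
sv = np.linalg.svd(K, compute_uv=False)
rank_Kn = np.linalg.matrix_rank(K_n, tol=1e-10)

print("rank of K_n:", rank_Kn)
print("relative error:", rel_err, "bound:", bound, "optimal:", sv[n] / sv[0])
```

The relative error sits between the optimal value $\sigma_{n+1}/\sigma_1$ and the product of the suprema of $|\phi|$; replacing the geometric nodes with Zolotarev roots would shrink that product.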

Technical Innovations

Exploitation of Analyticity: First systematic utilization of analytic properties of kernel functions to establish low-rank approximation theory
Displacement Structure Revelation: Reveals latent displacement structure of analytic kernel functions through Cauchy integral formula
Functional Analysis Tools: Introduces Grothendieck duality theory to numerical analysis, providing new analytical tools
Constructive Proof: The proof not only provides error bounds but also furnishes computable approximation methods
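The displacement structure referred to above is easiest to see on the Cauchy matrix itself, which satisfies a Sylvester equation with a rank-1 right-hand side. The following minimal sketch verifies this; the grids and variable names are assumptions chosen for illustration.

```python
import numpy as np

x = np.linspace(1.0, 10.0, 40)
y = np.linspace(1.0, 10.0, 30)
C = 1.0 / (x[:, None] + y[None, :])       # Cauchy matrix C[i, j] = 1/(x_i + y_j)

# Sylvester displacement: diag(x) C + C diag(y) has entries (x_i + y_j)/(x_i + y_j) = 1
displacement = np.diag(x) @ C + C @ np.diag(y)
disp_rank = np.linalg.matrix_rank(displacement)

print("displacement rank:", disp_rank)
```

Every entry of the displacement is 1, so it is the rank-1 matrix of all ones even though $C$ itself is not exactly low rank.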

Experimental Setup

Test Matrix Types

Gamma Function Matrix: $A_{i,j} = \frac{\Gamma(i+j+1/2)}{\Gamma(i+j+1)}$
Cauchy Matrix: $A_{i,j} = \frac{1}{x_i + y_j}$
Log-Cauchy Matrix: $A_{i,j} = \log(x_i + y_j)$
Twisted Hankel Transform Matrix: $A_{i,j} = H^{(1)}_0(\omega_i \omega_j / \omega_{N+1}) e^{-i\omega_i \omega_j / \omega_{N+1}}$
Beta-Cauchy Matrix: $A_{i,j} = B(i+j+\alpha, \beta)$

Evaluation Metrics

Relative error: $\|A - A_n\|_2 / \|A\|_2$
Comparison with optimal singular values: $\sigma_{n+1}(A) / \sigma_1(A)$
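As an illustration of the second metric, the normalized singular values of the Gamma function matrix can be computed directly. This is a sketch under assumptions: indices are taken to run from 1 to $N$ (the summary does not state the index range), and the $10^{-10}$ threshold used for the numerical rank is an arbitrary choice.

```python
import math
import numpy as np

N = 100
idx = np.arange(1, N + 1)                 # assumed 1-based indices i, j = 1..N
s_ij = idx[:, None] + idx[None, :]

# Use log-gamma to avoid overflow: A[i, j] = Gamma(i+j+1/2) / Gamma(i+j+1)
lgamma = np.vectorize(math.lgamma)
A = np.exp(lgamma(s_ij + 0.5) - lgamma(s_ij + 1.0))

sv = np.linalg.svd(A, compute_uv=False)
rel = sv / sv[0]                          # sigma_{n+1}(A) / sigma_1(A) for n = 0, 1, ...
num_rank = int(np.sum(rel > 1e-10))       # numerical rank at an assumed 1e-10 threshold

print("numerical rank:", num_rank, "of", N)
```

Any theoretical bound in the comparison should lie above the curve `rel`, and a constructed approximation $A_n$ is judged by how close $\|A - A_n\|_2 / \|A\|_2$ comes to `rel[n]`.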

Comparison Methods

Little-Reade Bound: Based on Chebyshev polynomial interpolation
Beckermann-Townsend Bound: Based on displacement structure
Optimal Singular Values: Theoretical best performance
Proposed Method: Theorem 1.1 bound and Zolotarev rational interpolation

Implementation Details

Matrix size: typically $N = 50$ to $N = 100$
Zolotarev rational functions computed via Trefethen-Wilber algorithm
Barycentric form used for numerically stable rational interpolation evaluation
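The barycentric form mentioned above writes a rational interpolant as a quotient of two partial-fraction sums, which avoids forming numerator and denominator polynomials explicitly. The sketch below is a generic real-valued evaluator, not the paper's implementation; the function name `bary_eval` and the test data are hypothetical, and the weights used in the example are the polynomial-interpolation weights, chosen only because the result is easy to check.

```python
import numpy as np

def bary_eval(z, nodes, values, weights):
    """Evaluate r(z) = sum_k w_k f_k/(z - z_k) / sum_k w_k/(z - z_k)."""
    z = np.atleast_1d(np.asarray(z, dtype=float))
    diff = z[:, None] - nodes[None, :]
    hit = diff == 0.0                     # evaluation points that coincide with a node
    diff[hit] = 1.0                       # placeholder to avoid division by zero
    terms = weights[None, :] / diff
    r = (terms @ values) / terms.sum(axis=1)
    rows, cols = np.nonzero(hit)
    r[rows] = values[cols]                # at a node, the interpolant equals the data
    return r

nodes = np.linspace(1.0, 10.0, 5)
values = nodes ** 2                       # sample f(z) = z^2 at the nodes
# Polynomial barycentric weights w_k = 1 / prod_{j != k} (z_k - z_j)
weights = np.array([1.0 / np.prod(np.delete(nodes[k] - nodes, k))
                    for k in range(nodes.size)])

z = np.array([2.0, 3.3, 10.0])
r = bary_eval(z, nodes, values, weights)
print(r)                                  # reproduces z**2 up to rounding
```

Other choices of weights give genuinely rational interpolants with the same interpolation property at the nodes; the weights determine where the poles lie.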

Experimental Results

Main Results

The proposed bounds match or improve on existing theoretical bounds in the reported test cases:

Gamma Function Matrix ($N=100$): The new bound is approximately 6 orders of magnitude tighter than the Little-Reade bound, and 3 orders of magnitude tighter than the Beckermann-Townsend bound
Cauchy Matrix: Recovers the Beckermann-Townsend result exactly, which serves as a consistency check on the framework
Log-Cauchy Matrix: Zolotarev rational interpolation performs approximately 50 times better than methods based on classical Zolotarev numbers
Twisted Hankel Transform Matrix: Semi-discrete Zolotarev interpolation achieves near-optimal performance

Key Findings

Exponential Decay: All test cases exhibit exponential singular value decay
Achievable Bounds: Low-rank approximations constructed via rational interpolation nearly achieve theoretical bounds
Discrete Optimization: Zolotarev rational functions optimized on discrete point sets typically outperform continuous versions
Practical Utility: The method exhibits good numerical stability in practical applications

Ablation Studies

Validates advantages of Cauchy-Zolotarev numbers over classical Zolotarev numbers
Demonstrates importance of Grothendieck dual operator norms
Compares effectiveness of different interpolation node selection strategies

Main Research Directions

Smooth Kernel Function Theory: Methods by Little-Reade and others based on polynomial approximation
Displacement Structure Theory: Methods by Beckermann-Townsend and others based on Sylvester equations
Rational Approximation Theory: Zolotarev numbers and conformal mapping methods
Functional Analysis: Grothendieck duality theory and holomorphic function spaces

Advantages of This Work

More Precise Bounds: Exploits analyticity to obtain tighter error bounds than existing methods
Unified Framework: Handles both continuous and discrete cases simultaneously
Constructive Method: Provides computable low-rank approximations rather than existence results alone
Theoretical Depth: Establishes deep connections with functional analysis

Conclusions and Discussion

Main Conclusions

The low-rank structure of analytic kernel functions can be quantified through the Cauchy integral formula and Zolotarev rational functions
Cauchy-Zolotarev numbers provide error bounds at least as tight as existing methods in the cases tested
The low-rank approximations used in the proof can be computed efficiently via rational interpolation
Grothendieck duality theory provides new theoretical tools for numerical analysis

Limitations

Analyticity Requirement: Method applies only to analytically continuable kernel functions
Zolotarev Computation: Computing optimal Zolotarev rational functions for general sets remains difficult
Higher-Order Singularities: Handling higher-order singularities like $(y-x)^{-2}$ requires Sobolev spaces
Algorithm Reliability: The Trefethen-Wilber algorithm is described as roughly 90% reliable, which limits practical applicability

Future Directions

Zolotarev Computation: Develop more reliable methods for computing Zolotarev rational functions on discrete sets
Higher-Order Singularities: Extend theory to Cauchy-Sobolev-Zolotarev numbers
Potential Theory Applications: Apply theory to potential-theoretic methods in analytic function approximation
Adaptive Algorithms: Develop adaptive interpolation strategies when set $F$ is unknown

In-Depth Evaluation

Strengths

Theoretical Innovation: First complete theoretical framework for low-rank approximation of analytic kernel functions
Practical Value: Provides computable algorithms with excellent performance on practical problems
Mathematical Depth: Skillfully combines tools from complex analysis, functional analysis, and numerical analysis
Comprehensive Experiments: Validates theory through multiple representative examples
Clear Presentation: Well-structured paper with rigorous mathematical derivations

Weaknesses

Limited Scope: Restricted to analytic kernel functions; not applicable to general smooth kernels
Computational Complexity: Computation of Zolotarev rational functions remains difficult in some cases
Numerical Stability: Analysis of numerical stability for ill-conditioned problems is insufficient
Parameter Selection: Choice of sets $E$ and $F$ significantly affects results but lacks systematic guidance

Impact

Theoretical Contribution: Provides new perspectives and tools for low-rank approximation theory
Application Prospects: Broad potential applications in scientific computing and data science
Interdisciplinary Bridge: Promotes cross-disciplinary fusion between numerical analysis and functional analysis
Algorithm Development: Provides new theoretical foundations for fast algorithm design

Applicable Scenarios

Scientific Computing: PDE solving, integral equation discretization
Data Science: Kernel methods, recommendation systems, image processing
Signal Processing: Fast transforms, filtering algorithms
Machine Learning: Kernel machine learning, Gaussian processes

References

The paper cites 35 important references covering classical works in complex analysis, functional analysis, numerical analysis, and scientific computing, particularly relevant literature on Zolotarev rational approximation theory, displacement structure theory, and Grothendieck duality theory.

This paper makes important contributions at both theoretical and practical levels, providing powerful tools for understanding and exploiting low-rank structure of analytic kernel functions. Despite certain limitations, its innovation and practical value make it a significant advance in the field.