Structured covariance estimation via tensor-train decomposition
Patarusau, Puchkin, Rakhuba et al.
We consider a problem of covariance estimation from a sample of i.i.d. high-dimensional random vectors. To avoid the curse of dimensionality we impose an additional assumption on the structure of the covariance matrix $Σ$. To be more precise we study the case when $Σ$ can be approximated by a sum of double Kronecker products of smaller matrices in a tensor train (TT) format. Our setup naturally extends widely known Kronecker sum and CANDECOMP/PARAFAC models but admits richer interaction across modes. We suggest an iterative polynomial time algorithm based on TT-SVD and higher-order orthogonal iteration (HOOI) adapted to Tucker-2 hybrid structure. We derive non-asymptotic dimension-free bounds on the accuracy of covariance estimation taking into account hidden Kronecker product and tensor train structures. The efficiency of our approach is illustrated with numerical experiments.
academic
Structured Covariance Estimation via Tensor-Train Decomposition
This paper investigates the problem of estimating covariance matrices from independent and identically distributed samples of high-dimensional random vectors. To circumvent the curse of dimensionality, the authors impose additional structural assumptions on the covariance matrix Σ, specifically studying the case where Σ can be approximated as a sum of double Kronecker products of smaller matrices in tensor-train (TT) format. This setting naturally extends the well-known Kronecker sum and CANDECOMP/PARAFAC models while allowing richer interactions between modes. The authors propose polynomial-time iterative algorithms based on TT-SVD and higher-order orthogonal iteration (HOOI) adapted to Tucker-2 hybrid structures, and derive non-asymptotic dimension-free convergence bounds that account for the hidden Kronecker product and tensor-train structures.
Curse of Dimensionality: In high-dimensional settings, the classical sample covariance estimator Σ^=n1∑i=1nXiXiT suffers from the curse of dimensionality, with performance degrading sharply as d increases.
Necessity of Structural Assumptions: To overcome this challenge, statisticians typically impose additional structural assumptions on Σ to exploit data structure and reduce the total number of unknown parameters.
Limitations of Existing Methods:
Kronecker product models Σ=Φ⊗Ψ are overly simplistic
Kronecker sum models Σ=∑k=1KΦk⊗Ψk lack sufficient flexibility
CANDECOMP/PARAFAC models face NP-hardness computationally
Novel Covariance Model: Proposes a covariance structure based on tensor-train decomposition that naturally extends Kronecker sum and CANDECOMP/PARAFAC models while allowing richer inter-modal interactions.
Efficient Algorithm: Designs the HardTTh algorithm (Hard Tensor Train Thresholding) based on TT-SVD and HOOI adapted to Tucker-2 hybrid structures, with computational complexity O((J+K)Td1d2d3).
Theoretical Guarantees: Establishes non-asymptotic, dimension-free convergence bounds—the first dimension-free theoretical results for TT-structured tensor estimation.
Practical Validation: Numerical experiments verify the method's effectiveness and demonstrate the necessity of iterative refinement.
Input: Independent and identically distributed samples X1,…,Xn∈RpqrOutput: Estimate Σ~ of the covariance matrix ΣConstraint: Σ has TT structure, expressible as Σ=∑j=1J∑k=1KUj⊗Vjk⊗Wk
Tucker-2 Hybrid Structure: Unlike standard Tucker decomposition requiring three orthogonal factors, the TT structure requires only two, reducing computational complexity.
Iterative Refinement Strategy: Progressively improves estimation accuracy through alternating optimization of modal subspaces.
Hard Thresholding: Uses hard thresholding rather than soft thresholding, avoiding the NP-hardness of tensor nuclear norm approximation.
Necessity of Iterations: HardTTh shows significant improvement over TT-HOSVD, particularly at n=2000 where relative error decreases from 0.154 to 0.082.
Convergence Behavior:
At n=500: sinΘ(ImU^0,ImU∗)≈1, sinΘ(ImU^T,ImU∗)≈1
At n=2000: sinΘ(ImU^0,ImU∗)≈1, sinΘ(ImU^T,ImU∗)=0.33±0.08
Computational Efficiency: HardTTh has moderate time complexity, faster than full Tucker decomposition but slower than TT-HOSVD.
Experiments confirm the necessity of theoretical conditions: when singular value conditions are not satisfied (e.g., n=500), the algorithm cannot effectively recover subspaces; when conditions are satisfied (e.g., n≥2000), iterations significantly improve performance.
Model Advantages: The TT-format covariance model provides richer structure than traditional Kronecker models while maintaining computational feasibility.
Algorithm Effectiveness: The HardTTh algorithm achieves polynomial-time complexity and significantly improves estimation quality through iterations.
Theoretical Guarantees: Establishes the first dimension-free convergence bounds for TT-structured estimation, with variance term:
v~=96ω∥Σ∥nJr12(Σ)+JKr22(Σ)+Kr32(Σ)+log(48/δ)
Theoretical Innovation: First to provide dimension-free theoretical bounds for TT-structured covariance estimation, filling an important theoretical gap.
Practical Methodology: HardTTh algorithm has reasonable computational complexity and avoids NP-hardness.
Comprehensive Experiments: Validates effectiveness through multiple comparison methods and varying sample sizes.
Thorough Analysis: Provides detailed theoretical analysis and algorithm convergence studies.