2025-11-25T02:22:17.580847

Optimal Bounds for Tyler's M-Estimator for Elliptical Distributions

Lau, Ramachandran

A fundamental problem in statistics is estimating the shape matrix of an Elliptical distribution. This generalizes the familiar problem of Gaussian covariance estimation, for which the sample covariance achieves optimal estimation error. For Elliptical distributions, Tyler proposed a natural M-estimator and showed strong statistical properties in the asymptotic regime, independent of the underlying distribution. Numerical experiments show that this estimator performs very well, and that Tyler's iterative procedure converges quickly to the estimator. Franks and Moitra recently provided the first distribution-free error bounds in the finite sample setting, as well as the first rigorous convergence analysis of Tyler's iterative procedure. However, their results exceed the sample complexity of the Gaussian setting by a $\log^{2} d$ factor. We close this gap by proving optimal sample threshold and error bounds for Tyler's M-estimator for all Elliptical distributions, fully matching the Gaussian result. Moreover, we recover the algorithmic convergence even at this lower sample threshold. Our approach builds on the operator scaling connection of Franks and Moitra by introducing a novel pseudorandom condition, which we call $\infty$-expansion. We show that Elliptical distributions satisfy $\infty$-expansion at the optimal sample threshold, and then prove a novel scaling result for inputs satisfying this condition.


Optimal Bounds for Tyler's M-Estimator for Elliptical Distributions

Basic Information

Paper ID: 2510.13751
Title: Optimal Bounds for Tyler's M-Estimator for Elliptical Distributions
Authors: Lap Chi Lau (University of Waterloo), Akshay Ramachandran (University of British Columbia)
Categories: math.ST cs.LG stat.TH
Published: October 2025 (arXiv preprint)
Paper link: https://arxiv.org/abs/2510.13751

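Abstract

Estimating the shape matrix of an elliptical distribution is a fundamental problem in statistics that generalizes Gaussian covariance estimation. Tyler proposed a natural M-estimator and proved strong statistical properties in the asymptotic regime. Franks and Moitra recently gave the first distribution-free error bounds in the finite-sample setting, but their sample complexity exceeds that of the Gaussian case by a $\log^2 d$ factor. By introducing a new pseudorandom condition, $\infty$-expansion, this paper proves the optimal sample threshold and error bounds for Tyler's M-estimator, fully matching the Gaussian result, and recovers algorithmic convergence at this lower sample threshold.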

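Research Background and Motivation

Problem Background

Core problem: estimate the shape matrix of an elliptical distribution, an important generalization of high-dimensional covariance estimation.
Practical significance:
- Elliptical distributions include important special cases such as the multivariate Gaussian and the multivariate t-distribution.
- For heavy-tailed distributions the covariance matrix may not exist, but the shape matrix still captures the geometry of the distribution.
- Elliptical distributions are widely used in fields such as finance and signal processing.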

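Limitations of Existing Methods

Limitations of the sample covariance: it performs poorly on heavy-tailed distributions, where the covariance may not even exist.
Theoretical gaps for Tyler's estimator:
- Tyler (1987) gave only asymptotic guarantees.
- The finite-sample bounds of Franks and Moitra (2020) carry an extra $\log^2 d$ factor.
- Their sample complexity is $n \gtrsim d\log^2 d$, which exceeds the optimal $n \gtrsim d$ of the Gaussian case.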

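Research Motivation

This paper aims to answer the following question: can Tyler's estimator achieve the same optimal guarantees on elliptical distributions as Gaussian covariance estimation, or is shape estimation inherently harder?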

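Core Contributions

Optimal sample complexity: with $n \gtrsim \frac{d}{\varepsilon^2}$ samples, Tyler's M-estimator is proved to achieve relative operator-norm error $\varepsilon$.
Optimal error bound: the bound matches the lower bound for the Gaussian case, so the result is tight.
Algorithmic convergence: linear convergence of Tyler's iterative procedure is recovered at the optimal sample threshold $n \gtrsim d$.
New theoretical tool: the $\infty$-expansion condition is introduced, giving a stronger tool for analyzing frame scaling.
Technical innovation: two key components of the Franks-Moitra approach are improved, removing the $\log d$ factors.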

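Detailed Method Description

Task Definition

Input: $n$ samples $x_1, \ldots, x_n \in \mathbb{R}^d$ from an elliptical distribution $E(\Sigma, u)$
Output: an estimate $\hat{\Sigma}$ of the shape matrix $\Sigma$
Goal: minimize the relative operator-norm error $\|I_d - \Sigma^{1/2}\hat{\Sigma}^{-1}\Sigma^{1/2}\|_{op}$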
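The error metric in the goal can be evaluated numerically when the true shape matrix is known, for example in a simulation. The following is a minimal sketch in NumPy; the function name `relative_op_error` is an illustrative choice, not something taken from the paper, and both inputs are assumed to be symmetric positive definite and on the same scale.

```python
import numpy as np

def relative_op_error(sigma, sigma_hat):
    """Return ||I - Sigma^{1/2} Sigma_hat^{-1} Sigma^{1/2}||_op for SPD inputs."""
    # Symmetric square root of Sigma from its eigendecomposition.
    w, U = np.linalg.eigh(sigma)
    root = (U * np.sqrt(w)) @ U.T
    # solve(sigma_hat, root) computes Sigma_hat^{-1} Sigma^{1/2} without an explicit inverse.
    M = np.eye(sigma.shape[0]) - root @ np.linalg.solve(sigma_hat, root)
    # ord=2 is the largest singular value, i.e. the operator norm.
    return np.linalg.norm(M, ord=2)
```

The error is zero exactly when the estimate equals the true shape matrix, and it is measured relative to $\Sigma$, so it does not depend on the conditioning of $\Sigma$ itself.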

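Elliptical Distributions and Tyler's Estimator

Definition of an elliptical distribution: $X := \Sigma^{1/2}V \cdot u$, where $V \sim S^{d-1}$ is a uniformly random unit vector and $u \in \mathbb{R}$ is an independent scalar random variable.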
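The definition translates directly into a sampler: draw a uniform direction on the sphere, stretch it by $\Sigma^{1/2}$, and multiply by an independent scalar. The sketch below assumes NumPy; the name `sample_elliptical` and the `scalar_sampler` callback are hypothetical helpers introduced here for illustration.

```python
import numpy as np

def sample_elliptical(sigma, n, scalar_sampler, rng):
    """Draw n samples x_j = u_j * Sigma^{1/2} v_j, returned as the rows of an (n, d) array."""
    d = sigma.shape[0]
    # A normalized standard Gaussian vector is uniform on the unit sphere S^{d-1}.
    g = rng.standard_normal((n, d))
    v = g / np.linalg.norm(g, axis=1, keepdims=True)
    # Independent scalar factors u_j; their distribution is arbitrary.
    u = scalar_sampler(rng, n)
    # Symmetric square root of Sigma.
    w, U = np.linalg.eigh(sigma)
    root = (U * np.sqrt(w)) @ U.T
    # root is symmetric, so row j of v @ root is (Sigma^{1/2} v_j)^T.
    return (v @ root) * u[:, None]
```

For instance, `sample_elliptical(sigma, 1000, lambda rng, m: rng.standard_cauchy(m), np.random.default_rng(0))` produces heavy-tailed samples whose covariance does not exist while the shape matrix is still `sigma`.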

Tyler's M-estimator: the unique solution $\hat{\Sigma}$ of the equation $\frac{d}{n}\sum_{j=1}^n \frac{x_jx_j^T}{x_j^T\hat{\Sigma}^{-1}x_j} = \hat{\Sigma}, \quad \text{Tr}[\hat{\Sigma}] = d$
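The defining equation suggests a fixed-point iteration: plug the current iterate into the left-hand side and renormalize the trace. This summary does not spell out Tyler's iterative procedure, so the sketch below shows the commonly used fixed-point form as an assumption; the stopping rule, the tolerance, and the name `tyler_estimator` are illustrative choices.

```python
import numpy as np

def tyler_estimator(X, tol=1e-9, max_iter=500):
    """Fixed-point iteration for Tyler's equation; X has one sample per row, shape (n, d)."""
    n, d = X.shape
    sigma = np.eye(d)
    for _ in range(max_iter):
        # q_j = x_j^T sigma^{-1} x_j for every sample.
        q = np.einsum("ij,ij->i", X, np.linalg.solve(sigma, X.T).T)
        # (d/n) * sum_j x_j x_j^T / q_j
        new = (d / n) * (X.T / q) @ X
        # Enforce the normalization Tr[sigma] = d.
        new *= d / np.trace(new)
        if np.linalg.norm(new - sigma, ord="fro") <= tol:
            return new
        sigma = new
    return sigma
```

Each term $x_jx_j^T / (x_j^T\Sigma^{-1}x_j)$ is unchanged when $x_j$ is rescaled, so the result does not depend on the scalar factors $u_j$. This is why the guarantees can hold for every elliptical distribution with the same shape matrix.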

Core Technical Framework

1. Connection to Frame Scaling

Tyler's estimator is equivalent to a frame scaling problem:

Frame: $V = \{v_1, \ldots, v_n\} \in \mathbb{R}^{d \times n}$
Goal: find a left scaling $L \in \mathbb{R}^{d \times d}$ and a right scaling $R \in \text{diag}(n)$ such that $V' = LVR$ satisfies:
- Isotropy: $V'V'^T = \frac{s(V')}{d}I_d$
- Equal norms: $\|v'_j\|_2^2 = \frac{s(V')}{n}$
Here $s(V') = \|V'\|_F^2$ is the size of the frame, as taking the trace of the isotropy condition shows.
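One way to see the equivalence, offered here as a sketch and not as the paper's own derivation: take $L = \hat{\Sigma}^{-1/2}$ and let $R$ normalize each column of $LV$ to unit length. Then $\|v'_j\|_2^2 = 1$ and Tyler's equation gives $V'V'^T = \frac{n}{d}I_d$, which are the two conditions with $s(V') = n$. The helper names below are hypothetical.

```python
import numpy as np

def scaling_defects(V):
    """Relative violation of the isotropy and equal-norm conditions for a d x n frame."""
    d, n = V.shape
    s = np.sum(V * V)  # s(V) = ||V||_F^2
    iso = np.linalg.norm(V @ V.T - (s / d) * np.eye(d), ord=2) / (s / d)
    eq = np.max(np.abs(np.sum(V * V, axis=0) - s / n)) / (s / n)
    return iso, eq

def scale_with_shape(V, sigma_hat):
    """Apply L = sigma_hat^{-1/2} on the left, then normalize columns (the right scaling R)."""
    w, U = np.linalg.eigh(sigma_hat)
    L = (U / np.sqrt(w)) @ U.T
    LV = L @ V
    return LV / np.linalg.norm(LV, axis=0)
```

The equal-norm defect of `scale_with_shape(V, sigma_hat)` is zero for any positive definite `sigma_hat`. The isotropy defect vanishes only when `sigma_hat` solves Tyler's equation for the columns of `V`.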

2. The ∞-Expansion Condition

Definition: a frame $V$ satisfies $(1-\lambda)$-$\infty$-expansion if $\forall y \perp \mathbf{1}_n, \|y\|_\infty \leq 1: \left\|\sum_{j=1}^n y_j v_j v_j^T\right\|_{op} \leq \frac{s(V)(1-\lambda)}{d}$

This condition is stronger than quantum expansion. The key changes are:

The constraint on $y$ is strengthened from $\|y\|_2 \leq 1$ to $\|y\|_\infty \leq 1$
The output is measured in operator norm instead of Frobenius norm
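Verifying the condition exactly means maximizing a convex function over a polytope, which is expensive in general. The sketch below is a local-search heuristic, not a method from the paper. For a fixed unit direction $z$, the best $y$ puts $+1$ on the largest half of the values $\langle v_j, z\rangle^2$ and $-1$ on the smallest half. For a fixed $y$, the best $z$ is an eigenvector of extreme eigenvalue. Alternating the two steps yields a lower bound on the smallest admissible $1-\lambda$. The function name and the restart and sweep counts are assumptions.

```python
import numpy as np

def infty_expansion_lower_bound(V, restarts=20, sweeps=50, seed=0):
    """Lower bound on max ||sum_j y_j v_j v_j^T||_op * d / s(V) over y ⊥ 1, ||y||_inf <= 1."""
    d, n = V.shape
    s = np.sum(V * V)  # s(V) = ||V||_F^2
    rng = np.random.default_rng(seed)
    best = 0.0
    for _ in range(restarts):
        z = rng.standard_normal(d)
        z /= np.linalg.norm(z)
        for _ in range(sweeps):
            # Best y for this z: +1 on the top half, -1 on the bottom half (0 in the middle if n is odd).
            order = np.argsort((z @ V) ** 2)
            y = np.zeros(n)
            y[order[: n // 2]] = -1.0
            y[order[n - n // 2:]] = 1.0
            # Best z for this y: eigenvector of the eigenvalue with largest magnitude.
            w, U = np.linalg.eigh((V * y) @ V.T)
            k = np.argmax(np.abs(w))
            best = max(best, abs(w[k]))
            z = U[:, k]
    return best * d / s
```

Because this only searches locally, a returned value $c$ shows that the frame cannot satisfy $(1-\lambda)$-$\infty$-expansion for any $1-\lambda < c$. It does not certify that the condition holds.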

3. Pseudorandom Condition

Definition: a frame $V$ is $(\alpha_{\min}, \alpha_{\max}, \beta)$-pseudorandom if for every subset $B$ of size $|B| = \beta n$: $\beta\frac{\alpha_{\min}}{d}I_d \preceq V_BV_B^T \preceq \beta\frac{\alpha_{\max}}{d}I_d$
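The condition quantifies over all subsets of size $\beta n$, which is infeasible to enumerate. A sampled check can still refute a claimed pair $(\alpha_{\min}, \alpha_{\max})$. The sketch below assumes that $V_B$ denotes the columns of $V$ indexed by $B$ and leaves the normalization of $V$ to the caller, since this summary does not state it; the function name is hypothetical.

```python
import numpy as np

def sampled_pseudorandom_range(V, beta, trials=200, seed=0):
    """Extreme eigenvalues of (d / beta) * V_B V_B^T over random subsets B with |B| = beta * n."""
    d, n = V.shape
    k = int(round(beta * n))
    rng = np.random.default_rng(seed)
    lo, hi = np.inf, -np.inf
    for _ in range(trials):
        B = rng.choice(n, size=k, replace=False)
        # eigvalsh returns the eigenvalues in ascending order.
        w = np.linalg.eigvalsh(V[:, B] @ V[:, B].T) * d / beta
        lo, hi = min(lo, w[0]), max(hi, w[-1])
    return lo, hi
```

Any valid $\alpha_{\min}$ is at most the returned `lo` and any valid $\alpha_{\max}$ is at least the returned `hi`. Subsets that were not sampled may be worse, so this is a necessary check only.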

Main Theoretical Results

Theorem 1.1 (sample complexity): if $n \gtrsim \frac{d}{\varepsilon^2}$ and $\varepsilon$ is a small constant, then Tyler's M-estimator satisfies $\|I_d - \Sigma^{1/2}\hat{\Sigma}^{-1}\Sigma^{1/2}\|_{op} \leq \varepsilon$ with probability at least $1 - \exp(-\Omega(\varepsilon^2 n))$.

Theorem 1.2 (algorithmic convergence): if $n \gtrsim d$, then the $T$-th iterate $\Sigma^{(T)}$ of Tyler's iterative procedure satisfies $\|I_d - \hat{\Sigma}^{1/2}(\Sigma^{(T)})^{-1}\hat{\Sigma}^{1/2}\|_F \leq \delta$ within $T \lesssim |\log \det \Sigma| + d + \log(1/\delta)$ steps.