
Bootstrap tests for almost goodness-of-fit

Baíllo, Cárcamo

We introduce the \textit{almost goodness-of-fit} test, a procedure to assess whether a (parametric) model provides a good representation of the probability distribution generating the observed sample. Specifically, given a distribution function $F$ and a parametric family $\mathcal{G}=\{ G(\boldsymbol{\theta}) : \boldsymbol{\theta} \in \Theta\}$, we consider the testing problem \[ H_0: \| F - G(\boldsymbol{\theta}_F) \|_p \geq \epsilon \quad \text{vs} \quad H_1: \| F - G(\boldsymbol{\theta}_F) \|_p < \epsilon, \] where $\epsilon>0$ is a margin of error and $G(\boldsymbol{\theta}_F)$ denotes a representative of $F$ within the parametric class. The approximate model is determined via an M-estimator of the parameters. The objective is the approximate validation of a distribution or an entire parametric family up to a pre-specified threshold value. The methodology also quantifies the percentage improvement of the proposed model relative to a non-informative (constant) benchmark. The test statistic is the $\mathrm{L}^p$-distance between the empirical distribution function and that of the estimated model. We present two consistent, easy-to-implement, and flexible bootstrap schemes to carry out the test. The performance of the proposal is illustrated through simulation studies and real-data applications.


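Bootstrap tests for almost goodness-of-fit

Basic information

Paper ID: 2410.20918
Title: Bootstrap tests for almost goodness-of-fit
Authors: Amparo Baíllo (Autonomous University of Madrid), Javier Cárcamo (University of the Basque Country)
Categories: stat.ME (statistical methodology), math.ST (mathematical statistics), stat.AP (applied statistics), stat.TH (statistics theory)
Published: October 15, 2025 (arXiv preprint)
Paper link: https://arxiv.org/abs/2410.20918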

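Abstract

The paper introduces the almost goodness-of-fit (AGoF) test, which assesses whether a parametric model represents the probability distribution of the observed sample well. Specifically, given a distribution function $F$ and a parametric family $\mathcal{G}=\{G(\theta) : \theta \in \Theta\}$, it considers the testing problem $H_0: \|F - G(\theta_F)\|_p \geq \epsilon \quad \text{vs} \quad H_1: \|F - G(\theta_F)\|_p < \epsilon$, where $\epsilon > 0$ is the margin of error and $G(\theta_F)$ is the representative of $F$ within the parametric family. The approximate model is determined by M-estimation, and two consistent bootstrap schemes are provided to carry out the test.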

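Background and motivation

Problem background

Classical goodness-of-fit tests have a fundamental limitation: because the statement "the model is a reasonable approximation of the data" is placed in the null hypothesis $H_0$, they can only provide statistical evidence of lack of fit, never evidence in favor of the fit itself.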

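Motivation

Limits of classical GoF tests: classical methods can only reject a model; they cannot validate that it is adequate
Practical need: in practice, what matters is whether a model is "good enough", not whether it is exactly correct
Importance of approximate modeling: in reality almost no model describes the data perfectly, so some degree of deviation has to be tolerated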

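Shortcomings of existing methods

The limit distribution of Kolmogorov-Smirnov-type statistics with estimated parameters is complicated and non-Gaussian
The bootstrap is typically inconsistent when estimating the sup-norm
There is no unified framework for the approximate validation of a parametric family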

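Core contributions

The AGoF testing framework: placing "approximate fit" in the alternative hypothesis makes it possible to provide statistical evidence that the model is adequate
Use of the $L^p$ distance: compared with the traditional supremum norm, the $L^p$ norm has better theoretical properties and computational advantages
Two bootstrap schemes: their consistency is proved and practical implementation algorithms are given
The AGoF statistic: it quantifies the percentage improvement of the model relative to a non-informative benchmark
A complete theoretical analysis: theoretical guarantees including asymptotic distributions and bootstrap consistency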

Method details

Task definition

Given a sample $X_1, \ldots, X_n$ from an unknown distribution $F$ and a parametric model family $\mathcal{G} = \{G(\theta) : \theta \in \Theta \subset \mathbb{R}^k\}$, test: $H_0: \|F - G(\theta_F)\|_p \geq \epsilon \quad \text{vs} \quad H_1: \|F - G(\theta_F)\|_p < \epsilon$

Here $\theta_F$ is determined by M-estimation: $E_F[\psi_{\theta_F}(X)] = 0$.

Core method architecture

1. Parameter estimation

The M-estimator $\hat{\theta}_n$ solves: $\Psi_n(\theta) = \frac{1}{n}\sum_{i=1}^n \psi_\theta(X_i) = 0$
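As a minimal sketch of this step, assume an exponential model with rate $\theta$ and take $\psi_\theta(x) = 1/\theta - x$ (the score of that model). Both the model and this choice of $\psi$ are assumptions for illustration, since the summary does not fix them; the function names and the bisection solver are hypothetical and not taken from the paper.

```python
import random

def psi(theta, x):
    """Assumed estimating function: score of the exponential model with rate theta."""
    return 1.0 / theta - x

def Psi_n(theta, sample):
    """Empirical estimating equation Psi_n(theta) = (1/n) * sum_i psi_theta(X_i)."""
    return sum(psi(theta, x) for x in sample) / len(sample)

def solve_m_estimator(sample, lo=1e-8, hi=1e8, iterations=200):
    """Find the root of Psi_n by bisection.

    For this psi, Psi_n is strictly decreasing in theta, positive at lo and
    negative at hi (for any sample with mean between 1e-8 and 1e8).
    """
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if Psi_n(mid, sample) > 0.0:
            lo = mid   # root lies to the right of mid
        else:
            hi = mid   # root lies to the left of (or at) mid
    return (lo + hi) / 2.0

# Simulated data, only for illustration.
rng = random.Random(0)
sample = [rng.expovariate(2.0) for _ in range(500)]
theta_hat = solve_m_estimator(sample)
print(theta_hat)
```

For this particular $\psi$ the root is available in closed form, $\hat{\theta}_n = 1/\bar{X}_n$, which gives a simple check on the numerical solver.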

2. Test statistic

Normalized statistic: $T_n(F,G(\theta_F),p) = \sqrt{n}(\|F_n - G(\hat{\theta}_n)\|_p - \|F - G(\theta_F)\|_p)$
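The only part of $T_n$ that can be computed from data is the empirical distance $\|F_n - G(\hat{\theta}_n)\|_p$. The sketch below approximates it with a midpoint rule on a truncated interval, assuming an exponential model fitted by its rate estimate; the helper names `ecdf` and `lp_distance`, the grid size, and the truncation point are all illustrative choices rather than the paper's implementation.

```python
import bisect
import math
import random

def ecdf(sample):
    """Return the empirical distribution function F_n of the sample."""
    xs = sorted(sample)
    n = len(xs)
    return lambda x: bisect.bisect_right(xs, x) / n

def lp_distance(F, G, lo, hi, p, m=20000):
    """Midpoint-rule approximation of (integral over [lo, hi] of |F - G|^p)^(1/p)."""
    h = (hi - lo) / m
    total = 0.0
    for i in range(m):
        x = lo + (i + 0.5) * h
        total += abs(F(x) - G(x)) ** p
    return (total * h) ** (1.0 / p)

# Illustration on simulated exponential data (assumed setup).
rng = random.Random(0)
sample = [rng.expovariate(2.0) for _ in range(200)]
rate_hat = len(sample) / sum(sample)          # fitted rate of the exponential model
F_n = ecdf(sample)
G_hat = lambda x: 1.0 - math.exp(-rate_hat * x) if x > 0 else 0.0
upper = max(sample) + 20.0 / rate_hat         # the tail beyond this point is negligible
d1 = lp_distance(F_n, G_hat, 0.0, upper, p=1)
d2 = lp_distance(F_n, G_hat, 0.0, upper, p=2)
print(d1, d2)
```

Because $F_n$ is a step function, the integral could also be computed exactly piece by piece when $G$ has a tractable antiderivative; the grid version is kept here because it works unchanged for any $p$ and any model.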

3. Construction of the rejection region

The proposed rejection region is $R_n = \{\|F_n - G(\hat{\theta}_n)\|_p < \epsilon - c_n(\alpha)\}$, where $c_n(\alpha) = -Q_T(\alpha)/\sqrt{n}$ and $Q_T(\alpha)$ is the $\alpha$-quantile of the limit distribution.
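Why this choice of $c_n(\alpha)$ yields an asymptotic level-$\alpha$ test can be sketched as follows. The sketch assumes that $T_n$ converges in distribution to a limit $T$ whose distribution function is continuous at $Q_T(\alpha)$; the symbols $D_n$ and $d$ are shorthand introduced here, not notation from the paper.

```latex
\begin{align*}
D_n &:= \|F_n - G(\hat{\theta}_n)\|_p, \qquad d := \|F - G(\theta_F)\|_p, \\
R_n &= \{ D_n < \epsilon + Q_T(\alpha)/\sqrt{n} \}
     = \{ \sqrt{n}\,(D_n - \epsilon) < Q_T(\alpha) \}, \\
d = \epsilon &\;\Longrightarrow\;
  P(R_n) = P\bigl(T_n < Q_T(\alpha)\bigr) \longrightarrow P\bigl(T < Q_T(\alpha)\bigr) = \alpha, \\
d > \epsilon &\;\Longrightarrow\;
  \sqrt{n}\,(D_n - \epsilon) = T_n + \sqrt{n}\,(d - \epsilon) \to \infty
  \text{ in probability, so } P(R_n) \to 0.
\end{align*}
```

Under these assumptions the boundary case $d = \epsilon$ is the least favorable point of $H_0$, and the rejection probability there tends to $\alpha$.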

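Technical innovations

1. Advantages of choosing the $L^p$ distance

Hadamard differentiability: for $1 < p < \infty$ the $L^p$ norm is Hadamard differentiable, so the functional delta method applies readily
Gaussian limit: under general assumptions, the asymptotic distribution is Gaussian
Bootstrap consistency: under suitable conditions, the standard bootstrap estimator is consistent
Flexibility: adjusting $p$ controls the sensitivity to the tails of the distribution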

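2. Theoretical framework

A complete asymptotic theory is established:

Weak convergence of the empirical process in $L^p$ spaces
Limit distribution of the process with estimated parameters
Consistency of the bootstrap process

For $p = 1$, the limit of $T_n$ is: $T(F,G(\theta_F),1) = \int_{C_{\theta_F}} |G_{\theta_F}| + \int_{\mathbb{R}\setminus C_{\theta_F}} G_{\theta_F}\,\text{sgn}(F-G(\theta_F))$
For $1 < p < \infty$, the limit of $T_n$ is: $T(F,G(\theta_F),p) = \frac{1}{\|F-G(\theta_F)\|_p^{p-1}} \int G_{\theta_F} |F-G(\theta_F)|^{p-1}\,\text{sgn}(F-G(\theta_F))$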

Corollary 1: Normality condition

The limit distribution is normal if and only if:

$p = 1$: the contact set $C_{\theta_F} = \{F = G(\theta_F)\}$ has Lebesgue measure zero
$1 < p < \infty$: $F \neq G(\theta_F)$

Sample sizes: $n = 30, 50, 100, 500$
Bootstrap replicates: $B = 2000$
Significance level: $\alpha = 0.05$
Monte Carlo repetitions: 1000

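Test scenarios

Weibull vs exponential model: $p = 1$, the true distribution is Weibull(2,1)
Gaussian mixture vs normal model: $p = 2$, the true distribution is a two-component Gaussian mixture
Negative binomial vs Poisson model: $p = 1$, discrete case
Kumaraswamy vs Beta model: $p = 1$, bounded support
Student t vs normal model: $p = 4$, heavy-tailed case
Lognormal vs Gamma model: $p = 1$, skewed case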

Two bootstrap methods

Bootstrap 1: quantile-based method; reject $H_0$ when $2\|F_n - G(\hat{\theta}_n)\|_p - \hat{\epsilon}^*(\alpha) < \epsilon$
Bootstrap 2: normal-approximation-based method; reject $H_0$ when $\|F_n - G(\hat{\theta}_n)\|_p - \hat{\sigma}_{\text{boot}}z_\alpha < \epsilon$
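The two rejection rules can be sketched in a few lines under assumptions that go beyond this summary. The sketch takes $\hat{\epsilon}^*(\alpha)$ to be the empirical $\alpha$-quantile of the bootstrap distances, $\hat{\sigma}_{\text{boot}}$ to be their standard deviation, and $z_\alpha$ to be the $\alpha$-quantile of the standard normal distribution. It also assumes the standard nonparametric bootstrap (resampling the data with replacement and re-estimating the parameter), an exponential model, and Weibull data with shape 2 and scale 1. All function names are hypothetical, and $B$ is kept small so the example runs quickly.

```python
import bisect
import math
import random
from statistics import NormalDist, stdev

def lp_dist_exponential(sample, p, m=1000):
    """Midpoint-rule approximation of ||F_n - G(theta_hat)||_p for an exponential fit."""
    xs = sorted(sample)
    n = len(xs)
    rate = n / sum(xs)                  # estimated rate, re-computed for each sample
    hi = xs[-1] + 10.0 / rate           # truncation point; the remaining tail is small
    h = hi / m
    total = 0.0
    for i in range(m):
        x = (i + 0.5) * h
        Fn = bisect.bisect_right(xs, x) / n
        G = 1.0 - math.exp(-rate * x)
        total += abs(Fn - G) ** p
    return (total * h) ** (1.0 / p)

def agof_bootstrap(sample, eps, p=1, alpha=0.05, B=200, seed=0):
    """Return (D_n, decision of Bootstrap 1, decision of Bootstrap 2)."""
    rng = random.Random(seed)
    n = len(sample)
    d_n = lp_dist_exponential(sample, p)
    # Bootstrap distances: resample with replacement, refit, recompute the distance.
    d_star = sorted(lp_dist_exponential(rng.choices(sample, k=n), p) for _ in range(B))
    q_alpha = d_star[max(0, math.ceil(alpha * B) - 1)]   # assumed meaning of eps_hat*(alpha)
    sigma_boot = stdev(d_star)                           # assumed meaning of sigma_hat_boot
    z_alpha = NormalDist().inv_cdf(alpha)                # negative for alpha < 0.5
    reject_1 = 2.0 * d_n - q_alpha < eps
    reject_2 = d_n - sigma_boot * z_alpha < eps
    return d_n, reject_1, reject_2

# Weibull data (scale 1, shape 2) tested against the exponential family.
rng = random.Random(1)
data = [rng.weibullvariate(1.0, 2.0) for _ in range(100)]
print(agof_bootstrap(data, eps=0.5))
```

Under the stated reading of $\hat{\epsilon}^*(\alpha)$, the first rule follows from replacing $Q_T(\alpha)$ by its bootstrap analogue $\sqrt{n}\,(\hat{\epsilon}^*(\alpha) - \|F_n - G(\hat{\theta}_n)\|_p)$ in the rejection region, which rearranges to $2\|F_n - G(\hat{\theta}_n)\|_p - \hat{\epsilon}^*(\alpha) < \epsilon$.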