2025-11-16T12:28:12.323029

Almost sure convergence rates of adaptive increasingly rare Markov chain Monte Carlo

Hofstadler, Latuszynski, Roberts et al.

We consider adaptive increasingly rare Markov chain Monte Carlo (MCMC) algorithms, which are adaptive MCMC methods, where the adaptation concerning the "past'' happens less and less frequently over time. Under a contraction assumption with respect to a Wasserstein-like function we deduce upper bounds of the convergence rate of Monte Carlo sums taking a renormalisation factor into account that is "almost'' the one that appears in a law of the iterated logarithm. We demonstrate the applicability of our results by considering different settings, among which are those of simultaneous geometric and uniform ergodicity. All proofs are carried out on an augmented state space, including the classical non-augmented setting as a special case. In contrast to other adaptive MCMC limit theory, some technical assumptions, like diminishing adaptation, are not needed.

academic

Almost sure convergence rates of adaptive increasingly rare Markov chain Monte Carlo

基本信息

论文ID: 2402.12122
标题: Almost sure convergence rates of adaptive increasingly rare Markov chain Monte Carlo
作者: Julian Hofstadler (University of Bath), Krzysztof Latuszyński (University of Warwick), Gareth O. Roberts (University of Warwick), Daniel Rudolf (University of Passau)
分类: math.NA cs.NA math.PR math.ST stat.TH
发表时间: October 14, 2025 (arXiv版本)
论文链接: https://arxiv.org/abs/2402.12122

摘要

本文研究自适应渐稀Markov链Monte Carlo (AIR MCMC)算法，这是一类自适应MCMC方法，其中对"过去"的自适应随时间推移变得越来越稀少。在关于Wasserstein-like函数的收缩假设下，作者推导出Monte Carlo求和的收敛速度上界，该上界考虑了"几乎"出现在迭代对数律中的重正化因子。论文通过考虑同时几何遍历性和一致遍历性等不同设置来证明结果的适用性。所有证明都在增广状态空间上进行，包括经典非增广设置作为特例。与其他自适应MCMC极限理论相比，不需要一些技术假设，如递减自适应。

研究背景与动机

问题定义

在计算统计学中，一个普遍存在的挑战是近似期望值： $\nu(f) = \int_X f(x)\nu(dx)$ 其中 $\nu$ 是目标分布， $f: X \to \mathbb{R}$ 是感兴趣的可积函数。

研究动机

直接采样困难：当从 $\nu$ 直接采样不可能或计算上不可行时（例如密度包含未知正规化常数），需要替代方法。
自适应MCMC的挑战：传统自适应MCMC方法通过考虑整个历史来更新单步转移机制，导致非Markov过程，使得数学分析复杂化。
技术假设的简化需求：现有自适应MCMC理论通常需要技术性假设（如递减自适应），限制了方法的适用性。

现有方法的局限性

自适应MCMC的非Markov性质导致复杂的证明技术
需要严格的技术条件才能保证收敛性
缺乏关于重正化Monte Carlo求和收敛性的结果

核心贡献

提出AIR MCMC理论框架：在Wasserstein收缩假设下，为AIR算法建立了几乎必然收敛速度理论。
改进的收敛速度：获得了形如 $r(n) = \sqrt{n}(\log n)^{1/2+\varepsilon}$ 或 $r(n) = n^{1/2+\varepsilon}$ 的收敛速度，接近迭代对数律的最优速度。
技术假设的简化：不需要递减自适应等传统技术假设，扩大了方法的适用范围。
增广状态空间分析：在增广状态空间上进行分析，包含经典非增广设置作为特例。
广泛的适用性：结果适用于同时几何遍历性和一致遍历性等多种设置。

方法详解

AIR MCMC算法定义

给定参数 $\beta > 0$ ，设置 $k_j = \lceil j^\beta \rceil$ ，仅在特定时间点进行自适应： $T_m = \sum_{j=1}^m k_j$

关键观察：对于任何 $\beta > 0$ ，存在常数 $c_\beta, C_\beta$ 使得： $c_\beta m^{1+\beta} \leq T_m \leq C_\beta m^{1+\beta}$

这意味着自适应频率递减。

核心技术框架

1. Wasserstein-like函数

对于距离类函数 $d: Y \times Y \to \mathbb{R}_+$ ，定义： $W(\mu_1, \mu_2) := \inf_{\xi \in C(\mu_1,\mu_2)} \int_{Y^2} d(x,y)\xi(dx,dy)$

2. 主要假设（Assumption 3.1）

对每个 $\gamma \in I$ ，假设：

$\pi_\gamma$ 是 $P_\gamma$ 的不变分布
$\tau(P_\gamma) \leq M$ 且 $\tau(P_\gamma^{k_0}) \leq \tau$ 其中 $M \in [1,\infty)$ ， $\tau \in [0,1)$ ， $k_0 \in \mathbb{N}$ 独立于 $\gamma$ 。