2025-11-20T06:13:15.069423

Operation with Concentration Inequalities

Louart

Following the concentration of the measure theory formalism, we consider the transformation $Î¦(Z)$ of a random variable $Z$ having a general concentration function $Î±$. If the transformation $Î¦$ is $Î»$-Lipschitz with $Î»>0$ deterministic, the concentration function of $Î¦(Z)$ is immediately deduced to be equal to $Î±(\cdot/Î»)$. If the variations of $Î¦$ are bounded by a random variable $Î$ having a concentration function (around $0$) $Î²: \mathbb R_+\to \mathbb R$, this paper sets that $Î¦(Z)$ has a concentration function analogous to the so-called parallel product of $Î±$ and $Î²$. With this result at hand (i) we express the concentration of random vectors with independent heavy-tailed entries, (ii) given a transformation $Î¦$ with bounded $k^{\text{th}}$ differential, we express the so-called "multi-level" concentration of $Î¦(Z)$ as a function of $Î±$, and the operator norms of the successive differentials up to the $k^{\text{th}}$ (iii) we obtain a heavy-tailed version of the Hanson-Wright inequality.

academic

Operation with Concentration Inequalities

基本信息

论文ID: 2402.08206
标题: Operation with Concentration Inequalities
作者: Cosme Louart (香港中文大学（深圳）数据科学学院)
分类: math.PR (概率论), math.FA (泛函分析)
发表时间: 2024年2月提交，2025年10月修订版本
论文链接: https://arxiv.org/abs/2402.08206v9

摘要

本文在测度集中理论的框架下，研究随机变量 $Z$ 具有一般集中函数 $\alpha$ 时，其变换 $\Phi(Z)$ 的集中性质。当变换 $\Phi$ 是确定性的 $\lambda$ -Lipschitz函数时， $\Phi(Z)$ 的集中函数为 $\alpha(\cdot/\lambda)$ 。当 $\Phi$ 的变化被具有集中函数 $\beta: \mathbb{R}_+ \to \mathbb{R}$ 的随机变量 $\Lambda$ 界定时，本文证明 $\Phi(Z)$ 具有类似于 $\alpha$ 和 $\beta$ 的"并联乘积"的集中函数。基于此结果，论文：(i) 表达了具有独立重尾分量的随机向量的集中性；(ii) 对于具有有界 $k$ 阶微分的变换 $\Phi$ ，表达了 $\Phi(Z)$ 的"多层次"集中性；(iii) 获得了Hanson-Wright不等式的重尾版本。

研究背景与动机

核心问题

测度集中理论的一个基本结果是：对于高斯随机向量 $Z \sim N(0, I_n)$ 和任何欧几里德范数的1-Lipschitz映射 $f: \mathbb{R}^n \to \mathbb{R}$ ，有： $\forall t \geq 0: P(|f(Z) - E[f(Z)]| > t) \leq 2e^{-t^2/2}$

当变换 $F$ 是 $\lambda$ -Lipschitz时， $F(Z)$ 的集中函数为 $\alpha(\cdot/\lambda)$ 。但当 $\lambda$ 不是常数而是随机变量 $\Lambda(Z)$ 时，如何刻画 $F(Z)$ 的集中性质？

研究重要性

理论完善性: 扩展经典集中不等式到更一般的情形
应用广泛性: 涵盖重尾分布、非Lipschitz泛函等实际场景
技术创新性: 引入并联运算处理随机Lipschitz常数

现有方法局限

经典结果仅适用于确定性Lipschitz常数
重尾分布的集中性质研究不够系统
缺乏统一框架处理多层次集中现象

核心贡献

建立了随机Lipschitz常数下的集中不等式理论框架，将经典结果推广到 $\Lambda$ 为随机变量的情形
引入了最大单调算子的并联运算，提供了处理集中函数运算的数学工具
发展了重尾随机向量的集中理论，系统研究了独立重尾分量向量的集中性质
建立了多层次集中不等式，刻画了具有有界高阶微分函数的集中性
获得了Hanson-Wright不等式的重尾推广，扩展了二次型的集中结果

方法详解

核心理论框架

主要定理

定理0.1: 设 $(E,d)$ , $(E',d')$ 为度量空间， $Z \in E$ 为随机变量， $\Lambda: E \to \mathbb{R}$ 为可测映射。若存在严格递减映射 $\alpha, \beta: \mathbb{R}_+ \to \mathbb{R}_+$ 使得对任何1-Lipschitz映射 $f: E \to \mathbb{R}$ 和 $Z$ 的独立副本 $Z'$ ：

$P(|f(Z) - f(Z')| > t) \leq \alpha(t), \quad P(\Lambda(Z) > t) \leq \beta(t)$

且变换 $\Phi: E \to E'$ 满足： $d'(\Phi(z), \Phi(z')) \leq \max(\Lambda(z), \Lambda(z')) \cdot d(z,z')$

则对任何1-Lipschitz映射 $g: E' \to \mathbb{R}$ ： $P(|g(\Phi(Z)) - g(\Phi(Z'))| > t) \leq 3(\alpha^{-1} \cdot \beta^{-1})^{-1}(t)$

并联运算理论

最大单调算子

论文引入最大单调算子类 $\mathcal{M}$ ，包括：

$\mathcal{M}^{\uparrow}$ : 最大非递减算子类
$\mathcal{M}^{\downarrow}$ : 最大非递增算子类

并联运算定义

对于算子 $f, g: \mathbb{R} \to 2^{\mathbb{R}}$ ：

并联和: $f \boxplus g = (f^{-1} + g^{-1})^{-1}$
并联积: $f \boxminus g = (f^{-1} \cdot g^{-1})^{-1}$

这些运算满足交换律、结合律和分配律。

重尾向量集中理论

指数集中基础

命题2.21: 考虑随机向量 $X = (X_1, \ldots, X_n)$ ，其中 $X_i = \phi_i(Z_i)$ ， $Z_i$ 为独立的双边拉普拉斯随机变量。定义： $h(t) = \sup_{|u-v| \leq t, i \in [n]} \frac{|\phi_i(u) - \phi_i(v)|}{|u-v|}$

对任何1-Lipschitz映射 $f: \mathbb{R}^n \to \mathbb{R}$ ： $P(|f(X) - f(X')| > t) \leq 3CE_1 \circ \min\left((Id \cdot h)^{-1}(2ct), \frac{ct}{2h(\log n)}\right)$

多层次集中理论

微分函数的集中性

定理0.2: 设 $Z \in \mathbb{R}^n$ 满足对任何1-Lipschitz映射 $f$ ： $P(|f(Z) - m_f| > t) \leq \alpha(t)$

对于 $d$ 次可微映射 $\Phi: \mathbb{R}^n \to \mathbb{R}^p$ 和1-Lipschitz映射 $g: \mathbb{R}^p \to \mathbb{R}$ ： $P(|g(\Phi(Z)) - m_g| > t) \leq 2^d \alpha\left(\frac{1}{e}\min_{k \in [d]}\left(\frac{t}{dm_k}\right)^{1/k}\right)$

其中 $m_k$ 为 $\|d^k\Phi|_Z\|$ 的中位数。

实验设置

理论验证

论文主要通过理论分析验证结果，包括：

算子性质验证: 证明并联运算的各种代数性质
集中函数计算: 具体计算各种分布的集中函数
界的紧性分析: 通过构造例子验证界的紧性

应用实例

重尾分布: 考虑密度为 $t \mapsto \frac{q}{2}(1+|t|)^{-1-q}$ 的分布
Hanson-Wright应用: 二次型 $X^TAX$ 的集中性
多项式函数: 具有有界高阶微分的函数类

实验结果

主要理论结果

重尾集中不等式

对于具有 $q$ 阶矩的重尾分布，获得集中率： $P(|f(X) - m_f| \geq t) \leq C\left(\frac{\log^2(1+ct)}{ct}\right)^q$

Hanson-Wright推广

定理2.50: 对于随机矩阵 $X \in M_{p,n}$ 和矩阵 $A \in M_p$ , $B \in M_n$ ： $P(|\text{Tr}(B(X^TAX - E[X^TAX]))| > t) \leq \frac{2}{\alpha(\sigma_\alpha)}\alpha \circ \min\left(\frac{\alpha(\sigma_\alpha)t}{10\|A\|_F\|B\|_F\sigma_\alpha}, \sqrt{\frac{t}{6\|A\|\|B\|}}\right)$

技术创新验证

并联运算的有效性

证明了并联运算能够自然地处理独立随机变量和与积的集中性：

和的集中性: $S_{\sum X_k} \leq n\alpha_1 \boxplus \cdots \boxplus \alpha_n$
积的集中性: $S_{\prod X_k} \leq n\alpha_1 \boxminus \cdots \boxminus \alpha_n$

多层次结构的自然出现

通过递归应用并联运算，自然得到多层次集中函数： $\boxplus_{a_k \in A^{(k)}, k \in [n]} \alpha \circ \left(\frac{Id}{\sigma_1^{(1)} \cdots \sigma_n^{(n)}}\right)^{\frac{1}{1+a_1+\cdots+a_n}}$