2025-11-25T21:58:18.737394

A Principled Approach to Bayesian Transfer Learning

Bretherton, Bon, Warne et al.

Updating $\textit{a priori}$ information given some observed data is the core tenet of Bayesian inference. Bayesian transfer learning extends this idea by incorporating information from a related dataset to improve the inference on the observed target dataset which may have been collected under slightly different settings. The use of related information can be useful when the target dataset is scarce, for example. There exist various Bayesian transfer learning methods that decide how to incorporate the related data in different ways. Unfortunately, there is no principled approach for comparing Bayesian transfer methods in real data settings. Additionally, some Bayesian transfer learning methods, such as the so-called power prior approaches, rely on conjugacy or costly specialised techniques. In this paper, we find an effective approach to compare Bayesian transfer learning methods is to apply leave-one-out cross validation on the target dataset. Further, we introduce a new framework, $\textit{transfer sequential Monte Carlo}$, that efficiently implements power prior methods in an automated fashion. We demonstrate the performance of our proposed methods in two comprehensive simulation studies.

academic

A Principled Approach to Bayesian Transfer Learning

基本信息

论文ID: 2502.19796
标题: A Principled Approach to Bayesian Transfer Learning
作者: Adam Bretherton, Joshua J. Bon, David J. Warne, Kerrie Mengersen, Christopher Drovandi
分类: stat.ME (Statistics - Methodology), stat.CO (Statistics - Computation)
发表时间: 2025年10月14日 (arXiv v3)
论文链接: https://arxiv.org/abs/2502.19796v3

摘要

本文研究贝叶斯迁移学习的原则性方法。贝叶斯推断的核心是基于观测数据更新先验信息，而贝叶斯迁移学习扩展了这一思想，通过整合相关数据集的信息来改善对目标数据集的推断。当目标数据集稀缺时，相关信息的使用特别有价值。现有的贝叶斯迁移学习方法在如何整合相关数据方面采用不同策略，但缺乏在真实数据环境中比较这些方法的原则性方法。此外，一些方法（如power prior方法）依赖于共轭性或昂贵的专门技术。本文发现留一法交叉验证是比较贝叶斯迁移学习方法的有效途径，并提出了迁移序列蒙特卡洛（TSMC）框架，能够自动化高效实现power prior方法。

研究背景与动机

问题定义

贝叶斯迁移学习旨在解决如何有效利用相关源数据来改善对目标数据的推断问题。在实际应用中，目标数据往往稀缺且昂贵，而相关的历史数据或类似研究的数据可能丰富但与目标数据存在一定差异。

问题重要性

数据稀缺性：在流行病学、临床试验等领域，新数据获取成本高昂且耗时
信息利用效率：完全丢弃相关源数据是低效的，但直接合并可能引入偏差
实用性需求：需要在不同程度的数据相似性下做出合理的迁移决策

现有方法局限性

缺乏比较标准：没有原则性方法在真实数据环境中比较不同迁移学习方法的性能
计算复杂性：power prior方法依赖共轭先验或专门的MCMC技术，计算成本高
参数选择困难：固定power prior需要网格搜索，归一化power prior存在双重难解性问题

研究动机

本文旨在提供一个统一的框架来：

建立比较贝叶斯迁移学习方法的原则性标准
开发计算高效的power prior实现方法
在不需要真实参数值的情况下评估方法性能

核心贡献

提出后验预测检验框架：使用留一法交叉验证（LOO-CV）作为在真实数据环境中比较贝叶斯迁移学习方法的原则性标准
开发TSMC计算框架：提出迁移序列蒙特卡洛方法，能够同时高效实现固定power prior（FPP）和归一化power prior（NPP）
解决双重难解性问题：通过巧妙的分解策略克服NPP中参数依赖归一化常数的计算挑战
提供系统性评估：在两个综合仿真研究中验证了所提方法的有效性

方法详解

任务定义

给定目标数据集 $y_T$ （大小为 $n$ ）和相关源数据集 $y_S$ （大小为 $m$ ，其中 $n < m$ ），目标是利用源数据改善对目标数据的贝叶斯推断，同时避免源数据与目标数据差异带来的负面影响。

Power Prior方法

基本形式

Power prior通过调节参数 $\alpha \in (0,1)$ 来控制源数据的影响：

$\pi(\theta|y_S, \alpha) = \frac{p(y_S|\theta)^\alpha \pi(\theta)}{C_S(\alpha)}$

其中 $C_S(\alpha)$ 是归一化常数。目标后验为：

$\pi(\theta|y_T, y_S, \alpha) = \frac{p(y_T|\theta)p(y_S|\theta)^\alpha \pi(\theta)}{C_{T,S}(\alpha)}$

两种变体

固定Power Prior (FPP)： $\alpha$ 为固定值，通过模型选择准则确定
归一化Power Prior (NPP)： $\alpha$ 为随机变量，赋予先验分布 $\alpha \sim \text{Beta}(\alpha_0, \beta_0)$

迁移序列蒙特卡洛（TSMC）框架

核心思想

利用分解关系 $C_T(\alpha) = \frac{C_{T,S}(\alpha)}{C_S(\alpha)}$ 来间接估计归一化常数，避免直接计算的困难。

双调度SMC算法

调度1：估计 $C_S(\alpha)$

目标分布： $\pi_{t,S}(\theta|y_S, \alpha_t) \propto p(y_S|\theta)^{\alpha_t}\pi(\theta)$
逆温度序列： $0 = \alpha_0 < \alpha_1 < \cdots < \alpha_T = 1$

调度2：估计 $C_{T,S}(\alpha)$

目标分布： $\pi_{t,TSMC}(\theta|y_S, y_T, \gamma_t, \alpha_t) \propto p(y_T|\theta)^{\gamma_t}p(y_S|\theta)^{\alpha_t}\pi(\theta)$
两阶段设计：先用 $\gamma$ 整合目标数据，再用 $\alpha$ 整合源数据