2025-11-11T11:34:09.241880

LUME-DBN: Full Bayesian Learning of DBNs from Incomplete data in Intensive Care

Pirola, Stella, Grzegorczyk

Dynamic Bayesian networks (DBNs) are increasingly used in healthcare due to their ability to model complex temporal relationships in patient data while maintaining interpretability, an essential feature for clinical decision-making. However, existing approaches to handling missing data in longitudinal clinical datasets are largely derived from static Bayesian networks literature, failing to properly account for the temporal nature of the data. This gap limits the ability to quantify uncertainty over time, which is particularly critical in settings such as intensive care, where understanding the temporal dynamics is fundamental for model trustworthiness and applicability across diverse patient groups. Despite the potential of DBNs, a full Bayesian framework that integrates missing data handling remains underdeveloped. In this work, we propose a novel Gibbs sampling-based method for learning DBNs from incomplete data. Our method treats each missing value as an unknown parameter following a Gaussian distribution. At each iteration, the unobserved values are sampled from their full conditional distributions, allowing for principled imputation and uncertainty estimation. We evaluate our method on both simulated datasets and real-world intensive care data from critically ill patients. Compared to standard model-agnostic techniques such as MICE, our Bayesian approach demonstrates superior reconstruction accuracy and convergence properties. These results highlight the clinical relevance of incorporating full Bayesian inference in temporal models, providing more reliable imputations and offering deeper insight into model behavior. Our approach supports safer and more informed clinical decision-making, particularly in settings where missing data are frequent and potentially impactful.

academic

LUME-DBN: Full Bayesian Learning of DBNs from Incomplete data in Intensive Care

基本信息

论文ID: 2511.04333
标题: LUME-DBN: Full Bayesian Learning of DBNs from Incomplete data in Intensive Care
作者: Federico Pirola (University of Milano-Bicocca), Fabio Stella (University of Milano-Bicocca), Marco Grzegorczyk (University of Groningen)
分类: cs.LG (Machine Learning), cs.AI (Artificial Intelligence)
发表时间: 2025年11月6日 (arXiv预印本)
论文链接: https://arxiv.org/abs/2511.04333

摘要

动态贝叶斯网络(DBNs)在医疗保健领域应用日益广泛，因其能够建模患者数据中复杂的时间关系，同时保持可解释性——这是临床决策的重要特征。然而，现有处理纵向临床数据集缺失值的方法主要来源于静态贝叶斯网络文献，未能恰当考虑数据的时间性质。这一差距限制了对时间不确定性的量化能力，在重症监护等场景中尤为关键，理解时间动态对模型可信度和跨不同患者群体的适用性至关重要。本文提出了一种基于Gibbs采样的新方法来从不完整数据中学习DBNs，将每个缺失值视为遵循高斯分布的未知参数，通过全条件分布采样实现有原则的插补和不确定性估计。

研究背景与动机

核心问题

本研究要解决的核心问题是如何在存在大量缺失数据的情况下，有效学习动态贝叶斯网络，特别是在重症监护环境中的应用。

问题重要性

临床紧迫性: 在ICU中，及时准确评估患者病情演变对指导干预措施至关重要
数据质量挑战: ICU数据经常受到缺失值、不规则采样和测量偏差的困扰
不确定性量化: 传统方法无法充分考虑缺失性引入的不确定性，可能导致参数估计偏差

现有方法局限性

静态方法的时间盲区: 现有缺失数据处理方法主要源于静态贝叶斯网络，未考虑时间性质
频率派方法的不足: 传统插补或频率派方法可能无法充分考虑缺失性引入的不确定性
局部最优问题: 结构期望最大化(SEM)算法等方法容易收敛到局部最优解

研究动机

开发一个完全贝叶斯框架，能够同时处理网络结构、参数和缺失值的不确定性，为临床决策提供更可靠的支持。

核心贡献

理论贡献: 推导了DBN中缺失值的全条件分布(FCDs)的闭式解，证明了其可处理性
方法创新: 提出LUME-DBN算法，结合Gibbs采样进行缺失数据插补与MCMC结构学习
实验验证: 在模拟数据和真实ICU数据上验证了方法的有效性，相比MICE等方法显示出优越的重构准确性
临床应用: 在PhysioNet 2012数据集上展示了方法在不同ICU类型中发现的有意义的时间关系

方法详解

任务定义

输入: 包含缺失值的多变量时间序列数据 $D \in \mathbb{R}^{N \times k \times (T+1)}$ ，其中 $N$ 为样本数， $k$ 为变量数， $T+1$ 为时间点数

输出: DBN结构、参数和缺失值的后验分布样本

约束: 假设一阶马尔可夫性质和无瞬时效应

模型架构

DBN基础框架

DBN被建模为 $k$ 个独立的贝叶斯线性回归(BLR)模型：

$x_i^t = \beta_0^{(i)} + \sum_{j:(X_j^{t-1} \in \pi(i))} \beta_j^{(i)} x_j^{t-1} + \epsilon_i^t$

其中 $\pi(i)$ 表示变量 $X_i$ 的父节点集合， $\epsilon_i^t \sim N(0, \sigma^2_{(i)})$ 。

先验分布设定

回归系数： $\beta^{(i)} \sim N(\mu^{(i)}, \sigma^2_{(i)}\delta^2_{(i)}I)$
噪声参数： $\sigma^2_{(i)} \sim \text{Inv-Gamma}(a, b)$
不确定性参数： $\delta^2_{(i)} \sim \text{Inv-Gamma}(\alpha_\delta, \beta_\delta)$
父节点集合大小： $|\pi(i)| \sim \text{Poisson}(\lambda)$

缺失值的全条件分布

对于时刻 $t$ 变量 $X_i$ 的缺失值 $x_i^t[MIS]$ ，其FCD为：

$P(x_i^t[MIS] | \cdot) = N(\mu_*, \sigma^2_*)$

其中： $\sigma^2_* = \left(\frac{1}{\sigma^2_{(i)}} + \sum_{j:(X_i^t \in \pi(j))} \frac{(\beta_i^{(j)})^2}{\sigma^2_{(j)}}\right)^{-1}$

$\mu_* = \sigma^2_* \cdot \left(\frac{\mu_i^t}{\sigma^2_{(i)}} + \sum_{j:(X_i^t \in \pi(j))} \frac{\beta_i^{(j)}(x_j^{t+1} - \mu_{{\{-i\}}}^{(j)(t+1)})}{\sigma^2_{(j)}}\right)$