2025-11-11T11:34:09.241880

LUME-DBN: Full Bayesian Learning of DBNs from Incomplete data in Intensive Care

Pirola, Stella, Grzegorczyk

Dynamic Bayesian networks (DBNs) are increasingly used in healthcare due to their ability to model complex temporal relationships in patient data while maintaining interpretability, an essential feature for clinical decision-making. However, existing approaches to handling missing data in longitudinal clinical datasets are largely derived from static Bayesian networks literature, failing to properly account for the temporal nature of the data. This gap limits the ability to quantify uncertainty over time, which is particularly critical in settings such as intensive care, where understanding the temporal dynamics is fundamental for model trustworthiness and applicability across diverse patient groups. Despite the potential of DBNs, a full Bayesian framework that integrates missing data handling remains underdeveloped. In this work, we propose a novel Gibbs sampling-based method for learning DBNs from incomplete data. Our method treats each missing value as an unknown parameter following a Gaussian distribution. At each iteration, the unobserved values are sampled from their full conditional distributions, allowing for principled imputation and uncertainty estimation. We evaluate our method on both simulated datasets and real-world intensive care data from critically ill patients. Compared to standard model-agnostic techniques such as MICE, our Bayesian approach demonstrates superior reconstruction accuracy and convergence properties. These results highlight the clinical relevance of incorporating full Bayesian inference in temporal models, providing more reliable imputations and offering deeper insight into model behavior. Our approach supports safer and more informed clinical decision-making, particularly in settings where missing data are frequent and potentially impactful.

academic

LUME-DBN: 집중치료에서 불완전 데이터로부터의 DBN 완전 베이지안 학습

기본 정보

논문 ID: 2511.04333
제목: LUME-DBN: Full Bayesian Learning of DBNs from Incomplete data in Intensive Care
저자: Federico Pirola (University of Milano-Bicocca), Fabio Stella (University of Milano-Bicocca), Marco Grzegorczyk (University of Groningen)
분류: cs.LG (기계학습), cs.AI (인공지능)
발표 시간: 2025년 11월 6일 (arXiv 사전인쇄본)
논문 링크: https://arxiv.org/abs/2511.04333

초록

동적 베이지안 네트워크(DBN)는 의료 분야에서 환자 데이터의 복잡한 시간적 관계를 모델링하면서 해석 가능성을 유지할 수 있어 임상 의사결정에 중요한 특징으로 인해 점점 더 널리 사용되고 있습니다. 그러나 종단 임상 데이터 세트의 결측값을 처리하는 기존 방법은 주로 정적 베이지안 네트워크 문헌에서 비롯되었으며, 데이터의 시간적 특성을 적절히 고려하지 못합니다. 이러한 격차는 시간적 불확실성의 정량화 능력을 제한하며, 중환자실과 같은 시나리오에서 특히 중요합니다. 여기서 시간 역학을 이해하는 것이 모델 신뢰도와 다양한 환자 집단 간의 적용 가능성에 매우 중요합니다. 본 논문은 불완전 데이터로부터 DBN을 학습하기 위한 깁스 샘플링 기반의 새로운 방법을 제안하며, 각 결측값을 가우스 분포를 따르는 미지의 매개변수로 취급하고, 전체 조건부 분포 샘플링을 통해 원칙적인 대체 및 불확실성 추정을 구현합니다.

연구 배경 및 동기

핵심 문제

본 연구가 해결하고자 하는 핵심 문제는 대량의 결측 데이터가 존재하는 상황에서 동적 베이지안 네트워크를 효과적으로 학습하는 방법이며, 특히 중환자실 환경에서의 응용입니다.

문제의 중요성

임상적 긴급성: ICU에서 환자 병상 진행 상황을 적시에 정확하게 평가하는 것이 중재 조치를 지도하는 데 매우 중요합니다
데이터 품질 도전: ICU 데이터는 종종 결측값, 불규칙한 샘플링 및 측정 편향으로 인해 어려움을 겪습니다
불확실성 정량화: 기존 방법은 결측으로 인한 불확실성을 충분히 고려하지 못하여 매개변수 추정 편향을 초래할 수 있습니다

기존 방법의 한계

정적 방법의 시간적 맹점: 기존 결측 데이터 처리 방법은 주로 정적 베이지안 네트워크에서 비롯되었으며 시간적 특성을 고려하지 않습니다
빈도주의 방법의 부족: 전통적인 대체 또는 빈도주의 방법은 결측으로 인한 불확실성을 충분히 고려하지 못할 수 있습니다
국소 최적 문제: 구조 기댓값 최대화(SEM) 알고리즘 등의 방법은 국소 최적해로 수렴하기 쉽습니다

연구 동기

네트워크 구조, 매개변수 및 결측값의 불확실성을 동시에 처리할 수 있는 완전 베이지안 프레임워크를 개발하여 임상 의사결정을 위한 더욱 신뢰할 수 있는 지원을 제공합니다.

핵심 기여

이론적 기여: DBN의 결측값에 대한 전체 조건부 분포(FCD)의 폐쇄형 해를 도출하고 그 처리 가능성을 증명했습니다
방법론적 혁신: LUME-DBN 알고리즘을 제안하며, 결측 데이터 대체를 위한 깁스 샘플링과 MCMC 구조 학습을 결합합니다
실험적 검증: 모의 데이터 및 실제 ICU 데이터에서 방법의 유효성을 검증했으며, MICE 등의 방법과 비교하여 우수한 재구성 정확도를 보여줍니다
임상 응용: PhysioNet 2012 데이터 세트에서 다양한 ICU 유형에서 발견된 의미 있는 시간적 관계를 시연합니다

방법론 상세 설명

작업 정의

입력: 결측값을 포함하는 다변량 시계열 데이터 $D \in \mathbb{R}^{N \times k \times (T+1)}$ , 여기서 $N$ 은 샘플 수, $k$ 는 변수 수, $T+1$ 은 시간점 수입니다

출력: DBN 구조, 매개변수 및 결측값의 사후 분포 샘플

제약: 1차 마르코프 성질 및 순간 효과 없음을 가정합니다

모델 아키텍처

DBN 기본 프레임워크

DBN은 $k$ 개의 독립적인 베이지안 선형 회귀(BLR) 모델로 모델링됩니다:

$x_i^t = \beta_0^{(i)} + \sum_{j:(X_j^{t-1} \in \pi(i))} \beta_j^{(i)} x_j^{t-1} + \epsilon_i^t$

여기서 $\pi(i)$ 는 변수 $X_i$ 의 부모 노드 집합을 나타내고, $\epsilon_i^t \sim N(0, \sigma^2_{(i)})$ 입니다.

사전 분포 설정

회귀 계수: $\beta^{(i)} \sim N(\mu^{(i)}, \sigma^2_{(i)}\delta^2_{(i)}I)$
잡음 매개변수: $\sigma^2_{(i)} \sim \text{Inv-Gamma}(a, b)$
불확실성 매개변수: $\delta^2_{(i)} \sim \text{Inv-Gamma}(\alpha_\delta, \beta_\delta)$
부모 노드 집합 크기: $|\pi(i)| \sim \text{Poisson}(\lambda)$

결측값의 전체 조건부 분포

시간 $t$ 에서 변수 $X_i$ 의 결측값 $x_i^t[MIS]$ 에 대해, 그 FCD는:

$P(x_i^t[MIS] | \cdot) = N(\mu_*, \sigma^2_*)$

여기서: $\sigma^2_* = \left(\frac{1}{\sigma^2_{(i)}} + \sum_{j:(X_i^t \in \pi(j))} \frac{(\beta_i^{(j)})^2}{\sigma^2_{(j)}}\right)^{-1}$

$\mu_* = \sigma^2_* \cdot \left(\frac{\mu_i^t}{\sigma^2_{(i)}} + \sum_{j:(X_i^t \in \pi(j))} \frac{\beta_i^{(j)}(x_j^{t+1} - \mu_{{\{-i\}}}^{(j)(t+1)})}{\sigma^2_{(j)}}\right)$