Dynamic Bayesian networks (DBNs) are increasingly used in healthcare due to their ability to model complex temporal relationships in patient data while maintaining interpretability, an essential feature for clinical decision-making. However, existing approaches to handling missing data in longitudinal clinical datasets are largely derived from static Bayesian networks literature, failing to properly account for the temporal nature of the data. This gap limits the ability to quantify uncertainty over time, which is particularly critical in settings such as intensive care, where understanding the temporal dynamics is fundamental for model trustworthiness and applicability across diverse patient groups. Despite the potential of DBNs, a full Bayesian framework that integrates missing data handling remains underdeveloped. In this work, we propose a novel Gibbs sampling-based method for learning DBNs from incomplete data. Our method treats each missing value as an unknown parameter following a Gaussian distribution. At each iteration, the unobserved values are sampled from their full conditional distributions, allowing for principled imputation and uncertainty estimation. We evaluate our method on both simulated datasets and real-world intensive care data from critically ill patients. Compared to standard model-agnostic techniques such as MICE, our Bayesian approach demonstrates superior reconstruction accuracy and convergence properties. These results highlight the clinical relevance of incorporating full Bayesian inference in temporal models, providing more reliable imputations and offering deeper insight into model behavior. Our approach supports safer and more informed clinical decision-making, particularly in settings where missing data are frequent and potentially impactful.
- Paper ID: 2511.04333
- Title: LUME-DBN: Full Bayesian Learning of DBNs from Incomplete data in Intensive Care
- Authors: Federico Pirola (University of Milano-Bicocca), Fabio Stella (University of Milano-Bicocca), Marco Grzegorczyk (University of Groningen)
- Classification: cs.LG (Machine Learning), cs.AI (Artificial Intelligence)
- Publication Date: November 6, 2025 (arXiv preprint)
- Paper Link: https://arxiv.org/abs/2511.04333
Dynamic Bayesian networks (DBNs) are increasingly applied in healthcare because they model complex temporal relationships in patient data while remaining interpretable, a critical feature for clinical decision-making. However, existing methods for handling missing values in longitudinal clinical datasets derive primarily from the static Bayesian network literature and do not account for the temporal nature of the data. This gap limits the quantification of uncertainty over time, which is particularly important in intensive care, where understanding temporal dynamics is essential for model credibility and for applicability across patient populations. The paper proposes a Gibbs sampling-based approach for learning DBNs from incomplete data. Each missing value is treated as an unknown parameter with a Gaussian distribution and is sampled from its full conditional distribution, which enables principled imputation and uncertainty estimation.
The core problem addressed by this research is how to effectively learn dynamic Bayesian networks in the presence of substantial missing data, particularly in intensive care unit (ICU) applications.
- Clinical Urgency: In ICUs, timely and accurate assessment of patient condition evolution is crucial for guiding intervention measures
- Data Quality Challenges: ICU data frequently suffers from missing values, irregular sampling, and measurement bias
- Uncertainty Quantification: Traditional methods fail to adequately account for uncertainty introduced by missingness, potentially leading to biased parameter estimates
- Temporal Blindness of Static Methods: Existing missing data handling methods primarily originate from static Bayesian networks and do not account for temporal properties
- Insufficiency of Frequentist Approaches: Traditional imputation or frequentist methods may inadequately consider uncertainty introduced by missingness
- Local Optimality Problem: Structural expectation-maximization (SEM) algorithms and similar methods are prone to converging to local optima
The goal is to develop a fully Bayesian framework that simultaneously handles uncertainty in network structure, parameters, and missing values, providing more reliable support for clinical decision-making.
- Theoretical Contribution: Derives closed-form solutions for full conditional distributions (FCDs) of missing values in DBNs, proving their tractability
- Methodological Innovation: Proposes the LUME-DBN algorithm, combining Gibbs sampling for missing data imputation with MCMC structure learning
- Experimental Validation: Validates the method's effectiveness on simulated and real ICU data, demonstrating superior reconstruction accuracy compared to methods such as MICE
- Clinical Application: Demonstrates the discovery of meaningful temporal relationships in different ICU types using the PhysioNet 2012 dataset
Input: Multivariate time series data with missing values $D \in \mathbb{R}^{N \times k \times (T+1)}$, where $N$ is the number of samples, $k$ is the number of variables, and $T+1$ is the number of time points
Output: Posterior distribution samples of DBN structure, parameters, and missing values
Constraints: Assumes a first-order Markov property and no instantaneous (within-time-slice) effects
The DBN is modeled as k independent Bayesian linear regression (BLR) models:
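$$x_{i,t} = \beta_0^{(i)} + \sum_{j:\, X_{j,t-1} \in \pi(i)} \beta_j^{(i)}\, x_{j,t-1} + \epsilon_{i,t}$$
where $\pi(i)$ denotes the set of parent nodes of variable $X_i$, and $\epsilon_{i,t} \sim \mathcal{N}(0, \sigma_{(i)}^2)$.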
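- Regression coefficients: $\boldsymbol{\beta}^{(i)} \sim \mathcal{N}(\boldsymbol{\mu}^{(i)}, \sigma_{(i)}^2 \delta_{(i)}^2 I)$
- Noise parameters: $\sigma_{(i)}^2 \sim \text{Inv-Gamma}(a, b)$
- Uncertainty parameters: $\delta_{(i)}^2 \sim \text{Inv-Gamma}(\alpha_\delta, \beta_\delta)$
- Parent set size: $|\pi(i)| \sim \text{Poisson}(\lambda)$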
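To make the prior specification concrete, here is a minimal sketch of drawing one node's parent set and regression parameters from these priors with NumPy. The hyperparameter values (all set to 1), the zero prior mean, and the cap on the parent-set size are illustrative assumptions rather than settings confirmed by the paper, and the function name `sample_node_prior` is hypothetical.

```python
import numpy as np

def sample_node_prior(n_candidates, rng, lam=1.0, a=1.0, b=1.0,
                      alpha_delta=1.0, beta_delta=1.0, max_parents=None):
    """Draw a parent set and regression parameters for one node from the priors."""
    # Parent-set size ~ Poisson(lam), capped at the number of candidate parents
    # (the cap is an assumption of this sketch).
    limit = n_candidates if max_parents is None else min(max_parents, n_candidates)
    size = min(int(rng.poisson(lam)), limit)
    parents = np.sort(rng.choice(n_candidates, size=size, replace=False))
    # If G ~ Gamma(shape, rate), then 1/G ~ Inv-Gamma(shape, scale=rate).
    # NumPy parameterizes the gamma by scale = 1 / rate.
    sigma2 = 1.0 / rng.gamma(shape=a, scale=1.0 / b)
    delta2 = 1.0 / rng.gamma(shape=alpha_delta, scale=1.0 / beta_delta)
    # Intercept plus one coefficient per parent, with prior mean 0 (assumption)
    # and covariance sigma2 * delta2 * I.
    beta = rng.normal(0.0, np.sqrt(sigma2 * delta2), size=size + 1)
    return parents, beta, sigma2, delta2

rng = np.random.default_rng(0)
parents, beta, sigma2, delta2 = sample_node_prior(n_candidates=10, rng=rng)
```

Because the coefficient variance is scaled by both the noise variance and the uncertainty parameter, a large draw of either one widens the range of plausible coefficients.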
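For a missing value $x_{i,t}^{[\mathrm{MIS}]}$ of variable $X_i$ at time $t$, its FCD is:
$$P\left(x_{i,t}^{[\mathrm{MIS}]} \mid \cdot\right) = \mathcal{N}(\mu_*, \sigma_*^2)$$
where:
$$\sigma_*^2 = \left( \frac{1}{\sigma_{(i)}^2} + \sum_{j:\, X_{i,t} \in \pi(j)} \frac{\left(\beta_i^{(j)}\right)^2}{\sigma_{(j)}^2} \right)^{-1}$$
$$\mu_* = \sigma_*^2 \cdot \left( \frac{\mu_{i,t}}{\sigma_{(i)}^2} + \sum_{j:\, X_{i,t} \in \pi(j)} \frac{\beta_i^{(j)} \left( x_{j,t+1} - \mu_{-i}^{(j)}(t+1) \right)}{\sigma_{(j)}^2} \right)$$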
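The full conditional is a precision-weighted Gaussian update: the node's own regression contributes one term, and every child at time t+1 that has the missing value as a parent contributes another. The sketch below computes the mean and variance for a single missing entry and draws one Gibbs sample. It reads the own-regression mean as the prediction of the missing value from its parents, and the child residual as the child's observed value minus its prediction with the missing value's contribution removed; that reading of the notation, the function name `missing_value_fcd`, and the numeric inputs are assumptions made for illustration.

```python
import numpy as np

def missing_value_fcd(mu_it, sigma2_i, child_betas, child_sigma2s, child_residuals):
    """Mean and variance of the Gaussian full conditional of one missing value.

    mu_it           : prediction of the missing value from its own parents
    sigma2_i        : noise variance of the node with the missing value
    child_betas     : coefficient of the missing value in each child's regression
    child_sigma2s   : noise variance of each child
    child_residuals : child value at t+1 minus its prediction without this parent
    """
    child_betas = np.asarray(child_betas, dtype=float)
    child_sigma2s = np.asarray(child_sigma2s, dtype=float)
    child_residuals = np.asarray(child_residuals, dtype=float)
    # Precisions add: one term from the node itself, one per child.
    precision = 1.0 / sigma2_i + np.sum(child_betas**2 / child_sigma2s)
    var_star = 1.0 / precision
    mean_star = var_star * (
        mu_it / sigma2_i + np.sum(child_betas * child_residuals / child_sigma2s)
    )
    return mean_star, var_star

rng = np.random.default_rng(0)
mean_star, var_star = missing_value_fcd(
    mu_it=0.5, sigma2_i=1.0,
    child_betas=[0.4, 0.7], child_sigma2s=[1.0, 2.0], child_residuals=[1.2, -0.3],
)
draw = rng.normal(mean_star, np.sqrt(var_star))  # one Gibbs draw for the entry
```

When the variable has no children at time t+1, both sums are empty and the function returns the node's own regression mean and noise variance, so the imputation falls back to the node's own regression.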
- Unified Imputation Strategy: Designs Gibbs steps for jointly updating missing values across all regression models
- Closed-Form Solution Derivation: Proves the tractability of missing value FCDs, enabling efficient MCMC inference
- Temporal Invariance: The FCD structure exhibits temporal invariance relative to DBN parameters, improving computational efficiency
- Escaping Local Optima: MCMC sampling enables escape from local minima, achieving more accurate network reconstruction
- Structure: 10 independent 10-node DBN structures, with each node having at most 5 parents
- Temporal Length: $T \in \{50, 100, 200\}$
- Missing Rates: {10%,20%,30%,40%}
- Parameter Settings: Regression coefficients sampled from $\mathrm{Uniform}[0.2, 0.8]$, noise variance $\sigma^2 = 1$
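A simulation along these lines might be sketched as follows. Several details are assumptions that the summary does not specify: the number of parents per node is drawn uniformly from 1 to 5, intercepts are zero, a single series is generated, and the coefficient matrix is optionally rescaled so the process stays bounded (that rescaling is not part of the described setup and can move coefficients outside the stated range). Missing entries are removed completely at random.

```python
import numpy as np

def simulate_dbn(k=10, T=50, max_parents=5, missing_rate=0.2, rescale=True, seed=0):
    """Simulate one series of shape (T + 1, k) and an MCAR-masked copy."""
    rng = np.random.default_rng(seed)
    B = np.zeros((k, k))  # B[i, j]: effect of variable j at t-1 on variable i at t
    for i in range(k):
        n_par = int(rng.integers(1, max_parents + 1))  # assumption: 1..max_parents
        parents = rng.choice(k, size=n_par, replace=False)
        B[i, parents] = rng.uniform(0.2, 0.8, size=n_par)
    if rescale:
        # Assumption (not from the paper): shrink B so the series does not explode.
        radius = np.max(np.abs(np.linalg.eigvals(B)))
        if radius >= 1.0:
            B *= 0.9 / radius
    X = np.zeros((T + 1, k))
    X[0] = rng.normal(size=k)
    for t in range(1, T + 1):
        X[t] = B @ X[t - 1] + rng.normal(size=k)  # unit noise variance
    mask = rng.random(X.shape) < missing_rate     # MCAR: independent of the values
    X_obs = X.copy()
    X_obs[mask] = np.nan
    return B, X, X_obs, mask

B, X, X_obs, mask = simulate_dbn(T=50, missing_rate=0.2)
```

Keeping the complete series `X` alongside the masked copy `X_obs` allows both the complete-data reference and the imputation error to be computed from the same run.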
- Data Source: PhysioNet 2012 Challenge dataset
- Patient Count: 20,000+ adult ICU patients
- Time Window: First 48 hours of ICU admission
- Variable Count: 11 clinical variables (vital signs, blood indicators, physiological characteristics)
- ICU Grouping: MICU (34 cases), SICU (104 cases), CCU (114 cases), CSRU (62 cases)
- Structure Reconstruction: Area under the precision-recall curve (AUC-PR)
- Convergence Diagnosis: Potential scale reduction factor (PSRF < 1.1)
- Statistical Significance: Paired t-test
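Both diagnostics can be computed with a few lines of NumPy. The sketch below uses average precision as one common estimator of the area under the precision-recall curve and the classic Gelman-Rubin formula for the PSRF of a scalar quantity. How the paper integrates the curve and which quantities it monitors across chains are not stated here, so both choices are assumptions, and the function names are hypothetical.

```python
import numpy as np

def average_precision(scores, truth):
    """Average precision of edge scores against a binary ground-truth adjacency."""
    scores = np.asarray(scores, dtype=float).ravel()
    truth = np.asarray(truth, dtype=bool).ravel()
    order = np.argsort(-scores, kind="stable")   # rank edges by decreasing score
    hits = truth[order]
    precision_at_rank = np.cumsum(hits) / np.arange(1, hits.size + 1)
    # Average the precision at the ranks where a true edge is recovered.
    return float(precision_at_rank[hits].sum() / hits.sum())

def psrf(chains):
    """Gelman-Rubin potential scale reduction factor for one scalar quantity.

    chains: array of shape (m, n), m independent chains with n retained draws.
    """
    chains = np.asarray(chains, dtype=float)
    m, n = chains.shape
    W = chains.var(axis=1, ddof=1).mean()        # mean within-chain variance
    B = n * chains.mean(axis=1).var(ddof=1)      # between-chain variance
    var_hat = (n - 1) / n * W + B / n            # pooled variance estimate
    return float(np.sqrt(var_hat / W))

rng = np.random.default_rng(0)
well_mixed = rng.normal(size=(4, 1000))
print(psrf(well_mixed) < 1.1)
```

Values close to 1 indicate that the chains agree with one another; chains stuck in different regions inflate the between-chain variance and push the factor above the 1.1 threshold.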
- MICE: Multivariate imputation by chained equations
- Temporal MICE: A MICE variant that adds lagged variables as predictors
- Complete Data: Serves as performance upper bound reference
- Sampling Iterations: 20,000 iterations, with first 5,000 as burn-in
- Missing Value Update Frequency: Updated every 10 iterations (EM=10)
- Chain Thinning: Retain every 5th sample to reduce autocorrelation
- Prior Parameters: $\lambda = 1$, $\sigma_{(i)}^2 = \delta_{(i)}^2 = 1$
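The sampling schedule can be sketched as a plain loop. The two update callables are hypothetical placeholders standing in for the structure and parameter move and for the Gibbs imputation step; only the iteration counts come from the settings listed here.

```python
def run_schedule(n_iter=20_000, burn_in=5_000, thin=5, impute_every=10,
                 update_model=lambda state: state,
                 resample_missing=lambda state: state,
                 state=None):
    """Return the retained iteration indices and the number of imputation sweeps."""
    kept, n_imputations = [], 0
    for it in range(n_iter):
        state = update_model(state)            # placeholder: structure/parameter move
        if it % impute_every == 0:
            state = resample_missing(state)    # placeholder: resample missing values
            n_imputations += 1
        if it >= burn_in and (it - burn_in) % thin == 0:
            kept.append(it)                    # retained after burn-in and thinning
    return kept, n_imputations

kept, n_imputations = run_schedule()
print(len(kept), n_imputations)  # 3000 2000
```

With these settings each chain retains 3,000 draws and performs 2,000 imputation sweeps.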
LUME-DBN significantly outperforms baseline methods across all experimental settings:
- MICE Performance: Completely fails when missing rates exceed 20%, reflecting its ineffectiveness on temporal data
- Temporal MICE: Outperforms MICE but remains significantly inferior to LUME-DBN
- LUME-DBN Advantages: Particularly outstanding at high missing rates, with minimal performance loss compared to complete data in large sample scenarios
- Structure Convergence: Converges within 1.5k iterations across all missing rates
- Missing Value Convergence: Requires 5k iterations at 40% missing rate
- Convergence Stability: Convergence time increases with the missing rate, but the sampler eventually converges in all cases
- Self-Regulatory Loops: Strong internal connections within pressure parameters (MAP, Sys, Dias) and respiratory variables (FiO2, PaCO2, PaO2, pH)
- Neurological Interactions: Decreased consciousness level leads to increased heart rate (CCU: GCS → HR)
- Hemodynamic Effects: Blood pressure strongly influences consciousness level (Medical patients: Dias, MAP → GCS)
- Temperature Regulation Dynamics: Temperature changes during surgical recovery affect urine output (Temp → Urine)
- Cardiopulmonary Feedback: Low oxygen levels trigger compensatory heart rate increase (FiO2 → HR)
- Local Standardization: Reveals more ICU-specific relationships
- Global Standardization: Network displays more commonalities, but some relationships lack clinical evidence support
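The difference between the two schemes can be illustrated with a short sketch. It assumes that local standardization means z-scoring each variable within a group (for example one ICU type) and that global standardization uses statistics pooled over all groups; the summary does not define the grouping unit, so this is an assumption, as are the function names.

```python
import numpy as np

def standardize_global(X):
    """Z-score each column using statistics pooled over all rows (NaNs ignored)."""
    X = np.asarray(X, dtype=float)
    return (X - np.nanmean(X, axis=0)) / np.nanstd(X, axis=0)

def standardize_local(X, groups):
    """Z-score each column separately within each group (NaNs ignored)."""
    X = np.asarray(X, dtype=float)
    groups = np.asarray(groups)
    Z = np.empty_like(X, dtype=float)
    for g in np.unique(groups):
        rows = groups == g
        Z[rows] = (X[rows] - np.nanmean(X[rows], axis=0)) / np.nanstd(X[rows], axis=0)
    return Z

rng = np.random.default_rng(0)
# Toy data: two groups whose baselines differ by 5 units.
X = rng.normal(size=(40, 3)) + np.repeat([[0.0], [5.0]], 20, axis=0)
groups = np.repeat(["MICU", "CCU"], 20)
Z_local = standardize_local(X, groups)
Z_global = standardize_global(X)
```

Local scaling removes baseline differences between groups, so the remaining variation reflects within-group dynamics; global scaling preserves those baseline differences in the standardized data.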
- SEM Algorithm: Hard EM variants effective with limited data but prone to local optima
- MCMC Methods: Recent sampling methods escape local minima, achieving more accurate reconstruction
- Existing Methods: Primarily use model-agnostic methods such as MICE for missing data handling
- This Paper's Contribution: First extension of sampling methods to missing data handling in DBNs
- Organ Failure Prediction: DBNs used for predicting organ failure trajectories
- Physiological Change Prediction: Predicting physiological changes and mortality risk
- Decision Support: Providing interpretable decision support
- Method Effectiveness: LUME-DBN outperforms existing methods in both structure reconstruction and missing value imputation
- Clinical Relevance: Discovered temporal relationships have clinical significance, supporting safer clinical decision-making
- Uncertainty Quantification: The fully Bayesian framework provides explicit uncertainty encoding for models, parameters, and missing values
- Computational Complexity: MCMC sampling is computationally expensive and would benefit from parallelization
- Missing Completely at Random Assumption: Current method only handles MCAR; non-random missing patterns in clinical data require further investigation
- Sample Size Constraints: Some relationships may lack stability in small sample scenarios
- Prior Knowledge Integration: Better integration of clinical prior knowledge to guide model inference is needed
- MNAR Handling: Integrate missing data graph methods to handle non-random missing patterns
- Non-Homogeneous DBNs: Extend to globally coupled non-homogeneous DBNs to capture non-stationary relationships
- Mixed Variables: Handle mixed continuous and discrete variable types
- Real-Time Applications: Develop real-time clinical decision support systems
- Theoretical Rigor: Complete derivation of closed-form solutions for missing value FCDs with solid theoretical foundation
- Methodological Innovation: First application of fully Bayesian methods to missing data learning in DBNs
- Experimental Sufficiency: Includes simulated and real data validation, covering different missing rates and sample sizes
- Clinical Relevance: Discovered relationships have clinical significance, validating practical utility
- Reproducibility: Provides complete algorithm description and open-source code
- Computational Efficiency: Lacks detailed computational time analysis and optimization strategies
- Frequentist Comparison: Lacks a comparison with classical frequentist DBN learning methods
- Parameter Sensitivity: Insufficient sensitivity analysis regarding hyperparameter selection
- Scalability: Performance on larger-scale networks remains unknown
- Academic Contribution: Provides new theoretical framework for missing data handling in DBNs
- Practical Value: Has significant application prospects in critical domains such as healthcare
- Method Generalizability: Extensible to other fields requiring temporal sequence missing data handling
- Healthcare: ICU monitoring, chronic disease management, clinical trial analysis
- Finance: Time series risk modeling, market prediction
- Industry: Equipment health monitoring, quality control
- Environment: Climate modeling, pollution monitoring
The paper cites 42 relevant references, covering important works in Bayesian network learning, missing data handling, medical informatics, and other domains, providing a solid theoretical foundation for the research.
Overall Assessment: This is a high-quality paper with significant methodological innovation, demonstrating both theoretical breakthroughs and practical value. While there is room for improvement in computational efficiency and method comparison, its contributions are sufficient to advance the field.