MedFuse: Multiplicative Embedding Fusion For Irregular Clinical Time Series
Hsieh, Chien, Huang et al.
Clinical time series derived from electronic health records (EHRs) are inherently irregular, with asynchronous sampling, missing values, and heterogeneous feature dynamics. While numerical laboratory measurements are highly informative, existing embedding strategies usually combine feature identity and value embeddings through additive operations, which constrains their ability to capture value-dependent feature interactions. We propose MedFuse, a framework for irregular clinical time series centered on the MuFuse (Multiplicative Embedding Fusion) module. MuFuse fuses value and feature embeddings through multiplicative modulation, preserving feature-specific information while modeling higher-order dependencies across features. Experiments on three real-world datasets covering both intensive and chronic care show that MedFuse consistently outperforms state-of-the-art baselines on key predictive tasks. Analysis of the learned representations further demonstrates that multiplicative fusion enhances expressiveness and supports cross-dataset pretraining. These results establish MedFuse as a generalizable approach for modeling irregular clinical time series.
academic
MedFuse: Multiplicative Embedding Fusion For Irregular Clinical Time Series
Clinical time series in electronic health records (EHR) exhibit inherent irregularity, including asynchronous sampling, missing values, and heterogeneous feature dynamics. Existing embedding strategies typically combine feature identity and numerical embeddings through additive operations, which limits the ability to capture value-dependent feature interactions. This paper proposes the MedFuse framework, centered on the MuFuse (Multiplicative Embedding Fusion) module. MuFuse fuses numerical and feature embeddings through multiplicative modulation, modeling higher-order dependencies while preserving feature-specific information. Experiments on three real-world datasets demonstrate that MedFuse consistently outperforms state-of-the-art baselines on critical prediction tasks. Analysis of learned representations further confirms that multiplicative fusion enhances expressiveness and supports cross-dataset pretraining.
Clinical time series modeling faces three major challenges:
Irregular Sampling: Vital signs may be monitored frequently, while laboratory tests are performed only when clinically necessary; patients may miss scheduled visits
High Missing Rate: Average missing rates in datasets reach 73.77%-88.14%
Difficult Numerical Representation: Laboratory values encode complex information within continuous ranges, theoretically requiring infinite representations
Existing EVAT (Each Value As Token) methods primarily employ additive fusion:
Treat numerical embeddings as additive offsets to feature embeddings
Limited Expressiveness: Difficult to capture value-dependent nonlinear interactions
Loss of Clinical Semantics: Cannot distinguish qualitative differences between small and large deviations in laboratory measurements (e.g., mild creatinine elevation vs. sharp increase)
Multiplicative fusion has proven to provide stronger semantic integration than additive or concatenation approaches in other domains
The special nature of clinical data (e.g., medical equifinality: different abnormal deviations may correspond to the same clinical risk) requires more flexible fusion mechanisms
Need for a universal framework that requires no imputation and can directly handle irregular observations
Multiplicative Value-Feature Fusion: Proposes the MuFuse module, performing nonlinear, feature-specific modulation through value-conditioned multiplicative fusion without expanding the embedding vocabulary
Universal Imputation-Free Framework: Constructs MedFuse based on MuFuse, adopting a (feature, value, timestamp) triplet tokenization scheme to directly model irregular measurements
Comprehensive Validation and Transferability:
Consistently outperforms strong baselines on ICU and chronic disease datasets
Ablation studies confirm multiplicative superiority over additive fusion
Transfer experiments show learned feature embeddings can be reused across datasets
Theoretical Insights: Proves that the recent SOTA method SCANE is actually a special case of MuFuse (d'=1), establishing a more general fusion mechanism
SCANE directly multiplies observed values as scalars with feature embeddings, which is actually a special case of MuFuse (d'=1, no value transformation). MuFuse provides stronger expressiveness through flexible dimension selection and nonlinear projection.
Conclusion: Multiplicative fusion shows 5.5% improvement in AUPRC over additive, confirming the effectiveness of value-conditioned multiplicative modulation
MedFuse is a paper with substantial contributions to clinical time series modeling. Its core innovation—multiplicative embedding fusion (MuFuse)—not only elegantly generalizes existing SOTA methods in theory but also achieves consistent performance improvements on multiple real-world datasets. The paper's experimental design is comprehensive, systematically validating method effectiveness from main performance comparisons through ablation studies, dimension analysis, and transfer learning.
Particularly commendable is the paper's insight into medical equifinality—naturally modeling the phenomenon where different abnormal deviations correspond to the same clinical risk through the masking effect of multiplicative fusion. This demonstrates the authors' deep understanding of the clinical domain.
However, the paper has some limitations: relatively high computational cost, limited transfer learning experiments, and lack of code release. Nevertheless, MedFuse provides a powerful and universal framework for irregular clinical time series modeling, with significant implications for advancing medical AI. Future work on multimodal extension, interpretability, and practical clinical deployment is anticipated.