Graph Transformer with Disease Subgraph Positional Encoding for Improved Comorbidity Prediction
Qin, Liao
Comorbidity, the co-occurrence of multiple medical conditions in a single patient, profoundly impacts disease management and outcomes. Understanding these complex interconnections is crucial, especially in contexts where comorbidities exacerbate outcomes. Leveraging insights from the human interactome (HI) and advancements in graph-based methodologies, this study introduces Transformer with Subgraph Positional Encoding (TSPE) for disease comorbidity prediction. Inspired by Biologically Supervised Embedding (BSE), TSPE employs Transformer's attention mechanisms and Subgraph Positional Encoding (SPE) to capture interactions between nodes and disease associations. Our proposed SPE proves more effective than LPE, as used in Dwivedi et al.'s Graph Transformer, underscoring the importance of integrating clustering and disease-specific information for improved predictive accuracy. Evaluated on real clinical benchmark datasets (RR0 and RR1), TSPE demonstrates substantial performance enhancements over the state-of-the-art method, achieving up to 28.24% higher ROC AUC and 4.93% higher accuracy. This method shows promise for adaptation to other complex graph-based tasks and applications. The source code is available in the GitHub repository at: https://github.com/xihan-qin/TSPE-GraphTransformer.
academic
Graph Transformer with Disease Subgraph Positional Encoding for Improved Comorbidity Prediction
This study addresses the disease comorbidity prediction problem by proposing a Graph Transformer method with subgraph positional encoding (TSPE). The approach leverages Human Interactome (HI) data and employs Transformer attention mechanisms combined with a novel Subgraph Positional Encoding (SPE) to capture inter-node interactions and disease associations. Experiments on clinical benchmark datasets RR0 and RR1 demonstrate that TSPE achieves improvements of up to 28.24% in ROC AUC and 4.93% in accuracy compared to existing state-of-the-art methods.
Core Problem: Disease comorbidity prediction, i.e., predicting the likelihood of multiple diseases occurring simultaneously in the same patient
Significance: Comorbidity significantly affects disease management, treatment strategies, and prognostic outcomes, particularly in pandemics such as COVID-19, where specific comorbidities lead to more severe outcomes
Limitations of Existing Methods:
Traditional methods such as geodesic embedding (GE) have limited performance
The current state-of-the-art method BSE, while introducing supervised selection mechanisms, still relies on traditional SVM classifiers
The Graph Transformer proposed by Dwivedi et al. uses Laplacian Positional Encoding (LPE) that lacks disease-specific information
Building on the emphasis of BSE research on node connectivity and disease associations, this work explores leveraging Transformer attention mechanisms and specially designed subgraph positional encoding to improve comorbidity prediction performance.
Proposes TSPE Framework: First application of Transformer architecture to disease comorbidity prediction, designing an encoder-decoder structure suitable for graph data
Innovative Subgraph Positional Encoding (SPE): Combines clustering information from Laplacian Positional Encoding (LPE) with disease label information from Graph Positional Encoding (GPE)
Significant Performance Improvements: Substantially outperforms existing state-of-the-art methods on both benchmark datasets
Comprehensive Ablation Studies: Validates the effectiveness of different positional encoding methods
Overall Assessment: This is a high-quality research paper that successfully introduces Transformer architecture to disease comorbidity prediction. The proposed SPE positional encoding method demonstrates clear biological motivation and technical innovation. The experimental results are impressive and provide valuable references for related research.