Hybrid Explanation-Guided Learning for Transformer-Based Chest X-Ray Diagnosis
Shu, Luo, Poellinger et al.
Transformer-based deep learning models have demonstrated exceptional performance in medical imaging by leveraging attention mechanisms for feature representation and interpretability. However, these models are prone to learning spurious correlations, leading to biases and limited generalization. While human-AI attention alignment can mitigate these issues, it often depends on costly manual supervision. In this work, we propose a Hybrid Explanation-Guided Learning (H-EGL) framework that combines self-supervised and human-guided constraints to enhance attention alignment and improve generalization. The self-supervised component of H-EGL leverages class-distinctive attention without relying on restrictive priors, promoting robustness and flexibility. We validate our approach on chest X-ray classification using the Vision Transformer (ViT), where H-EGL outperforms two state-of-the-art Explanation-Guided Learning (EGL) methods, demonstrating superior classification accuracy and generalization capability. Additionally, it produces attention maps that are better aligned with human expertise.
academic
Hybrid Explanation-Guided Learning for Transformer-Based Chest X-Ray Diagnosis
Transformer-based deep learning models have demonstrated superior feature representation and interpretability capabilities through attention mechanisms in medical imaging. However, these models are prone to learning spurious correlations, leading to bias and limited generalization ability. While human-machine attention alignment can mitigate these issues, it often relies on expensive manual supervision. This work proposes a Hybrid Explanation-Guided Learning (H-EGL) framework that combines self-supervised and human-guided constraints to enhance attention alignment and improve generalization. The self-supervised component of H-EGL leverages class-discriminative attention without relying on restrictive priors, promoting robustness and flexibility. Validated on chest X-ray classification tasks using Vision Transformer (ViT), H-EGL surpasses two state-of-the-art explanation-guided learning methods, demonstrating superior classification accuracy and generalization ability while producing attention maps better aligned with human expert knowledge.
The core problems addressed in this research are spurious correlation learning and attention alignment in Transformer-based medical imaging models. Specifically:
Spurious Correlation Problem: Deep neural networks tend to learn spurious correlations in data, leading to shortcut learning, bias, and fairness issues
Attention Alignment Challenge: While human-machine attention alignment can improve model robustness, it requires expensive manual annotation
Limitations of Existing Methods: Pure self-supervised methods may reinforce incorrect interpretations, while contrastive learning methods lack standardized positive and negative sample generation approaches
Proposes H-EGL Framework: First application of hybrid explanation-guided methods to Transformer architecture, evaluating and enhancing human-machine attention alignment
Designs DAL Component: Proposes Discriminative Attention Learning (DAL) that leverages class-discriminative attention maps for self-supervised learning
Achieves Performance Improvement: Surpasses existing state-of-the-art methods on chest X-ray classification tasks with AUC reaching 89.3%
Enhances Interpretability: Generates attention maps better aligned with expert knowledge while maintaining classification performance
Input: Chest X-ray images and disease label text
Output: Multi-label disease classification predictions and class-specific attention maps
Objective: Improve classification accuracy while generating attention maps aligned with human expert-annotated regions
The paper cites multiple important related works, including:
Original Vision Transformer (ViT) paper 3
Spurious correlation research in medical imaging 2,5,6
Explanation-guided learning survey 4
DWARF method 11 and KAD method 19
Overall Assessment: This is a high-quality research paper making meaningful contributions to medical imaging interpretability. The hybrid explanation-guided learning framework is well-designed with sufficient experimental validation and convincing results. Despite some limitations, it provides a solid foundation and direction for future research.