2025-11-16T10:43:13.528960

PruneGCRN: Minimizing and explaining spatio-temporal problems through node pruning

GarcÃa-SigÃ¼enza, Nanni, Llorens-Largo et al.

This work addresses the challenge of using a deep learning model to prune graphs and the ability of this method to integrate explainability into spatio-temporal problems through a new approach. Instead of applying explainability to the model's behavior, we seek to gain a better understanding of the problem itself. To this end, we propose a novel model that integrates an optimized pruning mechanism capable of removing nodes from the graph during the training process, rather than doing so as a separate procedure. This integration allows the architecture to learn how to minimize prediction error while selecting the most relevant nodes. Thus, during training, the model searches for the most relevant subset of nodes, obtaining the most important elements of the problem, facilitating its analysis. To evaluate the proposed approach, we used several widely used traffic datasets, comparing the accuracy obtained by pruning with the model and with other methods. The experiments demonstrate that our method is capable of retaining a greater amount of information as the graph reduces in size compared to the other methods used. These results highlight the potential of pruning as a tool for developing models capable of simplifying spatio-temporal problems, thereby obtaining their most important elements.

academic

PruneGCRN: Minimizing and explaining spatio-temporal problems through node pruning

基本信息

论文ID: 2510.10803
标题: PruneGCRN: Minimizing and explaining spatio-temporal problems through node pruning
作者: Javier García-Sigüenza, Mirco Nanni, Faraón Llorens-Largo, José F. Vicent
分类: cs.LG cs.AI
发表时间: October 14, 2025 (arXiv preprint)
论文链接: https://arxiv.org/abs/2510.10803

AI透明度需求：随着AI的广泛应用，特别是在高风险领域（医疗、金融、自动驾驶），可解释性变得至关重要
时空问题复杂性：结合图神经网络(GNN)和循环神经网络(RNN)的时空模型复杂度高，传统可解释性方法难以适用
实际应用价值：在交通预测中，识别最重要的传感器位置对城市规划和交通管理具有重要意义

现有方法局限性

注意力机制：存在"组合捷径"问题，可能关注不相关的标记
原型网络：主要适用于分类任务，不包含时间维度
模糊系统：准确性较低，与深度学习结合后复杂度增加
后验可解释性方法：通常会损害性能，且主要关注空间维度

核心贡献

提出PruneGCRN模型：一种新颖的图卷积循环网络，集成了节点剪枝机制
创新的可解释性范式：从理解模型行为转向理解问题本身
训练时集成剪枝：将节点选择集成到训练过程中，而非作为独立的后处理步骤
Binary Clamp技术：提出比Hard Concrete更简单有效的掩码生成方法
实验验证：在多个交通数据集上验证了方法的有效性

方法详解

任务定义

给定一个时空图序列，其中每个节点代表一个空间位置（如交通传感器），任务是：

预测未来时间步的节点值
同时学习一个掩码，识别对预测最重要的节点子集
在保持预测准确性的同时最小化使用的节点数量

模型架构

PruneGCRN模型包含两个核心模块：

1. 节点自适应参数学习模块 (NAPL)

NAPL模块通过节点嵌入学习特定模式的滤波器：

Θ = EN · WN
b = EN · bN

其中：

EN ∈ R^(n×d)：节点嵌入矩阵
WN ∈ R^(d×c×f)：共享权重
bN：共享偏置

修改后的图卷积操作为：

Z = (IN + D^(-1/2)AD^(-1/2))XENWN + ENbN

2. 剪枝图学习模块 (PGL)

PGL模块生成用于节点选择的掩码M̃：

掩码生成流程：

Raw Mask：初始化为1的浮点值掩码
Binary Clamp：将<0的值设为0，>0的值设为1
Inverse Mask：计算反向掩码
Graph Bias：为被掩码的节点学习替代值

Binary Clamp优势：

比Hard Concrete更简单
训练和验证时行为一致
单步优化节点选择

3. 完整的PruneGCRN架构

将NAPL和PGL模块集成到GRU中：

zt = σ(L̃[X̃:,t, ht-1]ENWzr + Ebzr)
rt = σ(In[X̃:,t, ht-1]ENWzr + Ebzr)  
ĥt = tanh([In + L̃][X̃:,t, r ⊙ ht-1]ENWĥ + ENbĥ)
ht = zt ⊙ ĥt-1 + (1-zt) ⊙ ĥt-1

技术创新点

训练时节点剪枝：与传统的后处理剪枝不同，PruneGCRN在训练过程中同时优化预测准确性和节点选择
Binary Clamp机制：相比SEGCRN使用的Hard Concrete，提供更稳定和简单的掩码生成
问题导向的可解释性：关注识别问题的关键元素而非模型行为
联合优化：通过损失函数同时考虑预测误差和节点使用数量

实验设置

数据集

使用5个广泛采用的交通数据集：

数据集	传感器数量	时间范围	特点
PeMSD3	358	2018.9.9-11.30	5分钟间隔交通量
PeMSD4	307	2018.1.1-2.28	5分钟间隔交通量
PeMSD7	883	2017.5.1-2018.8.31	5分钟间隔交通量
PeMSD8	170	2018.7.1-8.31	5分钟间隔交通量
PeMS-Bay	325	2017.1.1-5.31	包含地理位置信息