2025-11-23T20:34:17.570355

Causal Explanation of Concept Drift -- A Truly Actionable Approach

Komnick, Lammers, Hammer et al.

In a world that constantly changes, it is crucial to understand how those changes impact different systems, such as industrial manufacturing or critical infrastructure. Explaining critical changes, referred to as concept drift in the field of machine learning, is the first step towards enabling targeted interventions to avoid or correct model failures, as well as malfunctions and errors in the physical world. Therefore, in this work, we extend model-based drift explanations towards causal explanations, which increases the actionability of the provided explanations. We evaluate our explanation strategy on a number of use cases, demonstrating the practical usefulness of our framework, which isolates the causally relevant features impacted by concept drift and, thus, allows for targeted intervention.

academic

Causal Explanation of Concept Drift -- A Truly Actionable Approach

基本信息

论文ID: 2507.23389
标题: Causal Explanation of Concept Drift -- A Truly Actionable Approach
作者: David Komnick, Kathrin Lammers, Barbara Hammer, Valerie Vaquet, Fabian Hinder (Bielefeld University)
分类: cs.LG (Machine Learning)
发表时间/会议: TempXAI workshop at ECML-PKDD 2025
论文链接: https://arxiv.org/abs/2507.23389

概念漂移问题：在实际应用中，数据分布会随时间发生变化，这种现象称为概念漂移，会导致机器学习模型性能下降
解释性需求：仅检测到漂移是不够的，需要理解漂移的原因以便采取有效的干预措施
可操作性缺失：现有的漂移解释方法主要是探索性的，缺乏直接的可操作性指导

重要性

工业应用：在关键基础设施（如电网、水分配网络）中，理解漂移原因对系统监控和故障预防至关重要
模型维护：准确的漂移解释能够指导模型适应和改进策略
决策支持：为操作员提供可操作的解释，支持自主程序或人工干预决策

现有方法局限性

基于模型的漂移解释：虽然versatile但主要关注探索性解释技术
特征重要性方法：缺乏因果推理能力，无法提供直接的干预指导
因果漂移解释研究有限：相关工作很少，且主要关注预测或检测任务

核心贡献

理论框架：将基于模型的漂移解释框架扩展到因果解释领域
数学形式化：提供了漂移逆转干预（drift-reversing intervention）的严格数学定义
算法实现：提出了实用的因果漂移解释算法，基于因果发现方法
实验验证：在半合成数据集上验证了方法的有效性和稳定性

方法详解

任务定义

输入：包含时间标签的数据流 S = ((X₁, T₁), (X₂, T₂), ...) 输出：

核心干预特征集合 C（时间节点的直接子节点）
条件特征集合 P（核心特征的其他父节点）
完整干预特征集合 A（核心特征及其所有祖先）

理论基础

概念漂移的因果建模

论文将概念漂移形式化为数据和时间的依赖关系：

定义1（概念漂移）：分布过程(P_T, D_t)存在漂移当且仅当：

存在s,t使得D_t ≠ D_s，概率大于0
数据X和时间T不独立

因果模型与干预

基于贝叶斯网络和do-演算：

贝叶斯网络：(G, P_f)，其中G是有向无环图，P_f是条件分布集合
do-操作：P_G(· | do(X_F = x))表示对特征F进行干预后的分布
因果模型：如果网络对所有干预的预测都与实验结果一致

漂移逆转干预

定义5：特征集合F提供漂移逆转干预，当且仅当通过控制F中特征的值，能够产生与改变时间流相同的效果。

核心定理

定理2：在忠实因果模型中：

时间节点没有父节点
时间节点有子节点当且仅当存在漂移
每个漂移逆转集必须包含时间节点的所有子节点
时间节点所有子节点及其祖先构成漂移逆转集

定理3：最小需要改变的特征集合恰好是时间节点的所有直接子节点。

算法实现

Algorithm 1: Causal Explanation of Drift
Input: S = ((X₁, T₁), ...) data stream
1. G ← DetermineDAG(S)  // 运行因果发现算法
2. C ← GetChildren(G, f_T)  // 获取时间节点的子节点
3. P ← ∪_{f∈C} GetParents(G, f) \ ({f_T} ∪ C)
4. A ← ∪_{f∈C} GetAncesters(G, f) \ {f_T}
5. return (C, P, A)