2025-11-23T20:34:17.570355

Causal Explanation of Concept Drift -- A Truly Actionable Approach

Komnick, Lammers, Hammer et al.

In a world that constantly changes, it is crucial to understand how those changes impact different systems, such as industrial manufacturing or critical infrastructure. Explaining critical changes, referred to as concept drift in the field of machine learning, is the first step towards enabling targeted interventions to avoid or correct model failures, as well as malfunctions and errors in the physical world. Therefore, in this work, we extend model-based drift explanations towards causal explanations, which increases the actionability of the provided explanations. We evaluate our explanation strategy on a number of use cases, demonstrating the practical usefulness of our framework, which isolates the causally relevant features impacted by concept drift and, thus, allows for targeted intervention.

academic

Causal Explanation of Concept Drift -- A Truly Actionable Approach

Basic Information

Paper ID: 2507.23389
Title: Causal Explanation of Concept Drift -- A Truly Actionable Approach
Authors: David Komnick, Kathrin Lammers, Barbara Hammer, Valerie Vaquet, Fabian Hinder (Bielefeld University)
Classification: cs.LG (Machine Learning)
Publication Time/Conference: TempXAI workshop at ECML-PKDD 2025
Paper Link: https://arxiv.org/abs/2507.23389

Abstract

In a constantly changing world, understanding how these changes affect different systems such as industrial manufacturing or critical infrastructure is crucial. Explaining critical changes (referred to as concept drift in the machine learning field) is the first step toward implementing targeted interventions to prevent or correct model failures and failures and errors in the physical world. Therefore, this paper extends model-based drift explanation to causal explanation, improving the actionability of the provided explanations. The authors evaluate the explanation strategy on multiple use cases, demonstrating the practicality of the framework, which can isolate causally relevant features affected by concept drift, thereby allowing targeted interventions.

Research Background and Motivation

Problem Definition

Concept Drift Problem: In practical applications, data distribution changes over time, a phenomenon called concept drift, which leads to degraded machine learning model performance
Explainability Requirements: Merely detecting drift is insufficient; understanding the causes of drift is necessary to enable effective intervention measures
Missing Actionability: Existing drift explanation methods are primarily exploratory, lacking direct actionable guidance

Significance

Industrial Applications: In critical infrastructure (such as power grids, water distribution networks), understanding drift causes is essential for system monitoring and failure prevention
Model Maintenance: Accurate drift explanation can guide model adaptation and improvement strategies
Decision Support: Providing operators with actionable explanations to support autonomous procedures or human intervention decisions

Limitations of Existing Methods

Model-Based Drift Explanation: While versatile, primarily focuses on exploratory explanation techniques
Feature Importance Methods: Lack causal reasoning capability, unable to provide direct intervention guidance
Limited Causal Drift Explanation Research: Few related works, primarily focusing on prediction or detection tasks

Core Contributions

Theoretical Framework: Extends the model-based drift explanation framework to the causal explanation domain
Mathematical Formalization: Provides rigorous mathematical definitions of drift-reversing interventions
Algorithm Implementation: Proposes practical causal drift explanation algorithms based on causal discovery methods
Experimental Validation: Validates method effectiveness and stability on semi-synthetic datasets

Methodology Details

Task Definition

Input: Data stream with temporal labels S = ((X₁, T₁), (X₂, T₂), ...) Output:

Core intervention feature set C (direct children of temporal node)
Conditional feature set P (other parents of core features)
Complete intervention feature set A (core features and all their ancestors)

Theoretical Foundation

Causal Modeling of Concept Drift

The paper formalizes concept drift as a dependency relationship between data and time:

Definition 1 (Concept Drift): A distribution process (P_T, D_t) exhibits drift if and only if:

There exist s, t such that D_t ≠ D_s with probability greater than 0
Data X and time T are not independent

Causal Models and Interventions

Based on Bayesian networks and do-calculus:

Bayesian Network: (G, P_f), where G is a directed acyclic graph and P_f is a set of conditional distributions
do-operation: P_G(· | do(X_F = x)) represents the distribution after intervention on feature F
Causal Model: A network is causal if its predictions for all interventions match experimental results

Drift-Reversing Intervention

Definition 5: A feature set F provides drift-reversing intervention if and only if controlling the values of features in F produces the same effect as changing the temporal flow.

Core Theorems

Theorem 2: In a faithful causal model:

The temporal node has no parents
The temporal node has children if and only if drift exists
Every drift-reversing set must contain all children of the temporal node
All children of the temporal node and their ancestors constitute a drift-reversing set

Theorem 3: The minimal set of features that need to be changed consists exactly of all direct children of the temporal node.

Algorithm Implementation

Algorithm 1: Causal Explanation of Drift
Input: S = ((X₁, T₁), ...) data stream
1. G ← DetermineDAG(S)  // Run causal discovery algorithm
2. C ← GetChildren(G, f_T)  // Get children of temporal node
3. P ← ∪_{f∈C} GetParents(G, f) \ ({f_T} ∪ C)
4. A ← ∪_{f∈C} GetAncesters(G, f) \ {f_T}
5. return (C, P, A)

Experimental Setup

Datasets

Semi-synthetic datasets constructed from Adult and Portuguese Student Performance datasets:

Adult Dataset Scenarios:

Adult Inflation: Inflation increases the likelihood of high monetary values
Adult Women in STEM: Women are more likely to work in STEM fields

Student Dataset Scenarios:

Student Girls Support: Female students participate in support programs
Student Boys Support: Male students participate in support programs

Data Scale:

Adult: approximately 48,800 samples, drift point at 25,000
Student: 5,000 samples, drift point at 2,000

Evaluation Methods

Causal Structure Recovery Accuracy: Comparing detected edges with ground truth causal graphs
Drift Feature Identification Stability: Assessing consistency in identifying temporal node children
10 Independent Experiments: Evaluating result stability

Implementation Details

PC algorithm from causal-learn Python package
g-square independence test
Default parameter settings

Experimental Results

PC Algorithm Performance Analysis

Adult Dataset:

Correct detection rate: 50% (19/38 edges)
9 edges with incorrect direction, 10 edges undetected

Student Dataset:

Correct detection rate: 30.77% (8/26 edges)
Poor performance primarily due to insufficient samples relative to feature count

Drift Explanation Results

Adult Women in STEM Scenario

Success Rate: Correctly identified occupation as drift feature in 9/10 experiments
Temporal Relationship: T detected as parent of occupation
Stability: Other ancestor features consistent with non-drifted data

Adult Inflation Scenario

Multi-Feature Drift: Correctly identified all three drift features (capital-gain, capital-loss, income) in 8/10 experiments
Correlation Impact: capital-gain undetected in 2 experiments due to strong feature correlation

Student Support Scenario

High Accuracy: Despite poor overall causal graph recovery quality, correctly identified schoolsup as the sole drift feature in all experiments
Conditional Features: Correctly identified sex as conditional variable

Key Findings

No False Positives: Temporal feature never connected to unrelated features
Strong Drift Detection: Strong concept drift reliably detected even with poor data quality
Stability: Drift feature identification demonstrates stable performance in most cases

Concept Drift Explanation

Visualization Methods: Primarily focus on drift detection and quantification
Feature-Level Explanation: Provides feature-level drift analysis
Model-Based Explanation: Uses surrogate models to compute explanations

Causal Drift Explanation

Limited Research: Very few related works
DAG Comparison Methods: Comparing causal models before and after drift
NOTEARS Algorithm: Online method for causal discovery

Feature Correlation Theory

Drift-Inducing Features: Features causing drift
Faithful Drift Features: Features following drift
Markov Boundary: Connection to the paper's theoretical framework

Conclusions and Discussion

Main Conclusions

Theoretical Contribution: Successfully introduces causal reasoning into concept drift explanation
Practicality: Provides direct actionable intervention guidance
Stability: Reliably identifies drift features even when causal discovery algorithms perform poorly

Limitations

Causal Discovery Dependency: Method effectiveness limited by underlying causal discovery algorithm accuracy
Single Causal Graph Assumption: Assumes entire dataset can be described by one causal graph
Data Requirements: Requires sufficient samples to support reliable independence testing

Future Directions

Feature Correlation Theory Integration: Exploring relationships between causal discovery and feature correlation to reduce data requirements
Local Explanations: Providing more localized explanations for specific subgroups
Multiple Causal Graph Extension: Handling complex scenarios requiring multiple causal graphs

In-Depth Evaluation

Strengths

Theoretical Rigor: Provides complete mathematical formalization framework with rigorous proofs from definitions to theorems
Strong Innovation: First systematic combination of causal reasoning with concept drift explanation
High Practical Value: Provides direct intervention guidance with genuine actionability
Reasonable Experimental Design: Uses semi-synthetic data to ensure ground truth availability

Weaknesses

Limited Experimental Scale: Testing only on variants of two datasets
Single Causal Discovery Algorithm: Primarily relies on PC algorithm, unexplored effects of other algorithms
Insufficient Real-World Validation: Lacks verification in real industrial scenarios
Missing Complexity Analysis: Lacks detailed analysis of algorithm time and space complexity

Impact

Academic Contribution: Opens new research direction in concept drift explanation field
Practical Value: Provides new tools for critical infrastructure monitoring
Reproducibility: Provides open-source code facilitating reproduction and extension

Applicable Scenarios

Industrial Monitoring: Power grids, water distribution networks, and other critical infrastructure
Quality Control: Product quality monitoring in manufacturing
Financial Risk Management: Drift analysis in credit assessment models
Medical Diagnosis: Diagnostic model applicability analysis across different populations

References

The paper cites 30 related references, primarily including:

Classical works on concept drift detection and adaptation (Gama et al., 2014)
Survey articles on explainable AI (Adadi & Berrada, 2018)
Foundational theories in causal reasoning (Pearl, 2009)
Model-based drift explanation frameworks (Hinder et al., 2023)

This paper makes important contributions at both theoretical and practical levels, providing a novel causal perspective on concept drift explanation with significant academic value and practical significance. Despite some limitations, it establishes a solid foundation for future research.