Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is centered on the computation of "sufficient reasons". Given a model $M$ and an input instance $\vec{x}$, a sufficient reason for the decision $M(\vec{x})$ is a subset $S$ of the features of $\vec{x}$ such that for any instance $\vec{z}$ that has the same values as $\vec{x}$ for every feature in $S$, it holds that $M(\vec{x}) = M(\vec{z})$. Intuitively, this means that the features in $S$ are sufficient to fully justify the classification of $\vec{x}$ by $M$. For sufficient reasons to be useful in practice, they should be as small as possible, and a natural way to reduce the size of sufficient reasons is to consider a probabilistic relaxation; the probability of $M(\vec{x}) = M(\vec{z})$ must be at least some value $δ\in (0,1]$, for a random instance $\vec{z}$ that coincides with $\vec{x}$ on the features in $S$. Computing small $δ$-sufficient reasons ($δ$-SRs) is known to be a theoretically hard problem; even over decision trees--traditionally deemed simple and interpretable models--strong inapproximability results make the efficient computation of small $δ$-SRs unlikely. We propose the notion of $(δ, ε)$-SR, a simple relaxation of $δ$-SRs, and show that this kind of explanation can be computed efficiently over linear models.
Title: Probabilistic Explanations for Linear Models
Authors: Bernardo Subercaseaux (Carnegie Mellon University), Marcelo Arenas (PUC Chile, IMFD Chile, RelationalAI), Kuldeep S. Meel (Georgia Institute of Technology, University of Toronto)
This paper investigates the computational problem of computing "sufficient reasons" in Formal Explainable AI (Formal XAI). Given a model M and input instance x, a sufficient reason is a feature subset S such that any instance z agreeing with x on S satisfies M(x) = M(z). To reduce the size of sufficient reasons, the authors consider a probabilistic relaxation: requiring that for random instances z agreeing with x on a feature set, M(x) = M(z) with probability at least δ ∈ (0,1]. Computing small δ-sufficient reasons (δ-SRs) is theoretically difficult, with strong inapproximability results even for "interpretable" models like decision trees. The paper proposes the (δ,ε)-SR concept, a simple relaxation of δ-SRs, and proves that such explanations can be computed efficiently on linear models.
Core Problem: How to provide mathematically guaranteed small-sized explanations for machine learning model decisions. Traditional sufficient reasons require 100% certainty, but this often results in explanations too large for human comprehension.
Problem Significance:
Miller (1956) noted that explanations exceeding nine features may be too large for humans
Empirical studies demonstrate that explanations should be concise (Narayanan et al., 2018; Lage et al., 2019)
In practical applications, users care more about explanation size than minor variations in probabilistic guarantees
Limitations of Existing Approaches:
Computing minimum δ-SRs is NP-hard even for decision trees
For linear models, exact probability computation is #P-hard
Strong inapproximability results exist: no good approximation ratios achievable in polynomial time
Research Motivation:
Users are more sensitive to explanation size than minor changes in probabilistic guarantees
Need to balance theoretical tractability with practical utility
The special structure of linear models may allow efficient algorithms
Input: Linear model L, instance x, parameters δ, ε, β
1. δ* ← Sample uniformly from [δ-ε, δ+ε]
2. Compute all feature scores s_i
3. Construct partial instance sequence y^(0), ..., y^(n)
4. Set sample count m = (log 2n)/(2ε²δ²) log(2log n/β)
5. Use binary search to find minimum k such that estimated probability ≥ δ*
6. Return (δ*, y^(k*))
Darwiche, A. and Hirth, A. (2020). On the Reasons Behind Decisions. ECAI 2020.
Barceló, P., Monet, M., Pérez, J., and Subercaseaux, B. (2020). Model interpretability through the lens of computational complexity. NeurIPS 2020.
Wäldchen, S., MacDonald, J., Hauch, S., and Kutyniok, G. (2021). The computational complexity of understanding binary classifier decisions. JAIR.
Arenas, M., Barceló, P., Romero Orth, M., and Subercaseaux, B. (2022). On computing probabilistic explanations for decision trees. NeurIPS 2022.
Kozachinskiy, A. (2023). Inapproximability of sufficient reasons for decision trees. arXiv:2304.02781.
This paper makes important theoretical contributions to Formal Explainable AI, being the first to prove tractability of probabilistic explanations on linear models, providing rare positive results in the field. While there is room for improvement in practical applicability, its theoretical value and methodological innovation make it an important work in the field.