Comparative Explanations via Counterfactual Reasoning in Recommendations
Yu, Hu
Explainable recommendation through counterfactual reasoning seeks to identify the influential aspects of items in recommendations, which can then be used as explanations. However, state-of-the-art approaches, which aim to minimize changes in product aspects while reversing their recommended decisions according to an aggregated decision boundary score, often lead to factual inaccuracies in explanations. To solve this problem, in this work we propose a novel method of Comparative Counterfactual Explanations for Recommendation (CoCountER). CoCountER creates counterfactual data based on soft swap operations, enabling explanations for recommendations of arbitrary pairs of comparative items. Empirical experiments validate the effectiveness of our approach.
academic
Explainable recommendation systems seek to identify influential factors of recommended items through counterfactual reasoning, which can serve as explanations. However, existing state-of-the-art methods aim to minimize changes in product attributes while reversing recommendation decisions based on aggregated decision boundary scores, often resulting in factual inaccuracies in explanations. To address this issue, this paper proposes a novel comparative counterfactual explanation method for recommendations (CoCountER). CoCountER creates counterfactual data based on soft swap operations, enabling explanations for recommendations of arbitrary comparative item pairs. Empirical experiments validate the effectiveness of the proposed method.
Explainable recommendation systems aim to provide users with high-quality recommendations while offering clear explanations to help users understand the logic behind recommendations, thereby increasing user trust and satisfaction with the system.
Issues with Matching-based Methods: Template-based explanation methods (e.g., EFM, MTER, A2CF) optimize recommendation and attribute representations through tensor factorization, but they may select as explanations attributes that have high matching scores yet perform poorly in practice.
Defects of Existing Counterfactual Methods: Methods such as CountER flip the recommendation decision by applying the smallest possible reduction to an item's attribute values, but the resulting explanations can be factually inaccurate. In the paper's headphone example, CountER might identify comfort as the explanation for recommending headphone A even though headphone A actually performs worse than headphone B on comfort.
Root Cause: Existing methods optimize so that the sum of all reduced attributes reaches the decision boundary score, rather than pushing each attribute individually toward the decision boundary, which leads to counterintuitive explanations.
The paper addresses these issues through comparative counterfactual reasoning: it performs attribute-level swap operations between item pairs to generate more faithful and intuitive explanations.
Novel counterfactual data creation method: a counterfactual data generation mechanism based on soft swap operations.
Comparative counterfactual explanation framework: CoCountER can explain recommendations for arbitrary pairs of comparative items.
Experimental validation: the method is validated on multiple datasets and outperforms existing methods on counterfactual-related metrics.
Given a target user u, an explained item i, and a reference item j whose recommendation scores satisfy r_{u,i} > r_{u,j}, the goal is to identify the key attributes that drive the recommendation decision by applying as few swap operations between i and j as possible.
The method uses the Sentires tool to extract (user, item, attribute, sentiment) quadruples from user reviews and constructs two matrices:
User-attribute attention matrix X: X_{u,a} represents user u's attention to attribute a
Item-attribute quality matrix Y: Y_{i,a} represents item i's performance on attribute a
Calculation formulas:
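\[
X_{u,a} =
\begin{cases}
0, & \text{if user } u \text{ did not mention attribute } a \\
1 + (N-1)\,\dfrac{1 - e^{-t_{u,a}}}{1 + e^{-t_{u,a}}}, & \text{otherwise}
\end{cases}
\]
\[
Y_{i,a} =
\begin{cases}
0, & \text{if item } i \text{ was not mentioned on attribute } a \\
1 + \dfrac{N-1}{1 + e^{-t_{i,a} \cdot s_{i,a}}}, & \text{otherwise}
\end{cases}
\]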
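The two formulas can be sketched in NumPy as follows. The summary does not define the symbols, so this sketch assumes the usual convention for such matrices: N is the top of the rating scale (5 here), t is a mention count, and s is an average sentiment in [-1, 1]. The function names are hypothetical, and a zero count is used to stand for "not mentioned".

```python
import numpy as np


def user_attention(t_ua, N=5):
    """X_{u,a}: 0 if never mentioned, else a value rescaled into (1, N).

    Assumption: t_ua holds mention counts, with 0 meaning "not mentioned".
    """
    t = np.asarray(t_ua, dtype=float)
    val = 1 + (N - 1) * (1 - np.exp(-t)) / (1 + np.exp(-t))
    return np.where(t > 0, val, 0.0)


def item_quality(t_ia, s_ia, N=5):
    """Y_{i,a}: 0 if never mentioned, else a value rescaled into (1, N).

    Assumption: t_ia holds mention counts and s_ia average sentiments.
    """
    t = np.asarray(t_ia, dtype=float)
    s = np.asarray(s_ia, dtype=float)
    val = 1 + (N - 1) / (1 + np.exp(-t * s))
    return np.where(t > 0, val, 0.0)


X_row = user_attention([0, 1, 10])
Y_row = item_quality([0, 3, 3], [0.5, 0.0, 1.0])
```

Under these assumptions, a frequently mentioned attribute pushes X toward N, and neutral sentiment (s = 0) places Y at the midpoint (N + 1) / 2.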
Soft Swap Operation: The swap is made differentiable through a sigmoid function whose output acts as a swap weight; values close to 0 indicate no swap and values close to 1 indicate a complete swap.
Comparative Framework: Unlike traditional single-item explanations, the method provides comparative explanations between items.
Generality: When the reference item is fixed and only the first part of the swap function is computed, the method reduces to a reduction-based counterfactual method.
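One plausible reading of the soft swap is a sigmoid-gated interpolation between the two items' attribute vectors. The sketch below follows that reading and is not the paper's exact formulation: the function `soft_swap`, the gate parameter `theta`, and the `reduction_only` flag are hypothetical names. The flag mimics the "first part only" case by changing the explained item while leaving the reference item fixed.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def soft_swap(y_i, y_j, theta, reduction_only=False):
    """Softly exchange attribute values between items i and j.

    theta: unconstrained per-attribute parameters (assumed learnable);
    sigmoid(theta) near 0 means no swap, near 1 means a complete swap.
    """
    m = sigmoid(np.asarray(theta, dtype=float))
    # First part: move item i's attributes toward item j's.
    y_i_cf = (1 - m) * y_i + m * y_j
    if reduction_only:
        # Reference item stays fixed; only the explained item changes.
        return y_i_cf, y_j
    # Second part: move item j's attributes toward item i's.
    y_j_cf = (1 - m) * y_j + m * y_i
    return y_i_cf, y_j_cf


y_i = np.array([5.0, 2.0, 3.0])
y_j = np.array([2.0, 4.0, 1.0])
theta = np.array([-50.0, 50.0, 0.0])  # no swap, full swap, half swap
y_i_cf, y_j_cf = soft_swap(y_i, y_j, theta)
```

Because every operation is differentiable in `theta`, the gates could in principle be learned by gradient descent together with a sparsity penalty, although the specific loss used in the paper is not given in this summary.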
Impact of Reference Item Ranking: Lower-ranked reference items impose fewer optimization constraints, which makes it easier to discover effective counterfactual attributes and improves performance on the Probability of Necessity (PN) and Probability of Sufficiency (PS) metrics.
Impact of Reference Item Quantity: Moderately increasing the number of reference items improves performance, but too many reference items introduce noise and slightly degrade performance.
Yang et al. proposed the concept of comparative explanations but adopted autoregressive decoders to generate textual explanations, differing from this paper's counterfactual reasoning perspective.
The paper cites 30 relevant references covering multiple related fields including explainable recommendations, counterfactual reasoning, and causal inference, providing a solid theoretical foundation for the research.
Overall Assessment: This is a high-quality research paper that proposes an innovative comparative counterfactual explanation framework, addressing important problems with existing methods. The method design is reasonable, experimental validation is comprehensive, and it makes significant contributions to the explainable recommendation field. While some limitations exist, overall it represents valuable research work.