Causality Enhancement for Cross-Domain Recommendation
Wu, Wu, Jiang et al.
Cross-domain recommendation forms a crucial component in recommendation systems. It leverages auxiliary information through source domain tasks or features to enhance target domain recommendations. However, incorporating inconsistent source domain tasks may result in insufficient cross-domain modeling or negative transfer. While incorporating source domain features without considering the underlying causal relationships may limit their contribution to final predictions. Thus, a natural idea is to directly train a cross-domain representation on a causality-labeled dataset from the source to target domain. Yet this direction has been rarely explored, as identifying unbiased real causal labels is highly challenging in real-world scenarios. In this work, we attempt to take a first step in this direction by proposing a causality-enhanced framework, named CE-CDR. Specifically, we first reformulate the cross-domain recommendation as a causal graph for principled guidance. We then construct a causality-aware dataset heuristically. Subsequently, we derive a theoretically unbiased Partial Label Causal Loss to generalize beyond the biased causality-aware dataset to unseen cross-domain patterns, yielding an enriched cross-domain representation, which is then fed into the target model to enhance target-domain recommendations. Theoretical and empirical analyses, as well as extensive experiments, demonstrate the rationality and effectiveness of CE-CDR and its general applicability as a model-agnostic plugin. Moreover, it has been deployed in production since April 2025, showing its practical value in real-world applications.
academic
Causality Enhancement for Cross-Domain Recommendation
Cross-domain recommendation is an important component of recommendation systems, which enhances target domain recommendations through auxiliary information from source domain tasks or features. However, introducing inconsistent source domain tasks may lead to insufficient cross-domain modeling or negative transfer, while introducing source domain features without considering potential causal relationships may limit their contribution to final predictions. This paper proposes a causality enhancement framework CE-CDR, which first reformulates cross-domain recommendation as a causal graph, then heuristically constructs a causality-aware dataset, and derives a theoretically unbiased partial-label causal loss function to generalize to unseen cross-domain patterns. The method has been deployed in production environments, demonstrating its practical application value.
Traditional cross-domain recommendation systems face two main challenges:
Task Inconsistency: Multi-task learning methods promote cross-domain knowledge transfer through parameter or representation sharing, but may fail to capture valuable cross-domain information due to task inconsistency between domains, potentially causing negative transfer
Causal Sparsity: Methods that treat source domain behavior as additional input features ignore the causal sparsity between source and target domains. User preferences in the target domain are not necessarily influenced by source domain behavior
Existing methods either learn functions L→S and L→T (multi-task learning) or learn functions (L,S)→T (feature fusion), but none directly model cross-domain causal relationships. This paper proposes training cross-domain representations directly on causality-annotated datasets, learning the function (L,S)→I→T, where I represents valuable transferable cross-domain information.
The main challenge is that identifying unbiased ground-truth causal labels in real-world scenarios is extremely difficult. This paper addresses this by heuristically constructing causality-aware datasets and designing theoretically unbiased loss functions.
Causal Graph Reconstruction: First reformulates cross-domain recommendation tasks as causal graphs, providing principled guidance for method design
Causality Enhancement Framework: Proposes the CE-CDR framework that directly models cross-domain causal relationships from both data and model perspectives
Theoretical Innovation: Derives theoretically unbiased partial-label causal loss (PLCL) that can generalize to unseen causal patterns
Practical Validation: Serves as a model-agnostic plugin with universal applicability, already deployed in production environments with significant results
Consider a cross-domain recommendation scenario with source domain s and target domain t, where both domains share a user set U. Each domain has its own item set: source domain I_s and target domain I_t. User u has interaction sequences S^u_s and S^u_t in both domains. The goal is to leverage source domain knowledge to improve target domain recommendation accuracy.
Partial-Label Causal Loss (PLCL):
The core innovation lies in handling partial annotation problems. Since high similarity is not a necessary condition for causality, training directly on datasets constructed by CLM introduces bias.
Proposition 1: Given true propensity score e(x), the causal model f(x) can learn theoretically unbiased causal probability estimates through optimization with the following loss:
L = (1/n)∑[h(x)δ^1_f(x) + (1-h(x))δ^0_f(x)]
where the corrected label is:
h(x) = sg[z + (1-z)f(x)(1-e(x))/(1-f(x)e(x))]
Proposition 2: The propensity model e(x) is optimized through the following loss:
Generalization Capability Verification:
By excluding one similarity construction method from the dataset, DCMM successfully identifies unseen causal patterns, proving the model's generalization beyond annotation strategies.
Feature Contribution Analysis:
Feature masking experiments show CE-CDR's cross-domain feature activation variance is significantly larger than baseline methods, proving more effective cross-domain information utilization.
Dynamic Gating Interpretation:
Users with rich source domain information exhibit higher gating values, validating the effectiveness of the adaptive fusion mechanism.
The paper cites 76 relevant references covering multiple domains including cross-domain recommendation, causal inference, and attention mechanisms, providing a solid theoretical foundation for method design.
Overall Assessment: This is a paper with significant innovative value in the cross-domain recommendation field, systematically introducing causal inference to this domain and proposing theoretically guaranteed and practically effective solutions. The paper's theoretical contributions, experimental validation, and practical deployment all demonstrate high academic and practical value.