Causality Enhancement for Cross-Domain Recommendation

Wu, Wu, Jiang et al.

Cross-domain recommendation forms a crucial component in recommendation systems. It leverages auxiliary information through source domain tasks or features to enhance target domain recommendations. However, incorporating inconsistent source domain tasks may result in insufficient cross-domain modeling or negative transfer, while incorporating source domain features without considering the underlying causal relationships may limit their contribution to final predictions. Thus, a natural idea is to directly train a cross-domain representation on a causality-labeled dataset from the source to target domain. Yet this direction has rarely been explored, as identifying unbiased real causal labels is highly challenging in real-world scenarios. In this work, we attempt to take a first step in this direction by proposing a causality-enhanced framework, named CE-CDR. Specifically, we first reformulate the cross-domain recommendation as a causal graph for principled guidance. We then construct a causality-aware dataset heuristically. Subsequently, we derive a theoretically unbiased Partial Label Causal Loss to generalize beyond the biased causality-aware dataset to unseen cross-domain patterns, yielding an enriched cross-domain representation, which is then fed into the target model to enhance target-domain recommendations. Theoretical and empirical analyses, as well as extensive experiments, demonstrate the rationality and effectiveness of CE-CDR and its general applicability as a model-agnostic plugin. Moreover, it has been deployed in production since April 2025, showing its practical value in real-world applications.

Basic Information

Paper ID: 2510.14641
Title: Causality Enhancement for Cross-Domain Recommendation
Authors: Zhibo Wu, Yunfan Wu, Lin Jiang, Ping Yang, Yao Hu (Xiaohongshu Co., Ltd)
Classification: cs.IR cs.AI
Conference: Not specified (the PDF still carries the template placeholder "Conference acronym 'XX")
Paper Link: https://arxiv.org/abs/2510.14641

Abstract

Cross-domain recommendation is an important component of recommendation systems, which enhances target domain recommendations through auxiliary information from source domain tasks or features. However, introducing inconsistent source domain tasks may lead to insufficient cross-domain modeling or negative transfer, while introducing source domain features without considering potential causal relationships may limit their contribution to final predictions. This paper proposes a causality enhancement framework CE-CDR, which first reformulates cross-domain recommendation as a causal graph, then heuristically constructs a causality-aware dataset, and derives a theoretically unbiased partial-label causal loss function to generalize to unseen cross-domain patterns. The method has been deployed in production environments, demonstrating its practical application value.

Research Background and Motivation

Core Problems

Traditional cross-domain recommendation systems face two main challenges:

Task Inconsistency: Multi-task learning methods promote cross-domain knowledge transfer through parameter or representation sharing, but may fail to capture valuable cross-domain information due to task inconsistency between domains, potentially causing negative transfer
Causal Sparsity: Methods that treat source domain behavior as additional input features ignore the causal sparsity between source and target domains. User preferences in the target domain are not necessarily influenced by source domain behavior

Research Motivation

Existing methods either learn functions L→S and L→T (multi-task learning) or learn functions (L,S)→T (feature fusion), but none directly model cross-domain causal relationships. This paper proposes training cross-domain representations directly on causality-annotated datasets, learning the function (L,S)→I→T, where I represents valuable transferable cross-domain information.

Challenges and Innovation

The main challenge is that identifying unbiased ground-truth causal labels in real-world scenarios is extremely difficult. This paper addresses this by heuristically constructing causality-aware datasets and designing theoretically unbiased loss functions.

Core Contributions

Causal Graph Reformulation: Reformulates the cross-domain recommendation task as a causal graph, providing principled guidance for method design
Causality Enhancement Framework: Proposes the CE-CDR framework that directly models cross-domain causal relationships from both data and model perspectives
Theoretical Innovation: Derives theoretically unbiased partial-label causal loss (PLCL) that can generalize to unseen causal patterns
Practical Validation: Serves as a model-agnostic plugin with universal applicability, already deployed in production environments with significant results

Method Details

Task Definition

Consider a cross-domain recommendation scenario with source domain s and target domain t, where both domains share a user set U. Each domain has its own item set: source domain I_s and target domain I_t. User u has interaction sequences S^u_s and S^u_t in both domains. The goal is to leverage source domain knowledge to improve target domain recommendation accuracy.

Model Architecture

The CE-CDR framework contains three core modules:

1. Causality Labeling Module (CLM)

Core Assumption: Similarity-Causality Alignment Hypothesis

For a given user, their preference for source domain item i_s causally influences their preference for similar target domain item i_t

Implementation Method:

Uses both content similarity and behavioral similarity metrics
Content embeddings learned through cross-domain shared category classification tasks
Behavioral embeddings generated through graph encoders of global user-item interaction graphs
Similarity calculation formula:

sim^{c,b}(i_s, i_t) = (S^{c,b}_{i_s})^T S^{c,b}_{i_t}

Overall similarity computed through CDF calibration and maximum selection:

sim(i_s, i_t) = max(F_c(sim^c(i_s, i_t)), F_b(sim^b(i_s, i_t)))
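The two formulas above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: the empirical CDF built from a reference sample of similarities, the helper names, the toy embeddings, and the rule that pairs at or above the threshold τ become positive causal samples are all assumptions made for the example.

```python
from bisect import bisect_right


def dot(a, b):
    """Inner product of two equal-length embedding vectors."""
    return sum(x * y for x, y in zip(a, b))


def make_empirical_cdf(samples):
    """Return F(v) = fraction of reference similarities that are <= v.

    Assumption: the CDF calibration is approximated by an empirical CDF.
    """
    ordered = sorted(samples)
    n = len(ordered)

    def cdf(v):
        return bisect_right(ordered, v) / n

    return cdf


def overall_similarity(content_s, content_t, behav_s, behav_t, cdf_c, cdf_b):
    """sim = max(F_c(sim^c), F_b(sim^b)) for one (i_s, i_t) pair."""
    sim_c = dot(content_s, content_t)   # content similarity
    sim_b = dot(behav_s, behav_t)       # behavioral similarity
    return max(cdf_c(sim_c), cdf_b(sim_b))


# Hypothetical reference distributions of raw similarities.
cdf_c = make_empirical_cdf([0.1, 0.2, 0.4, 0.8])
cdf_b = make_empirical_cdf([1.0, 2.0, 3.0, 4.0])

score = overall_similarity([1.0, 0.0], [0.5, 0.5],
                           [1.0, 1.0], [1.0, 0.5],
                           cdf_c, cdf_b)

TAU = 0.9  # threshold value reported for Douban and Amazon
is_positive_pair = score >= TAU  # assumed labeling rule
```

Calibrating each raw similarity through its own CDF puts the content and behavioral scores on a common [0, 1] scale, so taking the maximum does not systematically favor whichever metric happens to have the larger raw range.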

2. Direct Causal Modeling Module (DCMM)

Backbone Model: Attention mechanism based on SASRec

f(u, S^u_s, i_t) = σ([f_{se}(f_{fe}(u), f_{em}(i^{u,1}_s), f_{em}(i^{u,2}_s), ...)]^T f_{em}(i_t))
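The shape of this scoring function can be illustrated with a small sketch. To stay short, it swaps the SASRec-style sequence encoder f_{se} for plain mean pooling, which is an assumption for illustration only; the function names and toy vectors are likewise hypothetical.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def mean_pool(vectors):
    """Stand-in for the sequence encoder f_se (NOT SASRec attention)."""
    dim = len(vectors[0])
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]


def causal_score(user_vec, source_item_vecs, target_item_vec):
    """sigma(encoded(user, source sequence)^T embedding(target item))."""
    user_repr = mean_pool([user_vec] + source_item_vecs)
    logit = sum(a * b for a, b in zip(user_repr, target_item_vec))
    return sigmoid(logit)


# Toy inputs: one user-feature vector, three source-domain item embeddings,
# and one candidate target-domain item embedding.
score = causal_score([0.0, 0.0],
                     [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]],
                     [1.0, 1.0])
```

The output is a probability-like value in (0, 1) that can be read as the estimated chance that the user's source-domain history causally drives preference for the candidate target item.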

Partial-Label Causal Loss (PLCL): The core innovation lies in handling partial annotation problems. Since high similarity is not a necessary condition for causality, training directly on datasets constructed by CLM introduces bias.

Proposition 1: Given true propensity score e(x), the causal model f(x) can learn theoretically unbiased causal probability estimates through optimization with the following loss:

L = (1/n)∑[h(x)δ^1_f(x) + (1-h(x))δ^0_f(x)]

where the corrected label is:

h(x) = sg[z + (1-z)f(x)(1-e(x))/(1-f(x)e(x))]
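One way to read the corrected label is as a posterior probability of true causality given the observed label. The sketch below assumes the standard positive-unlabeled setting, which is an interpretation rather than a statement taken from the paper: y is the unobserved true causal label, z = 1 can only occur when y = 1, f(x) = P(y = 1 | x), e(x) = P(z = 1 | y = 1, x), and sg denotes the stop-gradient operator.

```latex
\begin{aligned}
P(y = 1 \mid z = 1, x) &= 1, \\
P(z = 0 \mid x) &= 1 - P(z = 1 \mid x) = 1 - f(x)\,e(x), \\
P(y = 1 \mid z = 0, x)
  &= \frac{P(z = 0 \mid y = 1, x)\,P(y = 1 \mid x)}{P(z = 0 \mid x)}
   = \frac{f(x)\,\bigl(1 - e(x)\bigr)}{1 - f(x)\,e(x)}, \\
h(x) &= \operatorname{sg}\!\left[\, z + (1 - z)\,
        \frac{f(x)\,\bigl(1 - e(x)\bigr)}{1 - f(x)\,e(x)} \right].
\end{aligned}
```

Under these assumptions, labeled pairs keep a target of 1, while unlabeled pairs receive a soft target equal to the probability that they are causal but were simply missed by the labeling heuristic.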

Proposition 2: The propensity model e(x) is optimized through the following loss:

L = (1/n)∑h(x)[zδ^1_e(x) + (1-z)δ^0_e(x)]
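The two losses in Propositions 1 and 2 can be sketched numerically as follows. This is a minimal illustration under stated assumptions: δ^1 and δ^0 are taken to be the cross-entropy terms -log p and -log(1 - p), probabilities are clipped for numerical stability, and the function names and toy values are hypothetical. Plain Python has no automatic differentiation, so the stop-gradient on h(x) is only noted in comments; in a deep learning framework, h would be detached before it enters either loss.

```python
import math

EPS = 1e-7  # clipping constant (assumption, for numerical stability)


def clip(p):
    return min(max(p, EPS), 1.0 - EPS)


def corrected_label(z, f, e):
    """h(x) = z + (1 - z) * f(1 - e) / (1 - f e), treated as a constant."""
    f, e = clip(f), clip(e)
    return z + (1 - z) * f * (1.0 - e) / (1.0 - f * e)


def causal_loss(zs, fs, es):
    """Proposition 1: cross-entropy of f(x) against the soft label h(x)."""
    total = 0.0
    for z, f, e in zip(zs, fs, es):
        h = corrected_label(z, f, e)
        f = clip(f)
        total += -h * math.log(f) - (1.0 - h) * math.log(1.0 - f)
    return total / len(zs)


def propensity_loss(zs, fs, es):
    """Proposition 2: cross-entropy of e(x) against z, weighted by h(x)."""
    total = 0.0
    for z, f, e in zip(zs, fs, es):
        h = corrected_label(z, f, e)
        e = clip(e)
        total += h * (-z * math.log(e) - (1 - z) * math.log(1.0 - e))
    return total / len(zs)


# Toy batch: observed labels z, causal model outputs f, propensity outputs e.
zs = [1, 0, 0]
fs = [0.9, 0.5, 0.1]
es = [0.8, 0.5, 0.5]

hs = [corrected_label(z, f, e) for z, f, e in zip(zs, fs, es)]
loss_f = causal_loss(zs, fs, es)
loss_e = propensity_loss(zs, fs, es)
```

In this toy batch the labeled pair keeps h = 1, while the unlabeled pair with f = 0.5 and e = 0.5 receives h = 1/3, so it contributes partly as a positive rather than being treated as a hard negative.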

3. Causality Enhancement Module (CEM)

Cross-Domain Self-Attention: Treats source and target domain representations as a sequence of length 2

[r^u_s, r^u_t] = P^u = softmax((X^u W^q)(X^u W^k)^T/√d)(X^u W^v)
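A small sketch of scaled dot-product self-attention over a length-2 sequence may make the formula concrete. The helper functions, the identity projection matrices, and the toy inputs are assumptions chosen so the result is easy to check by hand; learned W^q, W^k, W^v would replace them in practice.

```python
import math


def matmul(a, b):
    """Multiply matrix a (n x m) by matrix b (m x p), both lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(col) for col in zip(*m)]


def softmax_rows(m):
    """Row-wise softmax with max-subtraction for numerical stability."""
    out = []
    for row in m:
        peak = max(row)
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([v / total for v in exps])
    return out


def cross_domain_self_attention(x, w_q, w_k, w_v):
    """softmax((X Wq)(X Wk)^T / sqrt(d)) (X Wv) for a 2-row input X."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d = len(q[0])
    scores = [[s / math.sqrt(d) for s in row]
              for row in matmul(q, transpose(k))]
    weights = softmax_rows(scores)
    return matmul(weights, v), weights


identity = [[1.0, 0.0], [0.0, 1.0]]   # placeholder projections (assumption)
x_u = [[1.0, 0.0],                    # row 0: source-domain representation
       [0.0, 1.0]]                    # row 1: target-domain representation

(r_s, r_t), attn = cross_domain_self_attention(x_u, identity, identity, identity)
```

Because the sequence has only two positions, the attention matrix is 2 x 2: each output row is a convex combination of the source and target value vectors, which is how information is exchanged between the two domains.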

Cross-Domain Gated Attention: Adaptively fuses based on user features

g^u = σ(LeakyReLU(Q^u W^{g,1} + b^{g,1})W^{g,2} + b^{g,2})
ĥ^u = g^u ⊙ r^u_s + (1-g^u) ⊙ r^u_t
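The gating step can be sketched as a two-layer network followed by an element-wise blend. The sketch assumes the gate g^u is a vector with the same dimension as r^u_s and r^u_t, uses a LeakyReLU slope of 0.01, and picks zero weights so the gate is exactly 0.5; all of these are illustrative choices, not details confirmed by the paper.

```python
import math


def leaky_relu(v, slope=0.01):
    return v if v > 0 else slope * v


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def affine(vec, weight, bias):
    """Row vector (len m) times weight (m x n) plus bias (len n)."""
    return [sum(vec[i] * weight[i][j] for i in range(len(vec))) + bias[j]
            for j in range(len(bias))]


def gated_fusion(q_u, r_s, r_t, w1, b1, w2, b2):
    """g = sigmoid(LeakyReLU(q W1 + b1) W2 + b2); fused = g*r_s + (1-g)*r_t."""
    hidden = [leaky_relu(v) for v in affine(q_u, w1, b1)]
    gate = [sigmoid(v) for v in affine(hidden, w2, b2)]
    fused = [g * s + (1.0 - g) * t for g, s, t in zip(gate, r_s, r_t)]
    return fused, gate


zeros_2x2 = [[0.0, 0.0], [0.0, 0.0]]

# With all-zero weights and biases the gate is 0.5 in every dimension,
# so the fused vector is the plain average of the two representations.
fused, gate = gated_fusion([1.0, 2.0], [1.0, 0.0], [0.0, 1.0],
                           zeros_2x2, [0.0, 0.0], zeros_2x2, [0.0, 0.0])

# A large positive output bias pushes the gate toward the source side.
fused_src, gate_src = gated_fusion([1.0, 2.0], [1.0, 0.0], [0.0, 1.0],
                                   zeros_2x2, [0.0, 0.0], zeros_2x2, [10.0, 10.0])
```

Since the gate depends only on the user features Q^u, the blend between source and target representations is personalized per user rather than fixed globally.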

Technical Innovations

Causal Perspective: First systematically examines cross-domain recommendation from a causal inference perspective
Partial-Label Learning: Innovatively introduces partial-label learning into causal modeling to address annotation bias
Theoretical Guarantees: Provides theoretically unbiased loss functions ensuring model generalization
Adaptive Fusion: Dynamic gating mechanism adjusts cross-domain information contribution based on user features

Experimental Setup

Datasets

Douban: Contains Book and Music domains with 1,736 shared users
Amazon: Contains the "Movies and TV" and "CDs and Vinyl" domains with 18,547 shared users
Industry: Large-scale dataset from Xiaohongshu with Note and Video domains, 167,766,638 shared users

Evaluation Metrics

HR@K: Whether relevant items are included in top-K recommendations
NDCG@K: Cumulative gain considering ranking positions
K values: 15 for Douban and Amazon, 150 for Industry
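For reference, the two metrics can be computed per user as sketched below. The sketch assumes a single held-out relevant item per user, so the ideal DCG is 1 and NDCG reduces to 1 / log2(rank + 1) with a 1-based rank; whether the paper uses exactly this protocol is an assumption, and the function names are hypothetical.

```python
import math


def hit_rate_at_k(ranked_items, target_item, k):
    """1.0 if the held-out item appears in the top-k list, else 0.0."""
    return 1.0 if target_item in ranked_items[:k] else 0.0


def ndcg_at_k(ranked_items, target_item, k):
    """NDCG@k with one relevant item: 1 / log2(position + 1), 1-based."""
    top_k = ranked_items[:k]
    if target_item not in top_k:
        return 0.0
    zero_based_rank = top_k.index(target_item)
    return 1.0 / math.log2(zero_based_rank + 2)


ranked = ["a", "b", "c", "d"]          # model's ranking, best first
hr = hit_rate_at_k(ranked, "c", 3)     # item "c" is at position 3
ndcg = ndcg_at_k(ranked, "c", 3)       # 1 / log2(4)
miss = hit_rate_at_k(ranked, "d", 3)   # item "d" falls outside the top 3
```

Dataset-level HR@K and NDCG@K are then the averages of these per-user values.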

Baseline Methods

Base Model: Basic two-tower model
Multi-task Learning Methods: CoNet, MAN, DiCUR
Feature Fusion Methods: MiNet, TrineCDR
Enhanced Versions: Various baseline methods + CE plugin

Implementation Details

Embedding dimensions: 32 for Douban and Amazon, 64 for Industry
Similarity threshold τ: 0.9 for Douban and Amazon, 0.85 for Industry
Optimizer: Adam with default settings

Experimental Results

Main Results

CE-CDR outperforms baseline methods on all datasets and evaluation metrics:

Douban Dataset Performance:

Book domain: HR@15 reaches 0.5023 (vs. best baseline 0.4586), NDCG@15 reaches 0.3711 (vs. 0.3436)
Music domain: HR@15 reaches 0.4983, NDCG@15 reaches 0.3600
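To put the Book-domain numbers in relative terms, the gain over the best baseline can be worked out directly from the figures reported above; the helper name is only for illustration.

```python
def relative_lift(new, old):
    """Relative improvement of `new` over `old`, as a fraction."""
    return (new - old) / old


hr_lift = relative_lift(0.5023, 0.4586)     # HR@15, Douban Book
ndcg_lift = relative_lift(0.3711, 0.3436)   # NDCG@15, Douban Book

print(f"HR@15 relative lift: {hr_lift:.2%}")      # 9.53%
print(f"NDCG@15 relative lift: {ndcg_lift:.2%}")  # 8.00%
```

That is roughly a 9.5% relative gain in HR@15 and an 8.0% relative gain in NDCG@15 over the best baseline on the Book domain.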

Industry Dataset Performance:

Note domain: HR@150 is 0.2733, NDCG@150 is 0.0933
Video domain: HR@150 is 0.3645, NDCG@150 is 0.0992

Ablation Studies

Component contribution analysis:

CLM Module: Significant performance degradation when removed, validating the importance of high-quality positive causal samples
Causal Loss: Direct use of D+ as ground-truth causal set leads to performance decline
Self-Attention: Removing cross-domain self-attention affects information exchange effectiveness
Gating Mechanism: Simple concatenation replacing gated attention reduces personalized fusion effectiveness
Caching Strategy: Using the caching strategy causes almost no performance loss while eliminating the added inference latency

In-Depth Analysis

Generalization Capability Verification: By excluding one similarity construction method from the dataset, DCMM successfully identifies unseen causal patterns, proving the model's generalization beyond annotation strategies.

Feature Contribution Analysis: Feature masking experiments show CE-CDR's cross-domain feature activation variance is significantly larger than baseline methods, proving more effective cross-domain information utilization.

Dynamic Gating Interpretation: Users with rich source domain information exhibit higher gating values, validating the effectiveness of the adaptive fusion mechanism.

Online Experiments

Production Environment A/B Testing (Xiaohongshu Platform):

Video scenario: Clicks +0.33%, CTR +0.37%, Diversity +0.12%
Note scenario: Clicks +0.28%, CTR +0.43%, Diversity +0.16%
Deployed online in April 2025

Related Work

Cross-Domain Recommendation

Existing methods fall into two categories:

Multi-Task Learning: Promotes knowledge transfer through parameter or representation sharing, but suffers from task inconsistency issues
Feature Fusion: Treats source domain behavior as input features, but ignores causal sparsity

Causal Recommendation

Primarily addresses three problem categories:

Data Bias: Exposure bias, popularity bias, conformity bias
Missing Data: Inability to capture comprehensive user preferences
Interpretability: Improving recommendation interpretability, diversity, and fairness

The authors position this paper as the first to apply causal inference to directly modeling causal relationships between domains in cross-domain recommendation.

Conclusions and Discussion

Main Conclusions

Effectiveness: CE-CDR significantly outperforms existing methods on multiple datasets
Generality: Serves as a model-agnostic plugin that can enhance various baseline methods
Practicality: Successfully deployed in large-scale production environments
Theoretical Soundness: Provides a theoretically guaranteed causal modeling framework

Limitations

Assumption Dependency: The similarity-causality alignment hypothesis may not apply to all scenarios
Annotation Quality: Heuristic causal annotation may introduce noise
Computational Overhead: Requires additional causal model training and inference

Future Directions

Assumption Refinement: Explore more precise causal relationship identification methods
Multi-Domain Extension: Extend to causal modeling with multiple source domains
Dynamic Causality: Consider temporal dynamics in causal relationship evolution

In-Depth Evaluation

Strengths

Strong Innovation: First to systematically introduce causal inference into cross-domain recommendation
Solid Theory: Provides rigorous mathematical derivations and theoretical guarantees
Comprehensive Experiments: Multi-dataset validation, ablation studies, and online A/B testing
High Practical Value: Successfully deployed in production environments with significant results
Universal Applicability: Model-agnostic plugin design enhances applicability

Weaknesses

Assumption Limitations: The similarity-causality alignment hypothesis may oversimplify real causal relationships
Annotation Bias: Heuristic causal annotation methods may still introduce systematic bias
Increased Complexity: Additional modules increase system complexity
Domain Specificity: Some designs may be specialized for content platforms, with generalization to be verified

Impact

Academic Contribution: Provides a new causal modeling paradigm for cross-domain recommendation research
Practical Value: Successful production deployment demonstrates commercial value
Research Inspiration: May inspire more applications of causal inference in recommendation systems

Applicable Scenarios

Multi-Domain Platforms: Content sharing platforms, e-commerce platforms with multiple business domains
Cold-Start Problems: New user or new item recommendation scenarios
Cross-Platform Recommendation: Scenarios requiring cross-platform user behavior transfer

References

The paper cites 76 relevant references covering multiple domains including cross-domain recommendation, causal inference, and attention mechanisms, providing a solid theoretical foundation for method design.

Overall Assessment: The paper systematically brings causal inference into cross-domain recommendation and proposes a solution that is both theoretically grounded and effective in practice. Its claims are supported by theoretical analysis, offline experiments on three datasets, and a production deployment with online A/B testing.