2025-11-15T07:25:11.598246

Causality Enhancement for Cross-Domain Recommendation

Wu, Wu, Jiang et al.
Cross-domain recommendation forms a crucial component in recommendation systems. It leverages auxiliary information through source domain tasks or features to enhance target domain recommendations. However, incorporating inconsistent source domain tasks may result in insufficient cross-domain modeling or negative transfer. While incorporating source domain features without considering the underlying causal relationships may limit their contribution to final predictions. Thus, a natural idea is to directly train a cross-domain representation on a causality-labeled dataset from the source to target domain. Yet this direction has been rarely explored, as identifying unbiased real causal labels is highly challenging in real-world scenarios. In this work, we attempt to take a first step in this direction by proposing a causality-enhanced framework, named CE-CDR. Specifically, we first reformulate the cross-domain recommendation as a causal graph for principled guidance. We then construct a causality-aware dataset heuristically. Subsequently, we derive a theoretically unbiased Partial Label Causal Loss to generalize beyond the biased causality-aware dataset to unseen cross-domain patterns, yielding an enriched cross-domain representation, which is then fed into the target model to enhance target-domain recommendations. Theoretical and empirical analyses, as well as extensive experiments, demonstrate the rationality and effectiveness of CE-CDR and its general applicability as a model-agnostic plugin. Moreover, it has been deployed in production since April 2025, showing its practical value in real-world applications.
academic

Causality Enhancement for Cross-Domain Recommendation

Basic Information

  • Paper ID: 2510.14641
  • Title: Causality Enhancement for Cross-Domain Recommendation
  • Authors: Zhibo Wu, Yunfan Wu, Lin Jiang, Ping Yang, Yao Hu (Xiaohongshu Co., Ltd)
  • Classification: cs.IR cs.AI
  • Conference: Conference acronym 'XX (appears to be conference template based on PDF content)
  • Paper Link: https://arxiv.org/abs/2510.14641

Abstract

Cross-domain recommendation is an important component of recommendation systems, which enhances target domain recommendations through auxiliary information from source domain tasks or features. However, introducing inconsistent source domain tasks may lead to insufficient cross-domain modeling or negative transfer, while introducing source domain features without considering potential causal relationships may limit their contribution to final predictions. This paper proposes a causality enhancement framework CE-CDR, which first reformulates cross-domain recommendation as a causal graph, then heuristically constructs a causality-aware dataset, and derives a theoretically unbiased partial-label causal loss function to generalize to unseen cross-domain patterns. The method has been deployed in production environments, demonstrating its practical application value.

Research Background and Motivation

Core Problems

Traditional cross-domain recommendation systems face two main challenges:

  1. Task Inconsistency: Multi-task learning methods promote cross-domain knowledge transfer through parameter or representation sharing, but may fail to capture valuable cross-domain information due to task inconsistency between domains, potentially causing negative transfer
  2. Causal Sparsity: Methods that treat source domain behavior as additional input features ignore the causal sparsity between source and target domains. User preferences in the target domain are not necessarily influenced by source domain behavior

Research Motivation

Existing methods either learn functions L→S and L→T (multi-task learning) or learn functions (L,S)→T (feature fusion), but none directly model cross-domain causal relationships. This paper proposes training cross-domain representations directly on causality-annotated datasets, learning the function (L,S)→I→T, where I represents valuable transferable cross-domain information.

Challenges and Innovation

The main challenge is that identifying unbiased ground-truth causal labels in real-world scenarios is extremely difficult. This paper addresses this by heuristically constructing causality-aware datasets and designing theoretically unbiased loss functions.

Core Contributions

  1. Causal Graph Reconstruction: First reformulates cross-domain recommendation tasks as causal graphs, providing principled guidance for method design
  2. Causality Enhancement Framework: Proposes the CE-CDR framework that directly models cross-domain causal relationships from both data and model perspectives
  3. Theoretical Innovation: Derives theoretically unbiased partial-label causal loss (PLCL) that can generalize to unseen causal patterns
  4. Practical Validation: Serves as a model-agnostic plugin with universal applicability, already deployed in production environments with significant results

Method Details

Task Definition

Consider a cross-domain recommendation scenario with source domain s and target domain t, where both domains share a user set U. Each domain has its own item set: source domain I_s and target domain I_t. User u has interaction sequences S^u_s and S^u_t in both domains. The goal is to leverage source domain knowledge to improve target domain recommendation accuracy.

Model Architecture

The CE-CDR framework contains three core modules:

1. Causality Labeling Module (CLM)

Core Assumption: Similarity-Causality Alignment Hypothesis

For a given user, their preference for source domain item i_s causally influences their preference for similar target domain item i_t

Implementation Method:

  • Uses both content similarity and behavioral similarity metrics
  • Content embeddings learned through cross-domain shared category classification tasks
  • Behavioral embeddings generated through graph encoders of global user-item interaction graphs
  • Similarity calculation formula:
sim^{c,b}(i_s, i_t) = (S^{c,b}_{i_s})^T S^{c,b}_{i_t}
  • Overall similarity computed through CDF calibration and maximum selection:
sim(i_s, i_t) = max(F_c(sim^c(i_s, i_t)), F_b(sim^b(i_s, i_t)))

2. Direct Causal Modeling Module (DCMM)

Backbone Model: Attention mechanism based on SASRec

f(u, S^u_s, i_t) = σ([f_{se}(f_{fe}(u), f_{em}(i^u,1_s), f_{em}(i^u,2_s), ...)]^T f_{em}(i_t))

Partial-Label Causal Loss (PLCL): The core innovation lies in handling partial annotation problems. Since high similarity is not a necessary condition for causality, training directly on datasets constructed by CLM introduces bias.

Proposition 1: Given true propensity score e(x), the causal model f(x) can learn theoretically unbiased causal probability estimates through optimization with the following loss:

L = (1/n)∑[h(x)δ^1_f(x) + (1-h(x))δ^0_f(x)]

where the corrected label is:

h(x) = sg[z + (1-z)f(x)(1-e(x))/(1-f(x)e(x))]

Proposition 2: The propensity model e(x) is optimized through the following loss:

L = (1/n)∑h(x)[zδ^1_e(x) + (1-z)δ^0_e(x)]

3. Causality Enhancement Module (CEM)

Cross-Domain Self-Attention: Treats source and target domain representations as a sequence of length 2

[r^u_s, r^u_t] = P^u = softmax((X^u W^q)(X^u W^k)^T/√d)(X^u W^v)

Cross-Domain Gated Attention: Adaptively fuses based on user features

g^u = σ(LeakyReLU(Q^u W^{g,1} + b^{g,1})W^{g,2} + b^{g,2})
ĥ^u = g^u ⊙ r^u_s + (1-g^u)r^u_t

Technical Innovations

  1. Causal Perspective: First systematically examines cross-domain recommendation from a causal inference perspective
  2. Partial-Label Learning: Innovatively introduces partial-label learning into causal modeling to address annotation bias
  3. Theoretical Guarantees: Provides theoretically unbiased loss functions ensuring model generalization
  4. Adaptive Fusion: Dynamic gating mechanism adjusts cross-domain information contribution based on user features

Experimental Setup

Datasets

  1. Douban: Contains Book and Music domains with 1,736 shared users
  2. Amazon: Movies and TV and CDs and Vinyl domains with 18,547 shared users
  3. Industry: Large-scale dataset from Xiaohongshu with Note and Video domains, 167,766,638 shared users

Evaluation Metrics

  • HR@K: Whether relevant items are included in top-K recommendations
  • NDCG@K: Cumulative gain considering ranking positions
  • K values: 15 for Douban and Amazon, 150 for Industry

Baseline Methods

  • Base Model: Basic two-tower model
  • Multi-task Learning Methods: CoNet, MAN, DiCUR
  • Feature Fusion Methods: MiNet, TrineCDR
  • Enhanced Versions: Various baseline methods + CE plugin

Implementation Details

  • Embedding dimensions: 32 for Douban and Amazon, 64 for Industry
  • Similarity threshold τ: 0.9 for Douban and Amazon, 0.85 for Industry
  • Optimizer: Adam with default settings

Experimental Results

Main Results

CE-CDR outperforms baseline methods on all datasets and evaluation metrics:

Douban Dataset Performance:

  • Book domain: HR@15 reaches 0.5023 (vs. best baseline 0.4586), NDCG@15 reaches 0.3711 (vs. 0.3436)
  • Music domain: HR@15 reaches 0.4983, NDCG@15 reaches 0.3600

Industry Dataset Performance:

  • Note domain: HR@150 is 0.2733, NDCG@150 is 0.0933
  • Video domain: HR@150 is 0.3645, NDCG@150 is 0.0992

Ablation Studies

Component contribution analysis:

  • CLM Module: Significant performance degradation when removed, validating the importance of high-quality positive causal samples
  • Causal Loss: Direct use of D+ as ground-truth causal set leads to performance decline
  • Self-Attention: Removing cross-domain self-attention affects information exchange effectiveness
  • Gating Mechanism: Simple concatenation replacing gated attention reduces personalized fusion effectiveness
  • Caching Strategy: Nearly no performance loss while eliminating latency increase

In-Depth Analysis

Generalization Capability Verification: By excluding one similarity construction method from the dataset, DCMM successfully identifies unseen causal patterns, proving the model's generalization beyond annotation strategies.

Feature Contribution Analysis: Feature masking experiments show CE-CDR's cross-domain feature activation variance is significantly larger than baseline methods, proving more effective cross-domain information utilization.

Dynamic Gating Interpretation: Users with rich source domain information exhibit higher gating values, validating the effectiveness of the adaptive fusion mechanism.

Online Experiments

Production Environment A/B Testing (Xiaohongshu Platform):

  • Video scenario: Clicks +0.33%, CTR +0.37%, Diversity +0.12%
  • Note scenario: Clicks +0.28%, CTR +0.43%, Diversity +0.16%
  • Deployed online in April 2025

Cross-Domain Recommendation

Existing methods fall into two categories:

  1. Multi-Task Learning: Promotes knowledge transfer through parameter or representation sharing, but suffers from task inconsistency issues
  2. Feature Fusion: Treats source domain behavior as input features, but ignores causal sparsity

Causal Recommendation

Primarily addresses three problem categories:

  1. Data Bias: Exposure bias, popularity bias, conformity bias
  2. Missing Data: Inability to capture comprehensive user preferences
  3. Interpretability: Improving recommendation interpretability, diversity, and fairness

This paper is the first to apply causal inference to direct causal relationship modeling in cross-domain recommendation.

Conclusions and Discussion

Main Conclusions

  1. Effectiveness: CE-CDR significantly outperforms existing methods on multiple datasets
  2. Generality: Serves as a model-agnostic plugin that can enhance various baseline methods
  3. Practicality: Successfully deployed in large-scale production environments
  4. Theoretical Soundness: Provides a theoretically guaranteed causal modeling framework

Limitations

  1. Assumption Dependency: The similarity-causality alignment hypothesis may not apply to all scenarios
  2. Annotation Quality: Heuristic causal annotation may introduce noise
  3. Computational Overhead: Requires additional causal model training and inference

Future Directions

  1. Assumption Refinement: Explore more precise causal relationship identification methods
  2. Multi-Domain Extension: Extend to causal modeling with multiple source domains
  3. Dynamic Causality: Consider temporal dynamics in causal relationship evolution

In-Depth Evaluation

Strengths

  1. Strong Innovation: First to systematically introduce causal inference into cross-domain recommendation
  2. Solid Theory: Provides rigorous mathematical derivations and theoretical guarantees
  3. Comprehensive Experiments: Multi-dataset validation, ablation studies, and online A/B testing
  4. High Practical Value: Successfully deployed in production environments with significant results
  5. Universal Applicability: Model-agnostic plugin design enhances applicability

Weaknesses

  1. Assumption Limitations: The similarity-causality alignment hypothesis may oversimplify real causal relationships
  2. Annotation Bias: Heuristic causal annotation methods may still introduce systematic bias
  3. Increased Complexity: Additional modules increase system complexity
  4. Domain Specificity: Some designs may be specialized for content platforms, with generalization to be verified

Impact

  1. Academic Contribution: Provides a new causal modeling paradigm for cross-domain recommendation research
  2. Practical Value: Successful production deployment demonstrates commercial value
  3. Research Inspiration: May inspire more applications of causal inference in recommendation systems

Applicable Scenarios

  1. Multi-Domain Platforms: Content sharing platforms, e-commerce platforms with multiple business domains
  2. Cold-Start Problems: New user or new item recommendation scenarios
  3. Cross-Platform Recommendation: Scenarios requiring cross-platform user behavior transfer

References

The paper cites 76 relevant references covering multiple domains including cross-domain recommendation, causal inference, and attention mechanisms, providing a solid theoretical foundation for method design.


Overall Assessment: This is a paper with significant innovative value in the cross-domain recommendation field, systematically introducing causal inference to this domain and proposing theoretically guaranteed and practically effective solutions. The paper's theoretical contributions, experimental validation, and practical deployment all demonstrate high academic and practical value.