2025-11-18T04:46:12.048153

A Survey on Multi-Behavior Sequential Recommendation

Chen, Li, Pan et al.

Recommender systems is set up to address the issue of information overload in traditional information retrieval systems, which is focused on recommending information that is of most interest to users from massive information. Generally, there is a sequential nature and heterogeneity to the behavior of a person interacting with a system, leading to the proposal of multi-behavior sequential recommendation (MBSR). MBSR is a relatively new and worthy direction for in-depth research, which can achieve state-of-the-art recommendation through suitable modeling, and some related works have been proposed. This survey aims to shed light on the MBSR problem. Firstly, we introduce MBSR in detail, including its problem definition, application scenarios and challenges faced. Secondly, we detail the classification of MBSR, including neighborhood-based methods, matrix factorization-based methods and deep learning-based methods, where we further classify the deep learning-based methods into different learning architectures based on RNN, GNN, Transformer, and generic architectures as well as architectures that integrate hybrid techniques. In each method, we present related works based on the data perspective and the modeling perspective, as well as analyze the strengths, weaknesses and features of these works. Finally, we discuss some promising future research directions to address the challenges and improve the current status of MBSR.

academic

A Survey on Multi-Behavior Sequential Recommendation

Basic Information

Paper ID: 2308.15701
Title: A Survey on Multi-Behavior Sequential Recommendation
Authors: Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming
Category: cs.IR (Information Retrieval)
Publication Date: August 30, 2023 (arXiv preprint)
Paper Link: https://arxiv.org/abs/2308.15701
Journal: IEEE Transactions on Knowledge and Data Engineering

Abstract

Recommendation systems aim to address the information overload problem in traditional information retrieval systems by focusing on recommending the most relevant content from massive information repositories to users. User interactions with systems typically exhibit sequential and heterogeneous characteristics, which has motivated the emergence of Multi-Behavior Sequential Recommendation (MBSR). MBSR represents a relatively novel and worthy research direction that can achieve state-of-the-art recommendation performance through appropriate modeling. This survey aims to clarify the MBSR problem, provide detailed coverage of its problem definition, application scenarios, and challenges, and categorize MBSR methods into neighborhood-based approaches, matrix factorization methods, and deep learning methods.

Research Background and Motivation

Problem Background

Information Overload Problem: With the explosive growth of internet information, users face challenges in rapidly obtaining desired content from massive information repositories
Limitations of Traditional Recommendation Systems:
- Single-Behavior Recommendation (SBR) only considers a single type of user feedback, easily encountering data sparsity and cold-start problems
- Static recommendation methods ignore the temporal nature and dynamic changes of user behavior

Research Motivation

Behavior Heterogeneity: In real-world scenarios, users interact with items through multiple types of behaviors (browsing, adding to cart, purchasing, etc.)
Sequential Dependency: User behavior exhibits inherent temporal characteristics, reflecting the dynamic changes between short-term and long-term preferences
Practical Application Needs: E-commerce, video recommendation, news recommendation and other domains urgently require recommendation algorithms that better align with real user behavior

Core Contributions

Systematic Survey: Provides the first comprehensive systematic investigation of the MBSR field, filling the gap of lacking relevant surveys in this emerging area
Multi-dimensional Classification Framework: Proposes a MBSR method classification framework based on three dimensions: technology, data, and modeling
In-depth Technical Analysis: Provides detailed analysis of the advantages and disadvantages of various methods, offering researchers a clear technical roadmap
Future Direction Guidance: Proposes five valuable future research directions to guide the development of this field

Method Details

Task Definition

The MBSR problem primarily focuses on next-item recommendation in heterogeneous feedback sequences. Given:

User set U
Item set I
Behavior set F

The input consists of (user, heterogeneous behavior sequence) pairs: $(u, S_u)$ , where: $S_u = \{(i_u^1, f_u^1), ..., (i_u^t, f_u^t), ..., (i_u^{|S_u|}, f_u^{|S_u|})\}$

The objective is to predict user u's preference value $\hat{r}_{u,t+1,j}$ for candidate item j at time step (t+1), and generate a Top-K recommendation list.

Classification Framework

1. Neighborhood-based Methods

Core Idea: Utilize similarity between users or items for recommendation
Representative Work: BIS (Bidirectional Item Similarity)
Advantages: Easy to maintain, strong interpretability
Disadvantages: Lack of transitivity, difficulty capturing complex user preferences

2. Matrix Factorization Methods

Core Idea: Decompose user-item interaction matrices into inner products of low-rank matrices
Representative Work: TransRec++
Innovation: Introduces behavior transition vectors to capture sequential relationships and behavior dynamics
Limitations: Model complexity increases dramatically with increasing behavior types

3. Deep Learning Methods (Primary Focus)

RNN Architecture

RLBL: Combines RNN and log-bilinear models using behavior-specific transition matrices
RIB: Utilizes GRU and attention mechanisms to model micro-behaviors
BINN: Designs CLSTM and Bi-CLSTM to process multi-behavior information
Advantages: Well-suited for sequence modeling, capable of storing short-term memory
Disadvantages: Gradient vanishing/exploding problems, low computational efficiency

GNN Architecture

MGNN-SPred: Constructs global graphs to capture behavior transition relationships
DMBGN: Designed for coupon redemption rate prediction tasks
GPG4HSR: Simultaneously considers global and personalized graphs
BGNN: Distinguishes between homogeneous and heterogeneous behavior transition patterns
Advantages: Capable of modeling complex relationships, strong data sparsity handling
Disadvantages: Graph construction complexity increases with more behavior types

Transformer Architecture

DMT: Employs MMoE for multi-task learning
DFN: Handles implicit feedback noise through attention networks
NextIP: Models the problem as dual-task learning for item prediction and purchase prediction
MB-STR: Designs novel position encoding functions
FLAG: Simultaneously models local preferences, global preferences, and local intents
Advantages: Strong parallel computing capability, excellent long-term dependency modeling
Disadvantages: Relatively weaker local information capture

Hybrid Architecture

MKM-SR: Combines GNN and GRU to separately model item sequences and behavior sequences
KHGT: Fuses Transformer and GNN, considering item relationship information
Advantages: Synthesizes advantages of multiple techniques, significant performance improvements
Disadvantages: High model complexity, difficult hyperparameter tuning

Experimental Setup

Application Scenarios

E-commerce Recommendation: Predicts user purchase behavior, considering auxiliary behaviors such as browsing, adding to cart, and favoriting
Video Recommendation: Uses sharing as target behavior and browsing as auxiliary behavior
News Recommendation: Combines explicit feedback (dislike) and implicit feedback (browsing)

Evaluation Metrics

Accuracy Metrics: Hit Rate, NDCG, Recall, Precision
Diversity Metrics: Coverage, Diversity
Efficiency Metrics: Training time, inference time, memory consumption

Data Perspective Classification

(Item, Behavior) Pair Sequences: Most common input format
Behavior-specific Subsequences: Item sequences segmented by behavior type
Behavior-agnostic Item Sequences: Item sequences ignoring behavior types
Behavior Sequences: Pure behavior type sequences

Experimental Results

Main Findings

Deep Learning Methods Dominate: Due to the complexity of MBSR problems, most works adopt deep learning methods
Transformer Performs Excellently: Demonstrates superior performance compared to RNN and GNN across multiple tasks
Hybrid Methods Show Great Potential: Hybrid architectures combining multiple techniques achieve better recommendation performance
Importance of Attention Mechanisms: Nearly all excellent methods employ some form of attention mechanism

Technical Trends

From Single to Hybrid: Technical development evolves from single architectures toward hybrid architectures
From Local to Global: Modeling perspectives expand from local sequences to global graph structures
From Simple to Complex: Behavior modeling develops from simple embeddings to complex transition relationships

Associated Research Areas

Sequential Recommendation: Foundation of MBSR, but considers only single behavior
Multi-behavior Recommendation: Considers heterogeneous behaviors but ignores sequentiality
Session-based Recommendation: Short-term interaction sequence recommendation
Federated Recommendation: Recommendation under privacy protection

Technical Evolution Path

Traditional Collaborative Filtering → Matrix Factorization → Deep Learning → Graph Neural Networks → Transformer → Hybrid Architecture

Conclusions and Discussion

Main Challenges

Heterogeneous Behavior Sequence Modeling: How to effectively model sequential relationships of different behavior types
Behavior Relationship Modeling: Capturing dependencies and transition relationships between behaviors
Long-term and Short-term Preference Fusion: Simultaneously modeling users' stable long-term preferences and dynamic short-term needs
Noise and Bias Handling: Addressing noise and selection bias in implicit feedback

Future Directions

Data Level: Address data sparsity, imbalance, periodicity, and noise issues
Technical Level: Improve computational efficiency, enhance model generalization, combine complementary techniques
Optimization Objectives: Transition from single-objective to multi-objective joint optimization
Large Language Models: Explore the application potential of LLMs in MBSR
Trustworthiness: Improve model interpretability and protect user privacy and security

In-depth Evaluation

Strengths

Comprehensiveness: Provides the first systematic survey of the MBSR field with broad coverage
Clear Structure: Multi-dimensional classification framework is well-organized and easy to understand and apply
Deep Analysis: Thorough analysis of advantages and disadvantages of each method category
Forward-looking: Proposed future directions provide strong guidance value
Practicality: Provides researchers and engineers with practical technical roadmaps

Limitations

Missing Experimental Comparisons: As a survey paper, lacks unified benchmark comparisons across methods
Insufficient Dataset Analysis: Limited analysis of characteristics and limitations of existing datasets
Limited Industrial Application Discussion: Insufficient discussion of practical deployment experiences in industry
Weak Theoretical Analysis: Lacks theoretical analysis of MBSR problem complexity and learnability

Impact

Academic Contribution: Establishes a systematic knowledge framework for the emerging MBSR field
Practical Value: Provides important reference for multi-behavior recommendation system design in industry
Research Guidance: Clarifies the technical development trajectory and future directions of the field
Standard Setting: Facilitates the establishment of evaluation standards and benchmarks for the MBSR field

Applicable Scenarios

Research Entry Point: Provides comprehensive field knowledge overview for beginners
Technology Selection: Guides engineers in selecting appropriate MBSR methods
Academic Research: Provides reference for researchers in determining research directions and innovation points
Industrial Application: Provides technical roadmaps for enterprises building multi-behavior recommendation systems

References

This paper references 107 related works covering important contributions from multiple domains including recommendation systems, deep learning, and graph neural networks, providing solid theoretical foundation and technical support for MBSR research.

Summary: This is a high-quality survey paper that systematically reviews the current state of research in the emerging field of multi-behavior sequential recommendation, laying important foundations for the field's development. The paper features clear structure and in-depth analysis, with significant value for advancing both theoretical development and practical applications of MBSR.