2025-11-23T00:10:15.831186

Multi-Granularity Sequence Denoising with Weakly Supervised Signal for Sequential Recommendation

Li, Yang, Zhu

Sequential recommendation aims to predict the next item based on user interests in historical interaction sequences. Historical interaction sequences often contain irrelevant noisy items, which significantly hinders the performance of recommendation systems. Existing research employs unsupervised methods that indirectly identify item-granularity irrelevant noise by predicting the ground truth item. Since these methods lack explicit noise labels, they are prone to misidentify users' interested items as noise. Additionally, while these methods focus on removing item-granularity noise driven by the ground truth item, they overlook interest-granularity noise, limiting their ability to perform broader denoising based on user interests. To address these issues, we propose Multi-Granularity Sequence Denoising with Weakly Supervised Signal for Sequential Recommendation(MGSD-WSS). MGSD-WSS first introduces the Multiple Gaussian Kernel Perceptron module to map the original and enhance sequence into a common representation space and utilizes weakly supervised signals to accurately identify noisy items in the historical interaction sequence. Subsequently, it employs the item-granularity denoising module with noise-weighted contrastive learning to obtain denoised item representations. Then, it extracts target interest representations from the ground truth item and applies noise-weighted contrastive learning to obtain denoised interest representations. Finally, based on the denoised item and interest representations, MGSD-WSS predicts the next item. Extensive experiments on five datasets demonstrate that the proposed method significantly outperforms state-of-the-art sequence recommendation and denoising models. Our code is available at https://github.com/lalunex/MGSD-WSS.

academic

Multi-Granularity Sequence Denoising with Weakly Supervised Signal for Sequential Recommendation

基本信息

论文ID: 2510.10564
标题: Multi-Granularity Sequence Denoising with Weakly Supervised Signal for Sequential Recommendation
作者: Liang Li (重庆理工大学), Zhou Yang (福州大学), Xiaofei Zhu (重庆理工大学)
分类: cs.IR (信息检索)
发表时间: 2025年10月12日 (arXiv预印本)
论文链接: https://arxiv.org/abs/2510.10564
代码链接: https://github.com/lalunex/MGSD-WSS

摘要

序列推荐旨在基于用户历史交互序列中的兴趣来预测下一个物品。历史交互序列通常包含不相关的噪声物品，这显著阻碍了推荐系统的性能。现有研究采用无监督方法，通过预测真实物品来间接识别物品粒度的无关噪声。由于这些方法缺乏明确的噪声标签，容易将用户感兴趣的物品误识别为噪声。此外，这些方法专注于移除由真实物品驱动的物品粒度噪声，但忽略了兴趣粒度噪声，限制了基于用户兴趣进行更广泛去噪的能力。为解决这些问题，本文提出了多粒度序列去噪与弱监督信号的序列推荐方法(MGSD-WSS)。

研究背景与动机

问题定义

序列推荐系统面临的核心问题是历史交互序列中存在噪声物品，如意外点击和恶意虚假交互，这些噪声显著降低了推荐系统的性能。

现有方法的局限性

软去噪方法：通过注意力机制或过滤算法调整噪声物品的权重，但无法完全消除噪声影响
硬去噪方法：生成噪声检测信号来显式移除噪声物品，但存在以下问题：
- 使用真实物品而非真实噪声标签来指导模型识别噪声，准确性有限
- 仅关注物品粒度去噪，忽略了兴趣粒度的噪声

研究动机

缺乏明确的噪声标签使得现有无监督方法容易误识别用户感兴趣的物品
用户交互不仅反映特定物品偏好，还体现更高层次的兴趣（如"体育"兴趣包含足球、运动鞋、跑步机等）
需要在多个粒度上进行层次化去噪以更全面地移除噪声

核心贡献

首次引入弱监督信号：通过标记的弱监督信号直接训练模型进行噪声识别，克服了以往无监督方法的不准确性
多粒度层次化去噪：提出物品粒度和兴趣粒度的层次化去噪模块，配合噪声加权对比学习
创新的架构设计：
- Multiple Gaussian Kernel Perceptron (MGP)模块
- Target-aware Sequence Encoding
- 噪声加权对比学习框架
显著的性能提升：在五个数据集上显著优于最先进的序列推荐和去噪模型

方法详解

任务定义

给定用户集合 $\mathcal{U} = \{u_1, u_2, \ldots, u_{|\mathcal{U}|}\}$ 和物品集合 $\mathcal{V} = \{v_1, v_2, \ldots, v_{|\mathcal{V}|}\}$ ，每个用户 $u \in \mathcal{U}$ 关联一个按时间顺序排列的历史交互序列 $S = [s_1, s_2, \ldots, s_n]$ 。目标是利用交互序列 $S$ 预测用户在第 $(n+1)$ 步最可能交互的物品，即 $p(s_{n+1}|s_{1:n})$ 。

模型架构

MGSD-WSS包含三个核心组件：

1. Target-aware Sequence Encoding

序列数据增强：

随机选择 $t$ 个不同物品作为噪声插入原始序列
构建增强序列 $\bar{S} = [\bar{s}_1, \bar{s}_2, \ldots, \bar{s}_{n+t}]$
获得监督信号 $\bar{Y} = [\bar{y}_1, \bar{y}_2, \ldots, \bar{y}_{n+t}]$ 标示噪声位置

Multiple Gaussian Kernel Perceptron (MGP)：

计算目标物品与序列中每个物品的余弦相似度： $\bar{\alpha}_i = \cos(\bar{h}_{n+1}, \bar{h}_i)$
使用 $k$ 个高斯核转换相关性得分： $r_{ij} = \exp\left(-\frac{(\bar{\alpha}_i - \mu_j)^2}{2\sigma_j^2}\right)$ $\hat{h}_i = \sum_{j=1}^k r_{ij} \bar{h}_i$
通过Transformer编码器获得丰富的表示： $G = \text{Transformer}(\hat{H} + P)$

2. Auxiliary Noise Discrimination

使用共享的物品级噪声判别器检测增强序列中的噪声物品： $\boldsymbol{\beta}_i = \text{Softmax}((\text{ReLU}(\bar{g}_i W_1 + b_1))W_2)$

通过MSE损失最小化噪声检测信号与监督信号的差异： $MSE = \frac{1}{n}\sum_{i=1}^n (\beta_i^0 - \bar{y}_i)^2$

3. Multi-granularity Sequence Denoising

物品粒度去噪：

使用Gumbel-softmax将噪声检测信号转换为二进制硬值
过滤噪声物品构建去噪表示矩阵
应用噪声加权对比学习： $ITSCL = -\frac{1}{|G^+|}\sum_{g_i \in G^+} \log \frac{\omega(g_i) \cdot \exp(\text{sim}(e_{se}, g_i)/\tau)}{\sum_{g_j \in G} \omega(g_j) \cdot \exp(\text{sim}(e_{se}, g_j)/\tau)}$

兴趣粒度去噪：

引入可学习的兴趣表示矩阵 $Q = [q_1, q_2, \ldots, q_m]$
计算物品与兴趣的相关性得分
使用目标感知兴趣注意力评估兴趣可靠性
应用兴趣粒度噪声加权对比学习

技术创新点

弱监督信号生成：通过数据增强策略生成明确的噪声标签，提供准确的监督信号
多粒度去噪：同时在物品和兴趣两个粒度上进行去噪，更全面地处理序列噪声
噪声加权对比学习：根据噪声程度为样本分配权重，优于传统的等权重对比学习
高斯核感知器：捕获不同相似性区域的信息，增强序列表示

实验设置

数据集

使用五个公开基准数据集：

数据集	序列数	用户数	物品数	平均长度	稀疏度
ML-100k	99,287	944	1,350	105.29	92.21%
Beauty	198,502	22,364	12,102	8.88	99.93%
Sports	296,337	35,599	18,358	8.32	99.95%
Yelp	316,354	30,432	20,034	10.40	99.95%
ML-1M	999,611	6,041	3,417	165.50	95.16%