Diffusion Generative Recommendation with Continuous Tokens
Qu, Lin, Ding et al.
Recent advances in generative artificial intelligence, particularly large language models (LLMs), have opened new opportunities for enhancing recommender systems (RecSys). Most existing LLM-based RecSys approaches operate in a discrete space, using vector-quantized tokenizers to align with the inherent discrete nature of language models. However, these quantization methods often result in lossy tokenization and suboptimal learning, primarily due to inaccurate gradient propagation caused by the non-differentiable argmin operation in standard vector quantization. Inspired by the emerging trend of embracing continuous tokens in language models, we propose ContRec, a novel framework that seamlessly integrates continuous tokens into LLM-based RecSys. Specifically, ContRec consists of two key modules: a sigma-VAE Tokenizer, which encodes users/items with continuous tokens; and a Dispersive Diffusion module, which captures implicit user preference. The tokenizer is trained with a continuous Variational Auto-Encoder (VAE) objective, where three effective techniques are adopted to avoid representation collapse. By conditioning on the previously generated tokens of the LLM backbone during user modeling, the Dispersive Diffusion module performs a conditional diffusion process with a novel Dispersive Loss, enabling high-quality user preference generation through next-token diffusion. Finally, ContRec leverages both the textual reasoning output from the LLM and the latent representations produced by the diffusion model for Top-K item retrieval, thereby delivering comprehensive recommendation results. Extensive experiments on four datasets demonstrate that \ourname{} consistently outperforms both traditional and SOTA LLM-based recommender systems. Our results highlight the potential of continuous tokenization and generative modeling for advancing the next generation of recommender systems.
academic
Diffusion Generative Recommendation with Continuous Tokens
This paper addresses the limitations of discrete tokenization methods in Large Language Model (LLM)-based recommendation systems by proposing the ContRec framework, which seamlessly integrates continuous tokens into LLM recommendation systems. ContRec comprises two core modules: a σ-VAE tokenizer (encoding users/items with continuous tokens) and a dispersed diffusion module (capturing implicit user preferences). By combining text reasoning outputs from LLMs with latent representations generated by diffusion models for Top-K item retrieval, experiments on four datasets demonstrate that ContRec significantly outperforms both traditional and state-of-the-art LLM-based recommendation systems.
Discrete Methods: Approaches like TIGER and UTGRec use VQ-VAE to construct discrete vocabularies, suffering from information compression loss
Continuous Projection Methods: Methods such as CoLLM and LlaRA only employ continuous tokens in the input portion, with outputs still relying on discrete generators, creating discrete-continuous discrepancies
Inspired by the trend of embracing continuous tokens in language models, this work explores the potential of using continuous tokens and diffusion models in recommendation scenarios to achieve higher-quality user preference modeling.
Proposes ContRec Framework: The first framework to seamlessly integrate continuous tokens into LLM recommendation systems, breaking through quantization limitations
Designs Two Key Modules:
σ-VAE Tokenizer: A robust continuous tokenizer employing three techniques to prevent representation collapse
Dispersed Diffusion Module: Generates implicit user preference representations through contrastive self-supervised learning
Introduces Dispersed Loss: A contrastive learning mechanism without explicit negative sample pairs
Experimental Validation: Achieves average improvements of 11.76% HR@10 and 10.11% NDCG@10 across four datasets
Given a user set U = {u₁, u₂, ..., uₙ} and item set V = {v₁, v₂, ..., vₘ}, the objective is to predict future user preferences by analyzing historical interactions, reformulating sequential recommendation as a language model paradigm:
This paper cites important works from recommendation systems, large language models, diffusion models, and related fields, including:
Classical Recommendation Algorithms: LightGCN, SASRec, etc.
LLM-Based Recommendation Systems: P5, TIGER, TokenRec, etc.
Diffusion Models: DDPM, Classifier-free Guidance, etc.
Continuous Tokenization: VAE-MAR, Next-Token Diffusion, etc.
Overall Assessment: This is an important work with significant innovation in the LLM recommendation systems domain. By introducing continuous tokenization and diffusion models, it effectively addresses limitations of existing methods. While there remains room for improvement in computational efficiency and applicability in certain scenarios, its technical innovations and experimental validation are sufficiently rigorous, providing valuable contributions to the field's development.