2025-11-20T07:34:14.820650

CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms

Gong, Zhu, Yin et al.

Short-video platforms have rapidly become a new generation of information retrieval systems, where users formulate queries to access desired videos. However, user queries, especially long-tail ones, often suffer from spelling errors, incomplete phrasing, and ambiguous intent, resulting in mismatches between user expectations and retrieved results. While large language models (LLMs) have shown success in long-tail query rewriting within e-commerce, they struggle on short-video platforms, where proprietary content such as short videos, live streams, micro dramas, and user social networks falls outside their training distribution. To address this challenge, we introduce \textbf{CardRewriter}, an LLM-based framework that incorporates domain-specific knowledge to enhance long-tail query rewriting. For each query, our method aggregates multi-source knowledge relevant to the query and summarizes it into an informative and query-relevant knowledge card. This card then guides the LLM to better capture user intent and produce more effective query rewrites. We optimize CardRewriter using a two-stage training pipeline: supervised fine-tuning followed by group relative policy optimization, with a tailored reward system balancing query relevance and retrieval effectiveness. Offline experiments show that CardRewriter substantially improves rewriting quality for queries targeting proprietary content. Online A/B testing further confirms significant gains in long-view rate (LVR) and click-through rate (CTR), along with a notable reduction in initiative query reformulation rate (IQRR). Since September 2025, CardRewriter has been deployed on Kuaishou, one of China's largest short-video platforms, serving hundreds of millions of users daily.

academic

CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms

Basic Information

Paper ID: 2510.10095
Title: CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms
Authors: Peiyuan Gong, Feiran Zhu, Yaqi Yin, Chenglei Dai, Chao Zhang, Kai Zheng, Wentian Bao, Jiaxin Mao, Yi Zhang
Categories: cs.IR (Information Retrieval), cs.CL (Computational Linguistics)
Publication Date: October 11, 2025
Paper Link: https://arxiv.org/abs/2510.10095

Abstract

Short-video platforms have rapidly become a new generation of information retrieval systems, where users obtain desired videos through queries. However, user queries, particularly long-tail queries, frequently suffer from spelling errors, incomplete expressions, and ambiguous intent, resulting in mismatches between user expectations and retrieval results. While large language models (LLMs) have demonstrated excellence in long-tail query rewriting for e-commerce domains, they face challenges on short-video platforms because platform-specific content (such as short videos, live streams, micro-dramas, and user social networks) lies outside their training distribution. To address this challenge, this paper proposes CardRewriter, an LLM-based framework that enhances long-tail query rewriting by incorporating domain-specific knowledge. The method aggregates multi-source relevant knowledge for each query and summarizes it into informative and query-relevant knowledge cards, which then guide the LLM to better capture user intent and produce more effective query rewrites.

Research Background and Motivation

Problem Definition

Short-video platform user queries face three major challenges:

Spelling Errors: User-input query terms may contain spelling mistakes
Incomplete Expression: User queries lack critical information, leading to inaccurate retrieval results
Ambiguous Intent: Query intent is unclear, particularly when involving platform-specific content

Problem Significance

Short-video platforms have become important information retrieval systems serving hundreds of millions of users
Query quality directly impacts user experience and platform business value
Long-tail queries constitute a significant proportion but are difficult to handle

Limitations of Existing Methods

Embedding-based Methods: Rely on semantic similarity retrieval of queries, with limited effectiveness for long-tail queries
Generative Methods: LLMs perform well in e-commerce domains but lack sufficient understanding of short-video platform-specific content
Domain Adaptation Issues: Existing LLMs are not trained on short videos, live streams, micro-dramas, and other platform-specific content

Research Motivation

Enhance LLM's understanding and rewriting capability for short-video platform queries by introducing platform-specific knowledge, particularly for long-tail queries involving proprietary content.

Core Contributions

Proposes CardRewriter Framework: The first LLM framework specifically designed for long-tail query rewriting on short-video platforms, effectively integrating platform-specific knowledge through knowledge cards
Designs Two-Stage Training Strategy: Combines supervised fine-tuning (SFT) and group relative policy optimization (GRPO), using a customized reward system to balance relevance and effectiveness
Validates Practical Effectiveness: Deployment verification on Kuaishou platform shows significant improvements in both offline and online experiments
Provides Complete Solution: End-to-end solution from knowledge collection, card generation to query rewriting

Methodology Details

Task Definition

Given an input query x, CardRewriter's objective is to generate a rewritten query y that can retrieve video content better aligned with user intent. The entire process can be expressed as:

y = G_θ(x, c), c = C_θ(x, M)

where c is the knowledge card, M is multi-source knowledge, C_θ is the card generation model, and G_θ is the query rewriting model.

Model Architecture

1. Knowledge Collection Stage

Multimodal Video Knowledge Extraction:

Visual Content: Extract three key frames from videos
Textual Content: Titles, subtitles, cover OCR text, author names, background music

Similar Query Expansion:

Rule-based Matching (Q2Q): Select queries with lexical overlap with the original query and intersecting retrieved video lists
Embedding-based Matching (EMB): Calculate embedding similarity between queries

Open-Domain Document Supplementation: Obtain relevant open-domain documents through API services as supplementary knowledge sources

2. Card-Based Rewriting Stage

Knowledge Card Generation:

Input: Original query x and multi-source knowledge M
Output: Concise and relevant knowledge card c
Objective: Denoise, integrate, and highlight content directly relevant to the query

Query Rewriting:

Input: Original query x and knowledge card c
Output: Rewritten query y
Objective: Leverage card information to better understand search context

Technical Innovations

Knowledge Card Design: Compared to direct injection of multi-source knowledge, knowledge cards effectively address issues of structural inconsistency, excessive noise, and limited relevance
Two-Stage Training Strategy:
- SFT Stage: Supervised fine-tuning using high-quality data
- GRPO Stage: Further optimization through reinforcement learning

Customized Reward System:

R_Overall = {
    R_Sys, if R_Sys > 0
    0.1, if R_Sys = 0 and R_Rel > 0  
    0, if R_Sys = R_Rel = 0
}

Balances semantic relevance and system preference

Experimental Setup

Datasets

Training Datasets:

Card Generation: 200K queries, generating 1.6M quadruples, filtered to retain 30K high-quality triples
Query Rewriting: 400K queries, generating 3.2M triples, filtered to retain 50K pairs for SFT
Reward Modeling: 150K queries, generating 240K preference pairs

Test Datasets:

Card Generation: 10K queries
Reward Modeling: 10K queries
Query Rewriting: 15K queries

Evaluation Metrics

Offline Metrics:

Rel (Relevance): Evaluates semantic quality of model outputs
Increment (Retrieval Expansion): Measures relative improvement in retrieval coverage
Hitrate@K (User Satisfaction): Proportion of rewritten queries retrieving videos matching user intent in top-K results

Online Metrics:

LVR (Long View Rate): Ratio of users watching videos for extended periods
IQRR (Initiated Query Reformulation Rate): Ratio of users actively reformulating queries
CTR (Click-Through Rate): Ratio of users clicking retrieval results

Baseline Methods

Prompt-based rewriting methods
SFT and SFT+DPO baselines
Direct knowledge injection method (Naive RAG)
Adapted CSA-QR method

Implementation Details

Card Generation: Fine-tuned based on Qwen2.5-VL-7B-Instruct
Query Rewriting: Fine-tuned based on Qwen3-8B
Training Settings: Learning rate 1×10^-5, AdamW optimizer, DeepSpeed ZeRO-3 distributed training

Experimental Results

Main Results

Method	Knowledge Type	QR-Rel	Increment	Hitrate@50	Hitrate@300
Original Query	-	-	-	31.40%	53.07%
SFT+GRPO	-	78.98%	65.19%	41.68%	65.71%
SFT+GRPO	Naive RAG	74.28%	70.86%	41.05%	65.63%
CardRewriter	Card RAG	85.73%	74.17%	46.64%	76.04%

Key Findings:

CardRewriter achieves best performance on most metrics
Knowledge cards significantly improve rewriting quality compared to direct knowledge injection
SFT+GRPO training strategy yields the best results

Ablation Studies

Multi-Source Knowledge Contribution Analysis:

Removing Visual Content: QC-Rel drops from 91.16% to 89.37%
Removing Textual Content: QC-Rel drops to 86.18% (larger impact)
Removing Related Videos: QC-Rel drops to 78.27% (largest impact)

Reward System Analysis:

Using Only Relevance Reward: Highest query relevance but limited retrieval expansion
Using Only System Reward: Retrieves more new videos but may deviate from original intent
Combined Reward: Achieves best balance between relevance and effectiveness

Case Analysis

The paper demonstrates a typical case:

Original Query: "可口可乐寄人篱下的痛" (User misspelled creator name "可可可乐" as "可口可乐")
LLM Rewrite: "可口可乐寄人篱下创意广告" (Incorrectly associated with beverage brand)
CardRewriter: "可可可乐被寄养的孩子系列" (Correctly understood user intent)

Online Experimental Results

Traffic Type	LVR ↑	IQRR ↓	CTR ↑
Coverage Traffic	+1.853%	-2.630%	+3.729%
Full Traffic	+0.235%	-0.229%	+0.342%

Online A/B testing confirms significant improvements, particularly on coverage traffic.

Query Rewriting Research

Embedding-based Methods: Treat query rewriting as a retrieval task, enhancing original queries through similar queries
Generative Methods: Directly generate revised queries; recent LLM methods show excellent performance in e-commerce domains
Domain Adaptation: Existing methods primarily target e-commerce scenarios; short-video platform research is relatively limited

Retrieval-Augmented Generation

RAG technology enhances generation quality by retrieving relevant information. This paper applies it to query rewriting tasks, effectively integrating multi-source information through knowledge cards.

Conclusions and Discussion

Main Conclusions

CardRewriter effectively addresses long-tail query rewriting on short-video platforms through knowledge cards
Two-stage training strategy and customized reward system significantly improve rewriting quality
Deployment verification on Kuaishou platform demonstrates practical value

Limitations

Computational Overhead: LLM inference requires substantial computational resources; near-line deployment strategy is adopted
Coverage Range: Currently covers only 15-20% of daily search traffic
Domain Dependency: Method is designed for short-video platforms; applicability to other domains requires further verification

Future Directions

Expand coverage of query ranges
Optimize online inference efficiency
Explore applicability across platforms and domains

In-Depth Evaluation

Strengths

Strong Problem Specificity: Accurately identifies unique challenges of query rewriting on short-video platforms
Complete Technical Solution: End-to-end solution from knowledge collection to model training
Comprehensive Experimental Evaluation: Thorough offline experiments with online deployment verification
High Engineering Practical Value: Deployed on large-scale platforms serving hundreds of millions of users

Weaknesses

Computational Efficiency: High LLM inference latency limits real-time applications
Data Dependency: Requires substantial annotated data for training set construction
Interpretability: Interpretability of knowledge card generation process needs enhancement
Generalization Ability: Method is platform-specific; cross-domain generalization capability requires further verification

Impact

Academic Contribution: Provides new perspectives for query understanding on short-video platforms
Practical Value: Solves actual business problems with direct commercial value
Technology Promotion: Knowledge card design approach can be generalized to other RAG applications

Applicable Scenarios

Search optimization for short-video, live-streaming, and other content platforms
Vertical domain search containing large amounts of proprietary content
Query understanding tasks requiring integration of multimodal information

References

The paper cites 33 relevant references covering query rewriting, retrieval-augmented generation, large language models, and other research directions, providing substantial theoretical foundation.

Summary: CardRewriter is an innovative research project addressing long-tail query rewriting on short-video platforms. By effectively integrating platform-specific knowledge through knowledge cards, it achieves significant results in both theoretical methodology and engineering practice. This work provides a valuable solution for handling query understanding tasks involving proprietary content.