2025-11-20T07:34:14.820650

CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms

Gong, Zhu, Yin et al.
Short-video platforms have rapidly become a new generation of information retrieval systems, where users formulate queries to access desired videos. However, user queries, especially long-tail ones, often suffer from spelling errors, incomplete phrasing, and ambiguous intent, resulting in mismatches between user expectations and retrieved results. While large language models (LLMs) have shown success in long-tail query rewriting within e-commerce, they struggle on short-video platforms, where proprietary content such as short videos, live streams, micro dramas, and user social networks falls outside their training distribution. To address this challenge, we introduce \textbf{CardRewriter}, an LLM-based framework that incorporates domain-specific knowledge to enhance long-tail query rewriting. For each query, our method aggregates multi-source knowledge relevant to the query and summarizes it into an informative and query-relevant knowledge card. This card then guides the LLM to better capture user intent and produce more effective query rewrites. We optimize CardRewriter using a two-stage training pipeline: supervised fine-tuning followed by group relative policy optimization, with a tailored reward system balancing query relevance and retrieval effectiveness. Offline experiments show that CardRewriter substantially improves rewriting quality for queries targeting proprietary content. Online A/B testing further confirms significant gains in long-view rate (LVR) and click-through rate (CTR), along with a notable reduction in initiative query reformulation rate (IQRR). Since September 2025, CardRewriter has been deployed on Kuaishou, one of China's largest short-video platforms, serving hundreds of millions of users daily.
academic

CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms

Basic Information

  • Paper ID: 2510.10095
  • Title: CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms
  • Authors: Peiyuan Gong, Feiran Zhu, Yaqi Yin, Chenglei Dai, Chao Zhang, Kai Zheng, Wentian Bao, Jiaxin Mao, Yi Zhang
  • Categories: cs.IR (Information Retrieval), cs.CL (Computational Linguistics)
  • Publication Date: October 11, 2025
  • Paper Link: https://arxiv.org/abs/2510.10095

Abstract

Short-video platforms have rapidly become a new generation of information retrieval systems, where users obtain desired videos through queries. However, user queries, particularly long-tail queries, frequently suffer from spelling errors, incomplete expressions, and ambiguous intent, resulting in mismatches between user expectations and retrieval results. While large language models (LLMs) have demonstrated excellence in long-tail query rewriting for e-commerce domains, they face challenges on short-video platforms because platform-specific content (such as short videos, live streams, micro-dramas, and user social networks) lies outside their training distribution. To address this challenge, this paper proposes CardRewriter, an LLM-based framework that enhances long-tail query rewriting by incorporating domain-specific knowledge. The method aggregates multi-source relevant knowledge for each query and summarizes it into informative and query-relevant knowledge cards, which then guide the LLM to better capture user intent and produce more effective query rewrites.

Research Background and Motivation

Problem Definition

Short-video platform user queries face three major challenges:

  1. Spelling Errors: User-input query terms may contain spelling mistakes
  2. Incomplete Expression: User queries lack critical information, leading to inaccurate retrieval results
  3. Ambiguous Intent: Query intent is unclear, particularly when involving platform-specific content

Problem Significance

  • Short-video platforms have become important information retrieval systems serving hundreds of millions of users
  • Query quality directly impacts user experience and platform business value
  • Long-tail queries constitute a significant proportion but are difficult to handle

Limitations of Existing Methods

  1. Embedding-based Methods: Rely on semantic similarity retrieval of queries, with limited effectiveness for long-tail queries
  2. Generative Methods: LLMs perform well in e-commerce domains but lack sufficient understanding of short-video platform-specific content
  3. Domain Adaptation Issues: Existing LLMs are not trained on short videos, live streams, micro-dramas, and other platform-specific content

Research Motivation

Enhance LLM's understanding and rewriting capability for short-video platform queries by introducing platform-specific knowledge, particularly for long-tail queries involving proprietary content.

Core Contributions

  1. Proposes CardRewriter Framework: The first LLM framework specifically designed for long-tail query rewriting on short-video platforms, effectively integrating platform-specific knowledge through knowledge cards
  2. Designs Two-Stage Training Strategy: Combines supervised fine-tuning (SFT) and group relative policy optimization (GRPO), using a customized reward system to balance relevance and effectiveness
  3. Validates Practical Effectiveness: Deployment verification on Kuaishou platform shows significant improvements in both offline and online experiments
  4. Provides Complete Solution: End-to-end solution from knowledge collection, card generation to query rewriting

Methodology Details

Task Definition

Given an input query x, CardRewriter's objective is to generate a rewritten query y that can retrieve video content better aligned with user intent. The entire process can be expressed as:

y = G_θ(x, c), c = C_θ(x, M)

where c is the knowledge card, M is multi-source knowledge, C_θ is the card generation model, and G_θ is the query rewriting model.

Model Architecture

1. Knowledge Collection Stage

Multimodal Video Knowledge Extraction:

  • Visual Content: Extract three key frames from videos
  • Textual Content: Titles, subtitles, cover OCR text, author names, background music

Similar Query Expansion:

  • Rule-based Matching (Q2Q): Select queries with lexical overlap with the original query and intersecting retrieved video lists
  • Embedding-based Matching (EMB): Calculate embedding similarity between queries

Open-Domain Document Supplementation: Obtain relevant open-domain documents through API services as supplementary knowledge sources

2. Card-Based Rewriting Stage

Knowledge Card Generation:

  • Input: Original query x and multi-source knowledge M
  • Output: Concise and relevant knowledge card c
  • Objective: Denoise, integrate, and highlight content directly relevant to the query

Query Rewriting:

  • Input: Original query x and knowledge card c
  • Output: Rewritten query y
  • Objective: Leverage card information to better understand search context

Technical Innovations

  1. Knowledge Card Design: Compared to direct injection of multi-source knowledge, knowledge cards effectively address issues of structural inconsistency, excessive noise, and limited relevance
  2. Two-Stage Training Strategy:
    • SFT Stage: Supervised fine-tuning using high-quality data
    • GRPO Stage: Further optimization through reinforcement learning
  3. Customized Reward System:
    R_Overall = {
        R_Sys, if R_Sys > 0
        0.1, if R_Sys = 0 and R_Rel > 0  
        0, if R_Sys = R_Rel = 0
    }
    

    Balances semantic relevance and system preference

Experimental Setup

Datasets

Training Datasets:

  • Card Generation: 200K queries, generating 1.6M quadruples, filtered to retain 30K high-quality triples
  • Query Rewriting: 400K queries, generating 3.2M triples, filtered to retain 50K pairs for SFT
  • Reward Modeling: 150K queries, generating 240K preference pairs

Test Datasets:

  • Card Generation: 10K queries
  • Reward Modeling: 10K queries
  • Query Rewriting: 15K queries

Evaluation Metrics

Offline Metrics:

  1. Rel (Relevance): Evaluates semantic quality of model outputs
  2. Increment (Retrieval Expansion): Measures relative improvement in retrieval coverage
  3. Hitrate@K (User Satisfaction): Proportion of rewritten queries retrieving videos matching user intent in top-K results

Online Metrics:

  1. LVR (Long View Rate): Ratio of users watching videos for extended periods
  2. IQRR (Initiated Query Reformulation Rate): Ratio of users actively reformulating queries
  3. CTR (Click-Through Rate): Ratio of users clicking retrieval results

Baseline Methods

  • Prompt-based rewriting methods
  • SFT and SFT+DPO baselines
  • Direct knowledge injection method (Naive RAG)
  • Adapted CSA-QR method

Implementation Details

  • Card Generation: Fine-tuned based on Qwen2.5-VL-7B-Instruct
  • Query Rewriting: Fine-tuned based on Qwen3-8B
  • Training Settings: Learning rate 1×10^-5, AdamW optimizer, DeepSpeed ZeRO-3 distributed training

Experimental Results

Main Results

MethodKnowledge TypeQR-RelIncrementHitrate@50Hitrate@300
Original Query---31.40%53.07%
SFT+GRPO-78.98%65.19%41.68%65.71%
SFT+GRPONaive RAG74.28%70.86%41.05%65.63%
CardRewriterCard RAG85.73%74.17%46.64%76.04%

Key Findings:

  1. CardRewriter achieves best performance on most metrics
  2. Knowledge cards significantly improve rewriting quality compared to direct knowledge injection
  3. SFT+GRPO training strategy yields the best results

Ablation Studies

Multi-Source Knowledge Contribution Analysis:

  • Removing Visual Content: QC-Rel drops from 91.16% to 89.37%
  • Removing Textual Content: QC-Rel drops to 86.18% (larger impact)
  • Removing Related Videos: QC-Rel drops to 78.27% (largest impact)

Reward System Analysis:

  • Using Only Relevance Reward: Highest query relevance but limited retrieval expansion
  • Using Only System Reward: Retrieves more new videos but may deviate from original intent
  • Combined Reward: Achieves best balance between relevance and effectiveness

Case Analysis

The paper demonstrates a typical case:

  • Original Query: "可口可乐寄人篱下的痛" (User misspelled creator name "可可可乐" as "可口可乐")
  • LLM Rewrite: "可口可乐 寄人篱下 创意广告" (Incorrectly associated with beverage brand)
  • CardRewriter: "可可可乐 被寄养的孩子系列" (Correctly understood user intent)

Online Experimental Results

Traffic TypeLVR ↑IQRR ↓CTR ↑
Coverage Traffic+1.853%-2.630%+3.729%
Full Traffic+0.235%-0.229%+0.342%

Online A/B testing confirms significant improvements, particularly on coverage traffic.

Query Rewriting Research

  1. Embedding-based Methods: Treat query rewriting as a retrieval task, enhancing original queries through similar queries
  2. Generative Methods: Directly generate revised queries; recent LLM methods show excellent performance in e-commerce domains
  3. Domain Adaptation: Existing methods primarily target e-commerce scenarios; short-video platform research is relatively limited

Retrieval-Augmented Generation

RAG technology enhances generation quality by retrieving relevant information. This paper applies it to query rewriting tasks, effectively integrating multi-source information through knowledge cards.

Conclusions and Discussion

Main Conclusions

  1. CardRewriter effectively addresses long-tail query rewriting on short-video platforms through knowledge cards
  2. Two-stage training strategy and customized reward system significantly improve rewriting quality
  3. Deployment verification on Kuaishou platform demonstrates practical value

Limitations

  1. Computational Overhead: LLM inference requires substantial computational resources; near-line deployment strategy is adopted
  2. Coverage Range: Currently covers only 15-20% of daily search traffic
  3. Domain Dependency: Method is designed for short-video platforms; applicability to other domains requires further verification

Future Directions

  1. Expand coverage of query ranges
  2. Optimize online inference efficiency
  3. Explore applicability across platforms and domains

In-Depth Evaluation

Strengths

  1. Strong Problem Specificity: Accurately identifies unique challenges of query rewriting on short-video platforms
  2. Complete Technical Solution: End-to-end solution from knowledge collection to model training
  3. Comprehensive Experimental Evaluation: Thorough offline experiments with online deployment verification
  4. High Engineering Practical Value: Deployed on large-scale platforms serving hundreds of millions of users

Weaknesses

  1. Computational Efficiency: High LLM inference latency limits real-time applications
  2. Data Dependency: Requires substantial annotated data for training set construction
  3. Interpretability: Interpretability of knowledge card generation process needs enhancement
  4. Generalization Ability: Method is platform-specific; cross-domain generalization capability requires further verification

Impact

  1. Academic Contribution: Provides new perspectives for query understanding on short-video platforms
  2. Practical Value: Solves actual business problems with direct commercial value
  3. Technology Promotion: Knowledge card design approach can be generalized to other RAG applications

Applicable Scenarios

  1. Search optimization for short-video, live-streaming, and other content platforms
  2. Vertical domain search containing large amounts of proprietary content
  3. Query understanding tasks requiring integration of multimodal information

References

The paper cites 33 relevant references covering query rewriting, retrieval-augmented generation, large language models, and other research directions, providing substantial theoretical foundation.


Summary: CardRewriter is an innovative research project addressing long-tail query rewriting on short-video platforms. By effectively integrating platform-specific knowledge through knowledge cards, it achieves significant results in both theoretical methodology and engineering practice. This work provides a valuable solution for handling query understanding tasks involving proprietary content.