Augmenting Compliance-Guaranteed Customer Service Chatbots: Context-Aware Knowledge Expansion with Large Language Models
Hong, Zhang, Jiang et al.
Retrieval-based chatbots leverage human-verified Q\&A knowledge to deliver accurate, verifiable responses, making them ideal for customer-centric applications where compliance with regulatory and operational standards is critical. To effectively handle diverse customer inquiries, augmenting the knowledge base with "similar questions" that retain semantic meaning while incorporating varied expressions is a cost-effective strategy. In this paper, we introduce the Similar Question Generation (SQG) task for LLM training and inference, proposing context-aware approaches to enable comprehensive semantic exploration and enhanced alignment with source question-answer relationships. We formulate optimization techniques for constructing in-context prompts and selecting an optimal subset of similar questions to expand chatbot knowledge under budget constraints. Both quantitative and human evaluations validate the effectiveness of these methods, achieving a 92% user satisfaction rate in a deployed chatbot system, reflecting an 18% improvement over the unaugmented baseline. These findings highlight the practical benefits of SQG and emphasize the potential of LLMs, not as direct chatbot interfaces, but in supporting non-generative systems for hallucination-free, compliance-guaranteed applications.
academic
Augmenting Compliance-Guaranteed Customer Service Chatbots: Context-Aware Knowledge Expansion with Large Language Models
Retrieval-based chatbots leverage human-verified question-answer knowledge bases to provide accurate and verifiable responses, making them well-suited for customer service applications requiring compliance with regulatory and operational standards. To effectively handle diverse customer queries, augmenting the knowledge base by generating "similar questions" that maintain semantic consistency while exhibiting expression diversity is a cost-effective strategy. This paper introduces the Similar Question Generation (SQG) task for large language model training and inference, proposing context-aware approaches to achieve comprehensive semantic exploration and enhanced alignment with source question-answer relationships. The research establishes optimization techniques for constructing context prompts and selecting optimal similar question subsets under budget constraints. Quantitative and human evaluations validate the effectiveness of these approaches, achieving 92% user satisfaction in a deployed chatbot system, representing an 18% improvement over the unaugmented baseline.
Core Issue: Traditional retrieval-based customer service chatbots suffer from matching failures when handling customer queries with diverse expressions, resulting in poor user experience
Application Scenario Importance: In highly regulated industries such as finance and healthcare, generative large language models are prone to hallucinations and cannot meet compliance requirements
Limitations of Existing Methods:
Manual crowdsourcing is costly and offers limited diversity
This research aims to leverage the generative capabilities of large language models to augment knowledge bases for retrieval-based chatbots rather than using them as direct dialogue interfaces, thereby improving query matching performance while ensuring compliance.
First Definition of SQG Task: Formulates the Similar Question Generation task for retrieval-based service chatbot enhancement, proposing a context-aware one-to-many generation paradigm
Optimization Framework: Proposes optimization techniques under budget constraints for selecting prompt examples and similar question subsets to facilitate knowledge base expansion
Significant Performance Improvements: Experiments demonstrate over 120% relative improvement in qualitative evaluation, 4.74% overall diversity improvement, and 18% user satisfaction increase
Real-World Deployment Verification: Validates method effectiveness through deployment and verification in an actual banking customer service system
Similar Question Generation (SQG) aims to create diverse yet semantically consistent question sets for specific answers in the knowledge base. Key requirements include:
Semantic Consistency: Preserving original intent and meaning
Syntactic Diversity: Variations in wording and structure
Significant Intent Enhancement Effect: The intent-enhanced method achieves 84% acceptance rate in human evaluation, representing a 121.64% improvement over baseline methods
Scale Effects: The proposed method maintains stable precision as the number of generated questions increases, while baseline methods show significant degradation
Real Deployment Performance: Achieves 92% user satisfaction in actual banking applications, representing an 18% improvement over unaugmented baseline
Compared to existing work, this paper is the first to systematically apply large language models to knowledge base augmentation for retrieval-based chatbots, proposing specialized training objectives and optimization frameworks.
The paper cites multiple important related works, including:
Data augmentation methods: Wei et al. (2022), Liu et al. (2023)
Retrieval-based chatbots: Wu et al. (2018), Singh et al. (2018)
Large language model applications: Vaswani et al. (2017), Cheng et al. (2023)
Evaluation methods: Zhang et al. (2020), Li et al. (2016)
Overall Assessment: This is a high-quality applied research paper that achieves good balance between theoretical innovation and practical value. The method design is sound, experimental validation is comprehensive, and deployment verification in real business environments strengthens the paper's persuasiveness. It holds significant reference value for AI applications requiring compliance guarantees.