Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models
Shim, Ju, Park et al.
Recent advancements in large language models (LLMs) have shown strong performance in natural language understanding and generation tasks. However, LLMs continue to encounter challenges with hallucinations, where models generate plausible but incorrect information. While several factors contribute to hallucinations, the impact of ill-formed prompts, prompts with ambiguous wording, incorrect grammar, or incomplete information, was relatively under explored. To address this, we introduce Multi-stage Prompt Refinement (MPR), a framework designed to systematically improve these ill-formed prompts across multiple stages. Each stage addresses specific errors such as punctuation, typographical mistakes, and misuse of key terms, using small language models (SLMs) fine-tuned for these tasks. MPR iteratively enhances the clarity of prompts with additional context and employs a self-reflection mechanism with ranking to prioritize the most relevant input. Experimental results on hallucination benchmarks show that prompts refined by MPR achieve over an 85~\% win rate compared to their original forms, demonstrating its effectiveness in reducing hallucinations and improving LLM output accuracy. Interestingly, we reveal that MPR can be combined with existing post-hoc hallucination mitigation frameworks, further enhancing its versatility. MPR provides a lightweight and adaptable solution for enhancing LLM reliability across various domains.
academic
Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models
Large language models demonstrate exceptional performance in natural language understanding and generation tasks, yet still face the hallucination problem—generating information that appears plausible but is factually incorrect. While multiple factors contribute to hallucinations, the impact of poorly formatted prompts (containing ambiguous phrasing, grammatical errors, or incomplete information) remains relatively underexplored. This paper proposes a Multi-stage Prompt Refinement framework (MPR) that systematically improves such poorly formatted prompts through multiple stages. Each stage employs a small language model (SLM) fine-tuned for specific tasks to address concrete issues such as punctuation, spelling errors, and keyword misuse. MPR iteratively enhances prompt clarity through a self-reflection mechanism and ranking to prioritize the most relevant inputs. Experimental results demonstrate that prompts optimized by MPR achieve over 85% win rate compared to their original form, effectively reducing hallucinations and improving LLM output accuracy.
Although large language models excel in multiple NLP tasks, they face a critical challenge: the hallucination problem, wherein models generate information that appears reasonable but is factually incorrect. This is particularly dangerous in critical domains such as healthcare and education, where accurate information transmission is paramount.
These methods typically overlook an important factor: the quality of user prompts. Poorly formatted prompts directly lead to inaccurate outputs, yet existing solutions often rely on large models or computationally intensive techniques.
This paper posits that systematically optimizing input prompt quality can reduce hallucination problems at their source. Compared to modifying model architectures or post-processing outputs, prompt optimization represents a more lightweight and scalable solution.
Proposes MPR Framework: The first systematic multi-stage optimization framework addressing hallucinations caused by poorly formatted prompts
Lightweight Design: Employs small language models (SLMs) rather than large models, significantly reducing computational costs
Model Agnosticism: Seamlessly integrates with any LLM architecture, demonstrating high adaptability
Comprehensive Evaluation: Validates effectiveness across multiple datasets with win rates exceeding 85%
Compatibility Verification: Demonstrates compatibility with existing post-processing hallucination mitigation methods for further performance enhancement
This paper cites 27 important references covering recent research in large language models, hallucination detection, prompt engineering, small model applications, and related fields, providing a solid theoretical foundation for the research.
Overall Assessment: This is a high-quality research paper proposing an innovative solution to address LLM hallucination problems. The MPR framework is elegantly designed with comprehensive experiments and convincing results. Despite certain limitations, its lightweight and modular design provides high practical value and extension potential.