PAGE: Prompt Augmentation for text Generation Enhancement
Pacchiotti, Ballejos, Ale
In recent years, natural language generative models have shown outstanding performance in text generation tasks. However, when facing specific tasks or particular requirements, they may exhibit poor performance or require adjustments that demand large amounts of additional data. This work introduces PAGE (Prompt Augmentation for text Generation Enhancement), a framework designed to assist these models through the use of simple auxiliary modules. These modules, lightweight models such as classifiers or extractors, provide inferences from the input text. The output of these auxiliaries is then used to construct an enriched input that improves the quality and controllability of the generation. Unlike other generation-assistance approaches, PAGE does not require auxiliary generative models; instead, it proposes a simpler, modular architecture that is easy to adapt to different tasks. This paper presents the proposal, its components and architecture, and reports a proof of concept in the domain of requirements engineering, where an auxiliary module with a classifier is used to improve the quality of software requirements generation.
academic
PAGE: Prompt Augmentation for text Generation Enhancement
Recent natural language generation models have demonstrated superior performance in text generation tasks. However, when faced with domain-specific tasks or special requirements, these models may underperform or require substantial additional data for fine-tuning. This research proposes PAGE (Prompt Augmentation for text Generation Enhancement), a framework that assists these models through simple auxiliary modules. These auxiliary modules are lightweight models, such as classifiers or extractors, capable of providing reasoning information extracted from input text. The output of auxiliary modules is used to construct enriched inputs, thereby improving the quality and controllability of generated text. Unlike other generation-assisted methods, PAGE does not require auxiliary generative models; instead, it proposes a simpler, modular, and easily adaptable architecture for different tasks.
Insufficient Task-Specific Performance: Although large language models (LLMs) excel at general text generation tasks, they often underperform when facing domain-specific or specially-constrained tasks
High Fine-tuning Costs: Traditional solutions involve retraining or fine-tuning models, which require substantial high-quality data and computational resources
Resource Constraints: Many application scenarios lack sufficient training data and computational capacity
Propose PAGE Framework: An innovative prompt augmentation architecture that improves text generation quality through simple auxiliary modules
Modular Design: Unlike other methods, PAGE does not rely on auxiliary generative models but uses lightweight classifiers and extractors
Resource-Friendly: Significantly reduces requirements for training data and computational resources
Practical Validation: Proof-of-concept in software requirements engineering using EARS syntax for structured requirement generation
Performance Improvement: Achieves significant improvements over baseline methods on ROUGE metrics (ROUGE-1 improvement of 65.41%, ROUGE-2 improvement of 205.62%)
Input: Raw text descriptions (e.g., natural language requirement descriptions)
Output: Structured, high-quality text (e.g., requirement expressions conforming to specific grammar specifications)
Objective: Enhance input prompts through auxiliary information to improve generation quality without retraining the main model
The paper cites multiple important related works, including:
Foundational work on Transformer architecture (Vaswani et al., 2017)
Major large language models (GPT, BERT, T5, Llama, etc.)
EARS requirement syntax specification (Mavin et al., 2009)
ROUGE evaluation metrics (Lin, 2004)
Related generation enhancement methods, etc.
Overall Assessment: This is a research paper proposing an innovative method. The PAGE framework provides new insights for text generation enhancement. While there is room for improvement in validation scope and theoretical analysis, its practical value and technical innovation merit recognition. This method is particularly suitable for application scenarios requiring rapid task adaptation with limited resources.