PRoH: Dynamic Planning and Reasoning over Knowledge Hypergraphs for Retrieval-Augmented Generation
Zai, Tan, Wang et al.
Knowledge Hypergraphs (KHs) have recently emerged as a knowledge representation for retrieval-augmented generation (RAG), offering a paradigm to model multi-entity relations into a structured form. However, existing KH-based RAG methods suffer from three major limitations: static retrieval planning, non-adaptive retrieval execution, and superficial use of KH structure and semantics, which constrain their ability to perform effective multi-hop question answering. To overcome these limitations, we propose PRoH, a dynamic Planning and Reasoning over Knowledge Hypergraphs framework. PRoH incorporates three core innovations: (i) a context-aware planning module that sketches the local KH neighborhood to guide structurally grounded reasoning plan generation; (ii) a structured question decomposition process that organizes subquestions as a dynamically evolving Directed Acyclic Graph (DAG) to enable adaptive, multi-trajectory exploration; and (iii) an Entity-Weighted Overlap (EWO)-guided reasoning path retrieval algorithm that prioritizes semantically coherent hyperedge traversals. Experiments across multiple domains demonstrate that PRoH achieves state-of-the-art performance, surpassing the prior SOTA model HyperGraphRAG by an average of 19.73% in F1 and 8.41% in Generation Evaluation (G-E) score, while maintaining strong robustness in long-range multi-hop reasoning tasks.
academic
PRoH: Dynamic Planning and Reasoning over Knowledge Hypergraphs for Retrieval-Augmented Generation
Knowledge Hypergraphs (KHs) represent an emerging knowledge representation paradigm for Retrieval-Augmented Generation (RAG), offering a structured approach to modeling multi-entity relationships. However, existing KH-based RAG methods suffer from three major limitations: static retrieval planning, non-adaptive retrieval execution, and shallow exploitation of KH structural semantics, which restrict their capability for effective multi-hop question answering. To overcome these limitations, this paper proposes PRoH—a dynamic knowledge hypergraph planning and reasoning framework. PRoH incorporates three core innovations: (1) a context-aware planning module that guides structured reasoning plan generation by delineating local KH neighborhoods; (2) a structured question decomposition process that organizes sub-questions into dynamically evolving directed acyclic graphs (DAGs) to enable adaptive multi-trajectory exploration; (3) an Entity-Weighted Overlap (EWO)-guided reasoning path retrieval algorithm that prioritizes semantically coherent hyperedge traversal.
Traditional RAG systems primarily rely on semantic similarity for retrieval, failing to capture structured relational knowledge inherent in many information domains, and frequently retrieving redundant or noisy content. While graph-based RAG improves upon this through knowledge graphs (KGs), most existing frameworks only model binary relationships between two entities, overlooking the n-ary nature of many real-world relationships.
Many real-world relationships involve multiple entities, such as "Mario + Rabbids Kingdom Battle is the first major collaboration between Nintendo and Ubisoft," which simultaneously connects three entities. Decomposing such n-ary relationships into multiple binary edges inevitably results in the loss of critical structural and semantic information.
Existing KH-based RAG methods suffer from three major limitations:
Static Retrieval Planning: Relies on predefined, hard-coded retrieval pipelines that apply identical operation sequences regardless of query content or graph context
Non-Adaptive Retrieval Execution: Employs one-shot, non-iterative retrieval approaches that cannot leverage intermediate reasoning results to optimize retrieval
Shallow Exploitation of Graph Structural Semantics: Primarily treats hyperedges as simple links or routing mechanisms to access relevant text blocks, overlooking the rich relational semantics encoded within hyperedges
Proposes the PRoH Framework: A dynamic knowledge hypergraph RAG framework that fully leverages the expressive power of hypergraphs for multi-hop question answering
Context-Aware Planning Mechanism: A planning mechanism that delineates the underlying knowledge hypergraph and generates feasible reasoning plans
EWO-Guided Reasoning Path Retrieval Strategy: A fine-grained, semantically-aware exploration strategy tailored for knowledge hypergraphs
Significant Performance Improvements: Achieves state-of-the-art performance across multiple knowledge domains, with average F1 score improvements of 19.73% and Generation Evaluation (G-E) score improvements of 8.41%
Given a question q and a knowledge hypergraph H = (V, E), hypergraph RAG requires retrieving question-relevant knowledge from H (a set of facts F), then generating an answer a(q) based on q and F.
On long-horizon multi-hop question answering tasks, PRoH demonstrates robust performance with average F1 improvements of 26.68%, reaching maximum improvements of 44.87% in the computer science domain.
The PRoH-L variant maintains competitive performance while significantly reducing token usage, achieving 30.07% token reduction in the agriculture domain while improving F1 by 16.58%.
Existing graph-based RAG methods achieve more precise retrieval and relationship-aware reasoning through knowledge graphs, but are mostly limited to binary relationship representations.
Early systems such as HyperGraphRAG and Hyper-RAG extract hyperedges to capture higher-order relationships, but still rely on heuristic one-shot retrieval pipelines lacking context-awareness and iterative reasoning capabilities.
Strong Innovation: First to propose a dynamic planning and reasoning KH-RAG framework, addressing core limitations of existing methods
Significant Technical Contributions: The EWO scoring mechanism and structured question decomposition represent important innovations tailored to hypergraph characteristics
Comprehensive Experiments: Covers multiple domains and long-horizon reasoning tasks with thorough ablation studies
Substantial Performance Gains: Consistent and significant improvements over state-of-the-art methods
The paper cites 40 relevant references covering important works in graph-based RAG, knowledge hypergraphs, multi-hop reasoning, and related fields, providing a solid theoretical foundation for the research.