PGDA-KGQA: A Prompt-Guided Generative Framework with Multiple Data Augmentation Strategies for Knowledge Graph Question Answering
Xiujun Zhou, Pingjian Zhang, Deyou Tang

TL;DR
PGDA-KGQA introduces a prompt-guided generative framework that employs multiple data augmentation strategies, significantly enhancing multi-hop reasoning and semantic parsing in knowledge graph question answering with large language models.
Contribution
It presents a novel prompt-design paradigm and multiple augmentation techniques to generate diverse training data, improving multi-hop reasoning and semantic parsing in KGQA models.
Findings
Outperforms state-of-the-art on WebQSP and ComplexWebQuestions datasets.
Achieves up to 3.1% improvements in F1 and Accuracy.
Enhances multi-hop reasoning and robustness through data augmentation.
Abstract
Knowledge Graph Question Answering (KGQA) is a crucial task in natural language processing that requires reasoning over knowledge graphs (KGs) to answer natural language questions. Recent methods utilizing large language models (LLMs) have shown remarkable semantic parsing capabilities but are limited by the scarcity of diverse annotated data and multi-hop reasoning samples. Traditional data augmentation approaches are focus mainly on single-hop questions and prone to semantic distortion, while LLM-based methods primarily address semantic distortion but usually neglect multi-hop reasoning, thus limiting data diversity. The scarcity of multi-hop samples further weakens models' generalization. To address these issues, we propose PGDA-KGQA, a prompt-guided generative framework with multiple data augmentation strategies for KGQA. At its core, PGDA-KGQA employs a unified prompt-design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications
