Less is More: Data-Efficient Complex Question Answering over Knowledge Bases
Yuncheng Hua, Yuan-Fang Li, Guilin Qi, Wei Wu, Jingyao Zhang, Daiqing, Qi

TL;DR
This paper introduces NS-CQA, a data-efficient neural-symbolic framework for complex question answering over knowledge bases, achieving high performance with minimal training data by combining reinforcement learning, symbolic actions, and curriculum-guided rewards.
Contribution
The paper presents a novel reinforcement learning approach with symbolic actions, memory buffer, and adaptive rewards to improve data efficiency in complex KB question answering.
Findings
Outperforms state-of-the-art on CQA and WebQuestionsSP datasets.
Achieves high accuracy using only about 1% of training data on CQA.
Effective in handling higher complexity questions.
Abstract
Question answering is an effective method for obtaining information from knowledge bases (KB). In this paper, we propose the Neural-Symbolic Complex Question Answering (NS-CQA) model, a data-efficient reinforcement learning framework for complex question answering by using only a modest number of training samples. Our framework consists of a neural generator and a symbolic executor that, respectively, transforms a natural-language question into a sequence of primitive actions, and executes them over the knowledge base to compute the answer. We carefully formulate a set of primitive symbolic actions that allows us to not only simplify our neural network design but also accelerate model convergence. To reduce search space, we employ the copy and masking mechanisms in our encoder-decoder architecture to drastically reduce the decoder output vocabulary and improve model generalizability. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
