Semantic Parsing with Candidate Expressions for Knowledge Base Question Answering
Daehwan Nam, Gary Geunbae Lee

TL;DR
This paper introduces a grammar-augmented semantic parser with candidate expressions for improved accuracy and speed in knowledge base question answering, leveraging large language models and constrained decoding.
Contribution
It proposes a novel grammar with candidate expressions and special rules, enhancing semantic parsing accuracy and decoding efficiency on large knowledge bases.
Findings
Increased parsing accuracy on KQA Pro and Overnight benchmarks.
Significantly faster decoding speed due to mask caching and sub-type inference.
Effective handling of KB elements with constrained decoding using candidate expressions.
Abstract
Semantic parsers convert natural language to logical forms, which can be evaluated on knowledge bases (KBs) to produce denotations. Recent semantic parsers have been developed with sequence-to-sequence (seq2seq) pre-trained language models (PLMs) or large language models, where the models treat logical forms as sequences of tokens. For syntactic and semantic validity, the semantic parsers use grammars that enable constrained decoding. However, the grammars lack the ability to utilize large information of KBs, although logical forms contain representations of KB elements, such as entities or relations. In this work, we propose a grammar augmented with candidate expressions for semantic parsing on a large KB with a seq2seq PLM. The grammar defines actions as production rules, and our semantic parser predicts actions during inference under the constraints by types and candidate…
Peer Reviews
Decision·Submitted to ICLR 2024
* The method is proposed for KBQA tasks is noval. * A state-of-the-art performance on KQAPRO * The abalation study shows the effectiveness of this method
* In this paper, the performance of the model is recorded only in one KB-QA dataset. It is not known how good is this method among all KB-QA tasks.
A well-executed work on designing grammars for knowledge-based question answering. The revisit of traditional grammar-based methods provides insights on whether prior information such as types are still useful in the current era of pre-trained models. (But I also feel the question whether grammar-based constraints are still useful for large models needs to be studied more)
The main contribution, as the author points out, is “To the best of our knowledge, our work is the first to use production rules as actions for semantic parsers based on pre-trained seq2seq models.”. First, I’m not sure what is the underlying challenge of extending traditional grammar-based seq2seqs to their pretrained counterparts? That is, I'm not sure about the technical contribution of the paper. Second, there are already existing works in this direction (as cited in the related work). For
- This paper provides a clear and detailed explanation of their proposed approach, including the grammar and inference algorithm used to generate candidate expressions. - This paper conducts an ablation study to analyze the contribution of candidate expressions to the performance of the semantic parser. This analysis provides insights into the effectiveness of candidate expressions and how they can be used to improve the accuracy of semantic parsers. - The paper is of high quality, with clear an
The major weakness of this work is its incremental contribution over previous grammar-based methods like [1] and [2]. In particular, [2] uses similar grammars for knowledge base question answering. It is unclear what new techniques or insights this work adds beyond existing grammar-based approaches for this task. To strengthen the paper, the authors could focus more on novel grammar designs or representations that improve performance. Additionally, the experimental validation is limited to a si
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence · Balanced Selection
