Combining Improvements for Exploiting Dependency Trees in Neural Semantic Parsing
Defeng Xie, Jianmin Ji, Jiafei Xu, Ran Ji

TL;DR
This paper explores combining three dependency tree-based methods in a Transformer semantic parser, demonstrating that their integration improves performance and achieves state-of-the-art results on several benchmarks.
Contribution
It introduces a systematic study of combining PASCAL, SAWRs, and CA methods, showing their complementary effects in neural semantic parsing.
Findings
Combining CA with PASCAL or SAWRs improves parsing accuracy.
The integrated methods outperform previous neural approaches on ATIS, GEO, and JOBS datasets.
The study provides insights into how dependency-based enhancements complement each other.
Abstract
The dependency tree of a natural language sentence can capture the interactions between semantics and words. However, it is unclear whether those methods which exploit such dependency information for semantic parsing can be combined to achieve further improvement and the relationship of those methods when they combine. In this paper, we examine three methods to incorporate such dependency information in a Transformer based semantic parser and empirically study their combinations. We first replace standard self-attention heads in the encoder with parent-scaled self-attention (PASCAL) heads, i.e., the ones that can attend to the dependency parent of each token. Then we concatenate syntax-aware word representations (SAWRs), i.e., the intermediate hidden representations of a neural dependency parser, with ordinary word embedding to enhance the encoder. Later, we insert the constituent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Absolute Position Encodings · Residual Connection · Softmax · Adam · Position-Wise Feed-Forward Layer · Dense Connections
