Improving Constituency Parsing with Span Attention
Yuanhe Tian, Yan Song, Fei Xia, Tong Zhang

TL;DR
This paper introduces span attention with n-gram weighting for neural constituency parsing, significantly improving performance across multiple languages by better capturing contextual information, especially for long spans.
Contribution
It proposes span attention with n-gram weighting and categorical span attention to enhance span representations in neural chart-based constituency parsing.
Findings
Achieved state-of-the-art results on Arabic, Chinese, and English datasets.
Effectively models long-span information with categorical span attention.
Demonstrates significant performance improvements over existing methods.
Abstract
Constituency parsing is a fundamental and important task for natural language understanding, where a good representation of contextual information can help this task. N-grams, which is a conventional type of feature for contextual information, have been demonstrated to be useful in many tasks, and thus could also be beneficial for constituency parsing if they are appropriately modeled. In this paper, we propose span attention for neural chart-based constituency parsing to leverage n-gram information. Considering that current chart-based parsers with Transformer-based encoder represent spans by subtraction of the hidden states at the span boundaries, which may cause information loss especially for long spans, we incorporate n-grams into span representations by weighting them according to their contributions to the parsing process. Moreover, we propose categorical span attention to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
