Improving AMR Parsing with Sequence-to-Sequence Pre-training
Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang, Guodong Zhou

TL;DR
This paper introduces a sequence-to-sequence pre-training approach for AMR parsing that leverages multi-task learning, significantly improving performance on benchmark datasets and achieving state-of-the-art results with simpler models.
Contribution
It proposes a novel seq2seq pre-training and multi-task fine-tuning method specifically tailored for AMR parsing, outperforming previous models.
Findings
Pre-trained models improve AMR parsing accuracy from 71.5 to 80.2.
Joint pre-training on translation, syntax, and AMR tasks enhances performance.
Achieves state-of-the-art results with simpler seq2seq models.
Abstract
In the literature, the research on abstract meaning representation (AMR) parsing is much restricted by the size of human-curated dataset which is critical to build an AMR parser with good performance. To alleviate such data size restriction, pre-trained models have been drawing more and more attention in AMR parsing. However, previous pre-trained models, like BERT, are implemented for general purpose which may not work as expected for the specific task of AMR parsing. In this paper, we focus on sequence-to-sequence (seq2seq) AMR parsing and propose a seq2seq pre-training approach to build pre-trained models in both single and joint way on three relevant tasks, i.e., machine translation, syntactic parsing, and AMR parsing itself. Moreover, we extend the vanilla fine-tuning method to a multi-task learning fine-tuning method that optimizes for the performance of AMR parsing while endeavors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsLinear Layer · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence · Dense Connections · Layer Normalization · WordPiece · Multi-Head Attention · Dropout
