Improving AMR Parsing with Sequence-to-Sequence Pre-training

Dongqin Xu; Junhui Li; Muhua Zhu; Min Zhang; Guodong Zhou

arXiv:2010.01771·cs.CL·October 6, 2020·6 cites

Improving AMR Parsing with Sequence-to-Sequence Pre-training

Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang, Guodong Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a sequence-to-sequence pre-training approach for AMR parsing that leverages multi-task learning, significantly improving performance on benchmark datasets and achieving state-of-the-art results with simpler models.

Contribution

It proposes a novel seq2seq pre-training and multi-task fine-tuning method specifically tailored for AMR parsing, outperforming previous models.

Findings

01

Pre-trained models improve AMR parsing accuracy from 71.5 to 80.2.

02

Joint pre-training on translation, syntax, and AMR tasks enhances performance.

03

Achieves state-of-the-art results with simpler seq2seq models.

Abstract

In the literature, the research on abstract meaning representation (AMR) parsing is much restricted by the size of human-curated dataset which is critical to build an AMR parser with good performance. To alleviate such data size restriction, pre-trained models have been drawing more and more attention in AMR parsing. However, previous pre-trained models, like BERT, are implemented for general purpose which may not work as expected for the specific task of AMR parsing. In this paper, we focus on sequence-to-sequence (seq2seq) AMR parsing and propose a seq2seq pre-training approach to build pre-trained models in both single and joint way on three relevant tasks, i.e., machine translation, syntactic parsing, and AMR parsing itself. Moreover, we extend the vanilla fine-tuning method to a multi-task learning fine-tuning method that optimizes for the performance of AMR parsing while endeavors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xdqkid/S2S-AMR-Parser
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsLinear Layer · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence · Dense Connections · Layer Normalization · WordPiece · Multi-Head Attention · Dropout