Structure-aware Fine-tuning of Sequence-to-sequence Transformers for Transition-based AMR Parsing
Jiawei Zhou, Tahira Naseem, Ram\'on Fernandez Astudillo, Young-Suk, Lee, Radu Florian, Salim Roukos

TL;DR
This paper introduces a structure-aware fine-tuning method for sequence-to-sequence transformers in AMR parsing, improving accuracy while maintaining graph properties and simplifying the transition system.
Contribution
It proposes a simplified transition set and modeling strategies that leverage pre-trained models for better structured AMR parsing.
Findings
Achieves new state-of-the-art on AMR 2.0
Retains graph well-formedness properties
Simplifies transition-based parsing approach
Abstract
Predicting linearized Abstract Meaning Representation (AMR) graphs using pre-trained sequence-to-sequence Transformer models has recently led to large improvements on AMR parsing benchmarks. These parsers are simple and avoid explicit modeling of structure but lack desirable properties such as graph well-formedness guarantees or built-in graph-sentence alignments. In this work we explore the integration of general pre-trained sequence-to-sequence language models and a structure-aware transition-based approach. We depart from a pointer-based transition system and propose a simplified transition set, designed to better exploit pre-trained language models for structured fine-tuning. We also explore modeling the parser state within the pre-trained encoder-decoder architecture and different vocabulary strategies for the same purpose. We provide a detailed comparison with recent progress in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Bioinformatics
MethodsAttention Is All You Need · Linear Layer · Dropout · Label Smoothing · Layer Normalization · Dense Connections · Residual Connection · Adam · Multi-Head Attention · Absolute Position Encodings
