Reassessing Graph Linearization for Sequence-to-sequence AMR Parsing: On the Advantages and Limitations of Triple-Based Encoding

Jeongwoo Kang; Maximin Coavoux; C\'edric Lopez; Didier Schwab

arXiv:2505.08504·cs.CL·May 14, 2025

Reassessing Graph Linearization for Sequence-to-sequence AMR Parsing: On the Advantages and Limitations of Triple-Based Encoding

Jeongwoo Kang, Maximin Coavoux, C\'edric Lopez, Didier Schwab

PDF

1 Repo

TL;DR

This paper evaluates a triple-based linearization method for AMR parsing, highlighting its advantages over Penman encoding in representing complex graphs, but also noting current limitations in efficiency.

Contribution

It introduces a novel triple-based linearization approach for AMR graphs, addressing issues of node proximity and re-entrancy handling in sequence-to-sequence models.

Findings

01

Triple-based encoding better captures graph structure.

02

Penman encoding remains more concise and explicit.

03

Room for improvement in triple encoding efficiency.

Abstract

Sequence-to-sequence models are widely used to train Abstract Meaning Representation (Banarescu et al., 2013, AMR) parsers. To train such models, AMR graphs have to be linearized into a one-line text format. While Penman encoding is typically used for this purpose, we argue that it has limitations: (1) for deep graphs, some closely related nodes are located far apart in the linearized text (2) Penman's tree-based encoding necessitates inverse roles to handle node re-entrancy, doubling the number of relation types to predict. To address these issues, we propose a triple-based linearization method and compare its efficiency with Penman linearization. Although triples are well suited to represent a graph, our results suggest room for improvement in triple encoding to better compete with Penman's concise and explicit representation of a nested graph structure.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Emvista/Triple_AMR_Parser
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.