Exploiting Pre-trained Encoder-Decoder Transformers for Sequence-to-Sequence Constituent Parsing

Daniel Fern\'andez-Gonz\'alez; Cristina Outeiri\~no Cid

arXiv:2605.13373·cs.CL·May 14, 2026

Exploiting Pre-trained Encoder-Decoder Transformers for Sequence-to-Sequence Constituent Parsing

Daniel Fern\'andez-Gonz\'alez, Cristina Outeiri\~no Cid

PDF

TL;DR

This paper explores using pre-trained encoder-decoder models like BART, mBART, and T5 for sequence-to-sequence constituent parsing, showing they outperform previous models and compete with task-specific parsers.

Contribution

It extends the sequence-to-sequence parsing framework by thoroughly investigating pre-trained encoder-decoder models for constituency parsing.

Findings

01

Pre-trained encoder-decoder models outperform prior sequence-to-sequence parsers.

02

The approach achieves competitive results with task-specific parsers on continuous benchmarks.

03

Extensive evaluation across various linearization strategies demonstrates robustness.

Abstract

To achieve deep natural language understanding, syntactic constituent parsing plays a crucial role and is widely required by many artificial intelligence systems for processing both text and speech. A recent approach involves using standard sequence-to-sequence models to handle constituent parsing as a machine translation problem, moving away from traditional task-specific parsers. These models are typically initialized with pre-trained encoder-only language models like BERT or RoBERTa. However, the use of pre-trained encoder-decoder language models for constituency parsing has not been thoroughly explored. To bridge this gap, we extend the sequence-to-sequence framework by investigating parsers built on pre-trained encoder-decoder architectures, including BART, mBART, and T5. We fine-tune them to generate linearized parse trees and extensively evaluate them on different linearization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.