DMRST: A Joint Framework for Document-Level Multilingual RST Discourse   Segmentation and Parsing

Zhengyuan Liu; Ke Shi; Nancy F. Chen

arXiv:2110.04518·cs.CL·October 12, 2021

DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing

Zhengyuan Liu, Ke Shi, Nancy F. Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces DMRST, a comprehensive framework for multilingual RST discourse parsing that jointly performs EDU segmentation and discourse tree parsing, with improved domain generality and state-of-the-art results.

Contribution

It presents a joint framework for EDU segmentation and discourse parsing in multiple languages, incorporating cross-translation augmentation for better domain robustness.

Findings

01

Achieves state-of-the-art performance on multilingual RST parsing tasks.

02

Supports joint EDU segmentation and discourse tree parsing.

03

Enhances domain generality through cross-translation augmentation.

Abstract

Text discourse parsing weighs importantly in understanding information flow and argumentative structure in natural language, making it beneficial for downstream tasks. While previous work significantly improves the performance of RST discourse parsing, they are not readily applicable to practical use cases: (1) EDU segmentation is not integrated into most existing tree parsing frameworks, thus it is not straightforward to apply such models on newly-coming data. (2) Most parsers cannot be used in multilingual scenarios, because they are developed only in English. (3) Parsers trained from single-domain treebanks do not generalize well on out-of-domain inputs. In this work, we propose a document-level multilingual RST discourse parsing framework, which conducts EDU segmentation and discourse tree parsing jointly. Moreover, we propose a cross-translation augmentation strategy to enable the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

seq-to-mind/DMRST_Parser
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems