Towards Unification of Discourse Annotation Frameworks

Yingxue Fu

arXiv:2204.07781·cs.CL·April 19, 2022

Towards Unification of Discourse Annotation Frameworks

Yingxue Fu

PDF

Open Access

TL;DR

This paper explores unifying major discourse annotation frameworks (RST, PDTB, SDRT) to enable better use of existing corpora and improve downstream NLP tasks through systematic relations and automatic unification methods.

Contribution

It proposes a comprehensive approach to unify discourse structure and relations across frameworks, including evaluation and applications in multi-task learning and graphical models.

Findings

01

Preliminary analysis of framework relations

02

Proposed automatic unification methods

03

Potential improvements in downstream tasks

Abstract

Discourse information is difficult to represent and annotate. Among the major frameworks for annotating discourse information, RST, PDTB and SDRT are widely discussed and used, each having its own theoretical foundation and focus. Corpora annotated under different frameworks vary considerably. To make better use of the existing discourse corpora and achieve the possible synergy of different frameworks, it is worthwhile to investigate the systematic relations between different frameworks and devise methods of unifying the frameworks. Although the issue of framework unification has been a topic of discussion for a long time, there is currently no comprehensive approach which considers unifying both discourse structure and discourse relations and evaluates the unified framework intrinsically and extrinsically. We plan to use automatic means for the unification task and evaluate the result…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems · Topic Modeling