Unifying Discourse Resources with Dependency Framework
Yi Cheng, Sujian Li, Yueyuan Li

TL;DR
This paper proposes a unified discourse dependency framework to integrate various Chinese discourse corpora, enabling semi-automatic conversion and improving parsing performance through leveraging combined data.
Contribution
It introduces a novel framework unifying multiple discourse annotation schemes and demonstrates how to enhance discourse parsing with this unified data.
Findings
Successful unification of diverse discourse corpora
Improved parser performance using combined data
Effective semi-automatic conversion methods
Abstract
For text-level discourse analysis, there are various discourse schemes but relatively few labeled data, because discourse research is still immature and it is labor-intensive to annotate the inner logic of a text. In this paper, we attempt to unify multiple Chinese discourse corpora under different annotation schemes with discourse dependency framework by designing semi-automatic methods to convert them into dependency structures. We also implement several benchmark dependency parsers and research on how they can leverage the unified data to improve performance.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques
