A Novel Dependency Framework for Enhancing Discourse Data Analysis
Kun Sun, Rong Wang

TL;DR
This paper introduces a dependency framework that converts various discourse annotations into dependency structures, enabling unified analysis across multiple languages and discourse theories, validated through BERT-based parsers and correlation metrics.
Contribution
It presents a novel method for converting PDTB and RST discourse annotations into dependency structures, enhancing cross-theoretical and cross-linguistic discourse data analysis.
Findings
PDTB dependency data is valid.
Strong correlation between RST and PDTB dependency distances.
Framework is applicable across multiple languages.
Abstract
The development of different theories of discourse structure has led to the establishment of discourse corpora based on these theories. However, the existence of discourse corpora established on different theoretical bases creates challenges when it comes to exploring them in a consistent and cohesive way. This study has as its primary focus the conversion of PDTB annotations into dependency structures. It employs refined BERT-based discourse parsers to test the validity of the dependency data derived from the PDTB-style corpora in English, Chinese, and several other languages. By converting both PDTB and RST annotations for the same texts into dependencies, this study also applies ``dependency distance'' metrics to examine the correlation between RST dependencies and PDTB dependencies in English. The results show that the PDTB dependency data is valid and that there is a strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Service-Oriented Architecture and Web Services
MethodsFocus
