Distant finetuning with discourse relations for stance classification

Lifeng Jin; Kun Xu; Linfeng Song; Dong Yu

arXiv:2204.12693·cs.CL·April 28, 2022

Distant finetuning with discourse relations for stance classification

Lifeng Jin, Kun Xu, Linfeng Song, Dong Yu

PDF

Open Access

TL;DR

This paper introduces a topic-independent stance classification method using discourse relations to automatically generate training data, combined with a multi-stage training framework, leading to state-of-the-art results in a shared task.

Contribution

The paper presents a novel approach that leverages discourse relations for silver label data extraction and a 3-stage training process to improve stance classification performance.

Findings

01

Achieved top performance in the NLPCC 2021 stance classification shared task.

02

Demonstrated that discourse relation-based data extraction enhances model accuracy.

03

Showed that multi-stage training reduces noise and improves stance classification results.

Abstract

Approaches for the stance classification task, an important task for understanding argumentation in debates and detecting fake news, have been relying on models which deal with individual debate topics. In this paper, in order to train a system independent from topics, we propose a new method to extract data with silver labels from raw text to finetune a model for stance classification. The extraction relies on specific discourse relation information, which is shown as a reliable and accurate source for providing stance information. We also propose a 3-stage training framework where the noisy level in the data used for finetuning decreases over different stages going from the most noisy to the least noisy. Detailed experiments show that the automatically annotated dataset as well as the 3-stage training help improve model performance in stance classification. Our approach ranks 1st…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Hate Speech and Cyberbullying Detection · Software Engineering Research