ReSCo-CC: Unsupervised Identification of Key Disinformation Sentences
Soumya Suvra Ghosal, Deepak P, Anna Jurek-Loughrey

TL;DR
This paper introduces an unsupervised method for detecting key disinformation sentences in untrustworthy texts, using a three-phase NLP approach and a new dataset, demonstrating effective identification of core disinformation.
Contribution
The paper presents a novel unsupervised framework for identifying disinformation sentences and curates a new dataset for evaluation, advancing research in disinformation detection.
Findings
Effective identification of disinformation sentences demonstrated
Proposed method outperforms related techniques in experiments
New dataset facilitates future research in disinformation detection
Abstract
Disinformation is often presented in long textual articles, especially when it relates to domains such as health, often seen in relation to COVID-19. These articles are typically observed to have a number of trustworthy sentences among which core disinformation sentences are scattered. In this paper, we propose a novel unsupervised task of identifying sentences containing key disinformation within a document that is known to be untrustworthy. We design a three-phase statistical NLP solution for the task which starts with embedding sentences within a bespoke feature space designed for the task. Sentences represented using those features are then clustered, following which the key sentences are identified through proximity scoring. We also curate a new dataset with sentence level disinformation scorings to aid evaluation for this task; the dataset is being made publicly available to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Advanced Text Analysis Techniques · Topic Modeling
