Measuring vagueness and subjectivity in texts: from symbolic to neural   VAGO

Benjamin Icard; Vincent Claveau; Ghislain Atemezing; Paul \'Egr\'e

arXiv:2309.06132·cs.CL·October 25, 2023·1 cites

Measuring vagueness and subjectivity in texts: from symbolic to neural VAGO

Benjamin Icard, Vincent Claveau, Ghislain Atemezing, Paul \'Egr\'e

PDF

Open Access

TL;DR

This paper introduces a hybrid method combining symbolic and neural approaches to measure vagueness and subjectivity in texts, demonstrating its effectiveness on French press data and enabling multilingual applications.

Contribution

It presents VAGO, an expert system for textual vagueness and subjectivity detection, and develops a neural clone based on BERT trained on symbolic scores for improved analysis.

Findings

01

VAGO effectively distinguishes fact from opinion sentences.

02

The neural clone enhances lexicon enrichment and multilingual capabilities.

03

Subjective markers are more prevalent in satirical texts.

Abstract

We present a hybrid approach to the automated measurement of vagueness and subjectivity in texts. We first introduce the expert system VAGO, we illustrate it on a small benchmark of fact vs. opinion sentences, and then test it on the larger French press corpus FreSaDa to confirm the higher prevalence of subjective markers in satirical vs. regular texts. We then build a neural clone of VAGO, based on a BERT-like architecture, trained on the symbolic VAGO scores obtained on FreSaDa. Using explainability tools (LIME), we show the interest of this neural version for the enrichment of the lexicons of the symbolic version, and for the production of versions in other languages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques