SWEAT: Scoring Polarization of Topics across Different Corpora

Federico Bianchi; Marco Marelli; Paolo Nicoli; Matteo Palmonari

arXiv:2109.07231·cs.CL·September 16, 2021

TL;DR

This paper introduces SWEAT, a new statistical method for measuring the polarization of topics across different text corpora, aiding social science research.

Contribution

The paper presents SWEAT, a novel measure that quantifies topic polarization using distributional representations and opposite-valence wordsets.

Findings

01 SWEAT effectively measures polarization differences.

02 Validation confirms SWEAT's reliability.

03 Case study demonstrates practical utility.

Abstract

Understanding differences of viewpoints across corpora is a fundamental task for computational social sciences. In this paper, we propose the Sliced Word Embedding Association Test (SWEAT), a novel statistical measure to compute the relative polarization of a topical wordset across two distributional representations. To this end, SWEAT uses two additional wordsets, deemed to have opposite valence, to represent two different poles. We validate our approach and illustrate a case study to show the usefulness of the introduced measure.
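
The idea in the abstract can be illustrated with a minimal sketch. It assumes a WEAT-style association (mean cosine similarity to one pole minus mean cosine similarity to the other), computed separately in each embedding space and then differenced. The abstract does not give the formula, so this form, the function names (`association`, `relative_polarization`), and the toy 2-dimensional vectors are all illustrative assumptions. They are not the authors' implementation or the API of the `vinid/sweat` repository.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def association(word, pole_a, pole_b, emb):
    """Assumed WEAT-style association of `word` with pole A versus pole B
    inside a single embedding space `emb` (a dict: word -> vector)."""
    to_a = np.mean([cosine(emb[word], emb[a]) for a in pole_a])
    to_b = np.mean([cosine(emb[word], emb[b]) for b in pole_b])
    return float(to_a - to_b)

def relative_polarization(topic, pole_a, pole_b, emb_1, emb_2):
    """Difference between the summed topic associations in two spaces.
    Positive: the topic leans toward pole A more in emb_1 than in emb_2."""
    s1 = sum(association(w, pole_a, pole_b, emb_1) for w in topic)
    s2 = sum(association(w, pole_a, pole_b, emb_2) for w in topic)
    return s1 - s2

# Hypothetical toy embeddings for two corpora (made-up vectors).
emb_1 = {"tax": np.array([1.0, 0.0]),
         "good": np.array([1.0, 0.0]),
         "bad": np.array([0.0, 1.0])}
emb_2 = {"tax": np.array([0.0, 1.0]),
         "good": np.array([1.0, 0.0]),
         "bad": np.array([0.0, 1.0])}

score = relative_polarization(["tax"], ["good"], ["bad"], emb_1, emb_2)
print(score)
```

In this toy setup the topic word sits next to the positive pole in the first space and next to the negative pole in the second, so the score is positive. A full statistical measure would also need a significance test (for example, a permutation test over the pole wordsets), which this sketch omits.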

Code & Models

Repositories

vinid/sweat (Official)

Taxonomy

Methods: Temporal Word Embeddings with a Compass