Multilingual estimation of political-party positioning: From label   aggregation to long-input Transformers

Dmitry Nikolaev; Tanise Ceron; Sebastian Pad\'o

arXiv:2310.12575·cs.CL·October 20, 2023·1 cites

Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers

Dmitry Nikolaev, Tanise Ceron, Sebastian Pad\'o

PDF

Open Access 1 Repo

TL;DR

This paper compares label aggregation and long-input Transformer models for multilingual political scale estimation from party manifestos, finding that label aggregation yields the best results across diverse languages and countries.

Contribution

It introduces and evaluates two approaches for multilingual political scaling analysis, demonstrating the effectiveness of label aggregation and Transformer models on a large, multilingual dataset.

Findings

01

Label aggregation outperforms Transformer models in accuracy.

02

State-of-the-art models handle long texts effectively.

03

Analysis covers 41 countries and 27 languages.

Abstract

Scaling analysis is a technique in computational political science that assigns a political actor (e.g. politician or party) a score on a predefined scale based on a (typically long) body of text (e.g. a parliamentary speech or an election manifesto). For example, political scientists have often used the left--right scale to systematically analyse political landscapes of different countries. NLP methods for automatic scaling analysis can find broad application provided they (i) are able to deal with long texts and (ii) work robustly across domains and languages. In this work, we implement and compare two approaches to automatic scaling analysis of political-party manifestos: label aggregation, a pipeline strategy relying on annotations of individual statements from the manifestos, and long-input-Transformer-based models, which compute scaling values directly from raw text. We carry out…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

macleginn/party-positioning-code
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Natural Language Processing Techniques · Topic Modeling