Political Leaning and Politicalness Classification of Texts

Matous Volf (1); Jakub Simko (2) ((1) DELTA High school of computer science; economics; Pardubice; Czechia; (2) Kempelen Institute of Intelligent Technologies; Bratislava; Slovakia)

arXiv:2507.13913·cs.CL·July 21, 2025

Political Leaning and Politicalness Classification of Texts

Matous Volf (1), Jakub Simko (2) ((1) DELTA High school of computer science, economics, Pardubice, Czechia, (2) Kempelen Institute of Intelligent Technologies, Bratislava, Slovakia)

PDF

Open Access 2 Models

TL;DR

This paper evaluates transformer-based models for classifying texts by political leaning and politicalness, highlighting limitations of current approaches and proposing a diverse dataset for improved generalization.

Contribution

It compiles a comprehensive, diverse dataset and benchmarks models to improve out-of-distribution performance in political text classification.

Findings

01

Current models perform poorly on out-of-distribution texts

02

A new diverse dataset enhances model generalization

03

Benchmarking reveals strengths and weaknesses of existing approaches

Abstract

This paper addresses the challenge of automatically classifying text according to political leaning and politicalness using transformer models. We compose a comprehensive overview of existing datasets and models for these tasks, finding that current approaches create siloed solutions that perform poorly on out-of-distribution texts. To address this limitation, we compile a diverse dataset by combining 12 datasets for political leaning classification and creating a new dataset for politicalness by extending 18 existing datasets with the appropriate label. Through extensive benchmarking with leave-one-in and leave-one-out methodologies, we evaluate the performance of existing models and train new ones with enhanced generalization capabilities.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Sentiment Analysis and Opinion Mining · Misinformation and Its Impacts