TL;DR
This paper develops a transformer-based model to predict the political orientation of German texts on a continuous spectrum, using diverse corpora and evaluating multiple models for accuracy and robustness.
Contribution
It introduces a methodology for predicting political bias in German texts with a continuous scale, comparing 13 transformer models across multiple datasets.
Findings
DeBERTa-large achieved the highest in-domain F1 score of 0.844.
Gemma2-2B performed best on newspaper out-of-domain test with MAE=0.172.
Transformer models can effectively recognize political framing in German news.
Abstract
Elections represent a crucial milestone in a nation's ongoing development. To better understand the political rhetoric from various movements, ranging from left to right, we propose a transformer-based model capable of projecting the political orientation of a text on a continuous left-to-right spectrum, represented by a normalized scalar d between -1 and 1. This approach enables analysts to focus on specific segments of the political landscape, such as conservatives, while excluding liberal and far-right movements. Such a task can only be achieved with multiclass classifiers, provided that the desired orientation is incorporated within one of their predefined classes. To determine the most suitable foundation model among 13 candidate transformers for this task, we constructed four distinct corpora. One corpus comprised annotated plenary notes from the German Bundestag, while another…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
