Validating Political Position Predictions of Arguments
Jordan Robinson, Angus R. Williams, Katie Atkinson, Anthony G. Cohn

TL;DR
This paper introduces a dual-scale validation framework for assessing political stance predictions in argumentative discourse, combining pointwise and pairwise human annotations, and creates a large knowledge base for political arguments.
Contribution
It presents a novel validation methodology for subjective continuous attributes, and constructs a structured knowledge base enabling reasoning and retrieval in political domains.
Findings
Moderate agreement between humans and models on pointwise evaluations (Krippendorff's α=0.578)
High alignment between human and model rankings on pairwise validation (α=0.86)
Demonstrates extraction of ordinal structure from subjective language model predictions
Abstract
Real-world knowledge representation often requires capturing subjective, continuous attributes -- such as political positions -- that conflict with pairwise validation, the widely accepted gold standard for human evaluation. We address this challenge through a dual-scale validation framework applied to political stance prediction in argumentative discourse, combining pointwise and pairwise human annotation. Using 22 language models, we construct a large-scale knowledge base of political position predictions for 23,228 arguments drawn from 30 debates that appeared on the UK politicial television programme \textit{Question Time}. Pointwise evaluation shows moderate human-model agreement (Krippendorff's ), reflecting intrinsic subjectivity, while pairwise validation reveals substantially stronger alignment between human- and model-derived rankings ( for the best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Computational and Text Analysis Methods
