Language Models Learn Metadata: Political Stance Detection Case Study

Stanley Cao; Felix Drinkall

arXiv:2409.13756·cs.CL·September 24, 2024

Language Models Learn Metadata: Political Stance Detection Case Study

Stanley Cao, Felix Drinkall

PDF

Open Access

TL;DR

This paper shows that simple metadata, like party membership, can significantly improve political stance detection, outperforming complex methods, and highlights the importance of how metadata is incorporated.

Contribution

It reveals that straightforward metadata usage can surpass complex models in political stance detection and emphasizes optimal metadata integration strategies.

Findings

01

Party membership alone outperforms previous methods.

02

Prepending metadata to speeches yields best results.

03

Complex metadata systems may not learn effectively.

Abstract

Stance detection is a crucial NLP task with numerous applications in social science, from analyzing online discussions to assessing political campaigns. This paper investigates the optimal way to incorporate metadata into a political stance detection task. We demonstrate that previous methods combining metadata with language-based data for political stance detection have not fully utilized the metadata information; our simple baseline, using only party membership information, surpasses the current state-of-the-art. We then show that prepending metadata (e.g., party and policy) to political speeches performs best, outperforming all baselines, indicating that complex metadata inclusion systems may not learn the task optimally.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Natural Language Processing Techniques · Topic Modeling