Comparative Insights from 12 Machine Learning Models in Extracting   Economic Ideology from Political Text

Jihed Ncib

arXiv:2501.09719·cs.CL·January 17, 2025

Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text

Jihed Ncib

PDF

Open Access

TL;DR

This paper systematically compares 12 machine learning models, including generative, fine-tuned, and zero-shot, in extracting economic ideology from UK political manifestos, highlighting their strengths, limitations, and best practices.

Contribution

It provides a comprehensive evaluation of various models for political text analysis, revealing the superior performance of generative models and the trade-offs of fine-tuning and zero-shot approaches.

Findings

01

Generative models like GPT-4o outperform others in accuracy.

02

Fine-tuning improves performance but depends on training data size.

03

Zero-shot models struggle with domain-specific signals.

Abstract

This study conducts a systematic assessment of the capabilities of 12 machine learning models and model variations in detecting economic ideology. As an evaluation benchmark, I use manifesto data spanning six elections in the United Kingdom and pre-annotated by expert and crowd coders. The analysis assesses the performance of several generative, fine-tuned, and zero-shot models at the granular and aggregate levels. The results show that generative models such as GPT-4o and Gemini 1.5 Flash consistently outperform other models against all benchmarks. However, they pose issues of accessibility and resource availability. Fine-tuning yielded competitive performance and offers a reliable alternative through domain-specific optimization. But its dependency on training data severely limits scalability. Zero-shot models consistently face difficulties with identifying signals of economic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods