Selecting Between BERT and GPT for Text Classification in Political   Science Research

Yu Wang; Wen Qu; Xin Ye

arXiv:2411.05050·cs.CL·November 11, 2024·3 cites

Selecting Between BERT and GPT for Text Classification in Political Science Research

Yu Wang, Wen Qu, Xin Ye

PDF

Open Access

TL;DR

This paper compares BERT fine-tuning and GPT prompt-based methods for text classification in political science, especially under low-data conditions, providing practical guidance for researchers with limited labeled data.

Contribution

It systematically evaluates GPT-based prompt methods against BERT fine-tuning across various classification tasks in low-resource scenarios, highlighting their relative strengths and limitations.

Findings

01

GPT models perform reasonably in zero- and few-shot settings.

02

BERT fine-tuning outperforms GPT as data size increases.

03

Prompt engineering offers a cost-effective alternative for early-stage research.

Abstract

Political scientists often grapple with data scarcity in text classification. Recently, fine-tuned BERT models and their variants have gained traction as effective solutions to address this issue. In this study, we investigate the potential of GPT-based models combined with prompt engineering as a viable alternative. We conduct a series of experiments across various classification tasks, differing in the number of classes and complexity, to evaluate the effectiveness of BERT-based versus GPT-based models in low-data scenarios. Our findings indicate that while zero-shot and few-shot learning with GPT models provide reasonable performance and are well-suited for early-stage research exploration, they generally fall short - or, at best, match - the performance of BERT fine-tuning, particularly as the training set reaches a substantial size (e.g., 1,000 samples). We conclude by comparing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Byte Pair Encoding · Dropout · Discriminative Fine-Tuning · Linear Warmup With Cosine Annealing · Dense Connections · Layer Normalization · GPT