State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

Taja Kuzman Punger\v{s}ek; Peter Rupnik; Ivan Porupski; Vuk Dini\'c; Nikola Ljube\v{s}i\'c

arXiv:2511.07989·cs.CL·February 20, 2026

State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

Taja Kuzman Punger\v{s}ek, Peter Rupnik, Ivan Porupski, Vuk Dini\'c, Nikola Ljube\v{s}i\'c

PDF

Open Access

TL;DR

This study evaluates the performance of large language models versus fine-tuned BERT-like models on text classification tasks across South Slavic languages, highlighting strengths and limitations of LLMs in low-resource language settings.

Contribution

It provides a comprehensive comparison of LLMs and BERT-like models for South Slavic language classification tasks, emphasizing practical considerations and performance insights.

Findings

01

LLMs often match or surpass fine-tuned models in zero-shot settings.

02

LLMs perform similarly in South Slavic languages and English.

03

Fine-tuned BERT-like models are more practical due to lower computational costs.

Abstract

Until recently, fine-tuned BERT-like models provided state-of-the-art performance on text classification tasks. With the rise of instruction-tuned decoder-only models, commonly known as large language models (LLMs), the field has increasingly moved toward zero-shot and few-shot prompting. However, the performance of LLMs on text classification, particularly on less-resourced languages, remains under-explored. In this paper, we evaluate the performance of current language models on text classification tasks across several South Slavic languages. We compare openly available fine-tuned BERT-like models with a selection of open-source and closed-source LLMs across three tasks in three domains: sentiment classification in parliamentary speeches, topic classification in news articles and parliamentary speeches, and genre identification in web texts. Our results show that LLMs demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Authorship Attribution and Profiling · Computational and Text Analysis Methods