Fine-Tuning Large Language Models for Scientific Text Classification: A Comparative Study

Zhyar Rzgar K Rostam; G\'abor Kert\'esz

arXiv:2412.00098·cs.CL·September 9, 2025

Fine-Tuning Large Language Models for Scientific Text Classification: A Comparative Study

Zhyar Rzgar K Rostam, G\'abor Kert\'esz

PDF

Open Access

TL;DR

This study evaluates the performance of various large language models in scientific text classification, demonstrating that domain-specific models like SciBERT outperform general-purpose models, highlighting the importance of domain adaptation.

Contribution

It provides a comparative analysis of fine-tuning four LLMs on scientific datasets, emphasizing the superiority of domain-specific models for specialized text classification tasks.

Findings

01

SciBERT outperforms other models in accuracy

02

Domain-specific models excel in scientific text classification

03

LLMs show advantages over traditional deep learning models in domain tasks

Abstract

The exponential growth of online textual content across diverse domains has necessitated advanced methods for automated text classification. Large Language Models (LLMs) based on transformer architectures have shown significant success in this area, particularly in natural language processing (NLP) tasks. However, general-purpose LLMs often struggle with domain-specific content, such as scientific texts, due to unique challenges like specialized vocabulary and imbalanced data. In this study, we fine-tune four state-of-the-art LLMs BERT, SciBERT, BioBERT, and BlueBERT on three datasets derived from the WoS-46985 dataset to evaluate their performance in scientific text classification. Our experiments reveal that domain-specific models, particularly SciBERT, consistently outperform general-purpose models in both abstract-based and keyword-based classification tasks. Additionally, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Advanced Text Analysis Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Softmax · Dropout · Dense Connections · Layer Normalization · Linear Layer · Multi-Head Attention · Weight Decay · Linear Warmup With Linear Decay · WordPiece