Boosting classification reliability of NLP transformer models in the   long run

Zolt\'an Kmetty; Bence Koll\'anyi; Kriszti\'an Boros

arXiv:2302.10016·cs.CL·February 18, 2025·1 cites

Boosting classification reliability of NLP transformer models in the long run

Zolt\'an Kmetty, Bence Koll\'anyi, Kriszti\'an Boros

PDF

Open Access

TL;DR

This study evaluates methods for maintaining NLP transformer model performance over time in dynamic environments, emphasizing the importance of continuous fine-tuning and annotation to sustain classification reliability.

Contribution

It compares different fine-tuning strategies for BERT in long-term classification tasks and demonstrates the effectiveness of using all available unlabeled data for model updates.

Findings

01

Using all available unlabeled comments improves model performance.

02

Random sampling from new data is more effective than focusing only on unseen words.

03

Regular annotation and fine-tuning are necessary to prevent performance degradation.

Abstract

Transformer-based machine learning models have become an essential tool for many natural language processing (NLP) tasks since the introduction of the method. A common objective of these projects is to classify text data. Classification models are often extended to a different topic and/or time period. In these situations, deciding how long a classification is suitable for and when it is worth re-training our model is difficult. This paper compares different approaches to fine-tune a BERT model for a long-running classification task. We use data from different periods to fine-tune our original BERT model, and we also measure how a second round of annotation could boost the classification quality. Our corpus contains over 8 million comments on COVID-19 vaccination in Hungary posted between September 2020 and December 2021. Our results show that the best solution is using all available…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Attention Dropout · Dropout · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Softmax · Linear Layer · Layer Normalization