Exploiting All Samples in Low-Resource Sentence Classification: Early Stopping and Initialization Parameters
Hongseok Choi, Hyunju Lee

TL;DR
This paper investigates how to best utilize limited labeled data in low-resource sentence classification by exploring training strategies, proposing an integrated approach, and demonstrating its effectiveness across multiple datasets and models.
Contribution
It introduces an integrated method combining weight averaging and non-validation stopping, improving performance without additional data or model redesigns in low-resource settings.
Findings
The integrated method outperforms conventional validation-based methods by 1.8% accuracy on average.
Performance varies significantly with different training strategies and approaches.
The method enhances state-of-the-art models using additional data or redesigns.
Abstract
To improve deep-learning performance in low-resource settings, many researchers have redesigned model architectures or applied additional data (e.g., external resources, unlabeled samples). However, there have been relatively few discussions on how to make good use of small amounts of labeled samples, although it is potentially beneficial and should be done before applying additional data or redesigning models. In this study, we assume a low-resource setting in which only a few labeled samples (i.e., 30-100 per class) are available, and we discuss how to exploit them without additional data or model redesigns. We explore possible approaches in the following three aspects: training-validation splitting, early stopping, and weight initialization. Extensive experiments are conducted on six public sentence classification datasets. Performance on various evaluation metrics (e.g., accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification
MethodsEarly Stopping
