Exploiting All Samples in Low-Resource Sentence Classification: Early   Stopping and Initialization Parameters

Hongseok Choi; Hyunju Lee

arXiv:2111.06971·cs.CL·July 26, 2024

Exploiting All Samples in Low-Resource Sentence Classification: Early Stopping and Initialization Parameters

Hongseok Choi, Hyunju Lee

PDF

Open Access 1 Repo

TL;DR

This paper investigates how to best utilize limited labeled data in low-resource sentence classification by exploring training strategies, proposing an integrated approach, and demonstrating its effectiveness across multiple datasets and models.

Contribution

It introduces an integrated method combining weight averaging and non-validation stopping, improving performance without additional data or model redesigns in low-resource settings.

Findings

01

The integrated method outperforms conventional validation-based methods by 1.8% accuracy on average.

02

Performance varies significantly with different training strategies and approaches.

03

The method enhances state-of-the-art models using additional data or redesigns.

Abstract

To improve deep-learning performance in low-resource settings, many researchers have redesigned model architectures or applied additional data (e.g., external resources, unlabeled samples). However, there have been relatively few discussions on how to make good use of small amounts of labeled samples, although it is potentially beneficial and should be done before applying additional data or redesigning models. In this study, we assume a low-resource setting in which only a few labeled samples (i.e., 30-100 per class) are available, and we discuss how to exploit them without additional data or model redesigns. We explore possible approaches in the following three aspects: training-validation splitting, early stopping, and weight initialization. Extensive experiments are conducted on six public sentence classification datasets. Performance on various evaluation metrics (e.g., accuracy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dmcb-gist/exploit_all_samples
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification

MethodsEarly Stopping