Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification
Guanyi Mou, Yichuan Li, Kyumin Lee

TL;DR
This paper introduces a novel meta reweighting framework that leverages contrastive learning to improve text classification by effectively utilizing and refining augmented data based on their quality.
Contribution
The paper proposes a new framework combining meta learning and contrastive learning to reweight and refine augmented samples, enhancing model performance on text classification tasks.
Findings
Achieves up to 4.4% absolute improvement on GLUE datasets.
Effectively cooperates with existing models like RoBERTa and Text-CNN.
Provides in-depth analysis of network components' contributions.
Abstract
Data augmentation has shown its effectiveness in resolving the data-hungry problem and improving model's generalization ability. However, the quality of augmented data can be varied, especially compared with the raw/original data. To boost deep learning models' performance given augmented data/samples in text classification tasks, we propose a novel framework, which leverages both meta learning and contrastive learning techniques as parts of our design for reweighting the augmented samples and refining their feature representations based on their quality. As part of the framework, we propose novel weight-dependent enqueue and dequeue algorithms to utilize augmented samples' weight/quality information effectively. Through experiments, we show that our framework can reasonably cooperate with existing deep learning models (e.g., RoBERTa-base and Text-CNN) and augmentation techniques (e.g.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
