APAM: Adaptive Pre-training and Adaptive Meta Learning in Language Model   for Noisy Labels and Long-tailed Learning

Sunyi Chi; Bo Dong; Yiming Xu; Zhenyu Shi; Zheng Du

arXiv:2302.03488·cs.CL·May 4, 2023·1 cites

APAM: Adaptive Pre-training and Adaptive Meta Learning in Language Model for Noisy Labels and Long-tailed Learning

Sunyi Chi, Bo Dong, Yiming Xu, Zhenyu Shi, Zheng Du

PDF

Open Access

TL;DR

This paper introduces a flexible framework combining adaptive pre-training and meta-learning to improve NLP model robustness against noisy labels and long-tailed data distributions, outperforming existing methods.

Contribution

It proposes a novel contrastive learning-based adaptive framework with a learnable re-weighting module for handling noisy and long-tailed NLP data.

Findings

01

Consistently outperforms baseline methods in experiments.

02

Effectively mitigates negative impact of noisy labels.

03

Handles long-tailed data distribution robustly.

Abstract

Practical natural language processing (NLP) tasks are commonly long-tailed with noisy labels. Those problems challenge the generalization and robustness of complex models such as Deep Neural Networks (DNNs). Some commonly used resampling techniques, such as oversampling or undersampling, could easily lead to overfitting. It is growing popular to learn the data weights leveraging a small amount of metadata. Besides, recent studies have shown the advantages of self-supervised pre-training, particularly to the under-represented data. In this work, we propose a general framework to handle the problem of both long-tail and noisy labels. The model is adapted to the domain of problems in a contrastive learning manner. The re-weighting module is a feed-forward network that learns explicit weighting functions and adapts weights according to metadata. The framework further adapts weights of terms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Machine Learning and Data Classification · Water Systems and Optimization

MethodsContrastive Learning