Label Anchored Contrastive Learning for Language Understanding

Zhenyu Zhang; Yuming Zhao; Meng Chen; Xiaodong He

arXiv:2205.10227·cs.CL·May 23, 2022

Label Anchored Contrastive Learning for Language Understanding

Zhenyu Zhang, Yuming Zhao, Meng Chen, Xiaodong He

PDF

Open Access

TL;DR

This paper introduces LaCon, a label anchored contrastive learning method for language understanding that improves classification performance by leveraging class labels for hard positive/negative mining without extra data or architecture.

Contribution

The paper proposes a novel contrastive learning approach (LaCon) that incorporates label information through multiple contrastive objectives, enhancing language understanding tasks.

Findings

01

Up to 4.1% accuracy improvement on GLUE and CLUE datasets.

02

Significant gains in few-shot and data imbalance scenarios, up to 9.4%.

03

Compatible with existing pre-trained language models without extra architecture.

Abstract

Contrastive learning (CL) has achieved astonishing progress in computer vision, speech, and natural language processing fields recently with self-supervised learning. However, CL approach to the supervised setting is not fully explored, especially for the natural language understanding classification task. Intuitively, the class label itself has the intrinsic ability to perform hard positive/negative mining, which is crucial for CL. Motivated by this, we propose a novel label anchored contrastive learning approach (denoted as LaCon) for language understanding. Specifically, three contrastive objectives are devised, including a multi-head instance-centered contrastive loss (ICL), a label-centered contrastive loss (LCL), and a label embedding regularizer (LER). Our approach does not require any specialized network architecture or any extra data augmentation, thus it can be easily plugged…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling

MethodsContrastive Learning