Virtual Augmentation Supported Contrastive Learning of Sentence   Representations

Dejiao Zhang; Wei Xiao; Henghui Zhu; Xiaofei Ma; Andrew O. Arnold

arXiv:2110.08552·cs.CL·February 28, 2022·6 cites

Virtual Augmentation Supported Contrastive Learning of Sentence Representations

Dejiao Zhang, Wei Xiao, Henghui Zhu, Xiaofei Ma, Andrew O. Arnold

PDF

Open Access 2 Repos

TL;DR

This paper introduces VaSCL, a novel contrastive learning method for sentence representations that uses virtual augmentations generated from in-batch neighbors, achieving state-of-the-art results in unsupervised settings.

Contribution

The paper proposes a new virtual augmentation technique for contrastive learning in NLP that leverages in-batch neighbors to improve sentence representations.

Findings

01

VaSCL achieves new state-of-the-art performance on multiple downstream tasks.

02

The virtual augmentation method enhances contrastive learning without domain-specific data augmentation.

03

Using in-batch neighbors effectively approximates neighborhoods for augmentation.

Abstract

Despite profound successes, contrastive representation learning relies on carefully designed data augmentations using domain specific knowledge. This challenge is magnified in natural language processing where no general rules exist for data augmentation due to the discrete nature of natural language. We tackle this challenge by presenting a Virtual augmentation Supported Contrastive Learning of sentence representations (VaSCL). Originating from the interpretation that data augmentation essentially constructs the neighborhoods of each training instance, we in turn utilize the neighborhood to generate effective data augmentations. Leveraging the large training batch size of contrastive learning, we approximate the neighborhood of an instance via its K-nearest in-batch neighbors in the representation space. We then define an instance discrimination task regarding this neighborhood and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Interpreting and Communication in Healthcare

MethodsContrastive Learning