Differentiable Data Augmentation for Contrastive Sentence Representation   Learning

Tianduo Wang; Wei Lu

arXiv:2210.16536·cs.CL·November 1, 2022

Differentiable Data Augmentation for Contrastive Sentence Representation Learning

Tianduo Wang, Wei Lu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a differentiable data augmentation technique using prefix-tuning to enhance contrastive sentence representation learning, significantly improving performance especially with limited labeled data.

Contribution

It proposes a novel differentiable data augmentation method with prefix-tuning for contrastive learning, improving sentence representations in semi-supervised and supervised settings.

Findings

01

Significant performance improvements over existing methods.

02

Enhanced label efficiency in low-resource settings.

03

Effective augmentation through differentiable prefix-tuning.

Abstract

Fine-tuning a pre-trained language model via the contrastive learning framework with a large amount of unlabeled sentences or labeled sentence pairs is a common way to obtain high-quality sentence representations. Although the contrastive learning framework has shown its superiority on sentence representation learning over previous methods, the potential of such a framework is under-explored so far due to the simple method it used to construct positive pairs. Motivated by this, we propose a method that makes hard positives from the original training examples. A pivotal ingredient of our approach is the use of prefix that is attached to a pre-trained language model, which allows for differentiable data augmentation during contrastive learning. Our method can be summarized in two steps: supervised prefix-tuning followed by joint contrastive fine-tuning with unlabeled or labeled examples.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tianduowang/diffaug
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsContrastive Learning