Sentence Representation Learning with Generative Objective rather than   Contrastive Objective

Bohong Wu; Hai Zhao

arXiv:2210.08474·cs.CL·October 24, 2022

Sentence Representation Learning with Generative Objective rather than Contrastive Objective

Bohong Wu, Hai Zhao

PDF

Open Access 1 Repo

TL;DR

This paper introduces a generative self-supervised learning approach for sentence representation that models intra-sentence structure through phrase reconstruction, outperforming contrastive methods on semantic tasks.

Contribution

It proposes a novel phrase-based generative objective for sentence embedding, addressing interpretability and performance issues of contrastive learning methods.

Findings

01

Outperforms contrastive methods on STS benchmarks

02

Improves downstream semantic retrieval and reranking tasks

03

Achieves significant performance gains in sentence representation learning

Abstract

Though offering amazing contextualized token-level representations, current pre-trained language models take less attention on accurately acquiring sentence-level representation during their self-supervised pre-training. However, contrastive objectives which dominate the current sentence representation learning bring little linguistic interpretability and no performance guarantee on downstream semantic tasks. We instead propose a novel generative self-supervised learning objective based on phrase reconstruction. To overcome the drawbacks of previous generative methods, we carefully model intra-sentence structure by breaking down one sentence into pieces of important phrases. Empirical studies show that our generative learning achieves powerful enough performance improvement and outperforms the current state-of-the-art contrastive methods not only on the STS benchmarks, but also on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chengzhipanpan/paser
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications