Composition-contrastive Learning for Sentence Embeddings

Sachin J. Chanchani; Ruihong Huang

arXiv:2307.07380·cs.CL·July 17, 2023

Composition-contrastive Learning for Sentence Embeddings

Sachin J. Chanchani, Ruihong Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel contrastive learning method for sentence embeddings that aligns texts with their phrasal compositions, improving semantic similarity performance without extra training costs.

Contribution

It proposes a new composition-contrastive learning approach that enhances sentence representations by leveraging phrasal structures, outperforming baselines without additional parameters or auxiliary objectives.

Findings

01

Improved semantic textual similarity scores

02

Comparable performance to state-of-the-art methods

03

No extra training costs or network complexity

Abstract

Vector representations of natural language are ubiquitous in search applications. Recently, various methods based on contrastive learning have been proposed to learn textual representations from unlabelled data; by maximizing alignment between minimally-perturbed embeddings of the same text, and encouraging a uniform distribution of embeddings across a broader corpus. Differently, we propose maximizing alignment between texts and a composition of their phrasal constituents. We consider several realizations of this objective and elaborate the impact on representations in each case. Experimental results on semantic textual similarity tasks show improvements over baselines that are comparable with state-of-the-art approaches. Moreover, this work is the first to do so without incurring costs in auxiliary training objectives or additional network parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

perceptiveshawty/compcse
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining

MethodsContrastive Learning