DefSent+: Improving sentence embeddings of language models by projecting definition sentences into a quasi-isotropic or isotropic vector space of unlimited dictionary entries
Xiaodong Liu

TL;DR
DefSent+ enhances sentence embeddings by projecting definition sentences into a quasi-isotropic vector space, overcoming previous limitations and significantly improving performance on sentence similarity tasks and downstream NLP applications.
Contribution
The paper introduces DefSent+, a novel method that builds unlimited dictionary entry embeddings, enabling better sentence representations and surpassing prior approaches like DefSent.
Findings
Significantly improved sentence similarity measurement performance.
Achieves state-of-the-art results with data-augmented models.
Competitive in downstream NLP transfer tasks.
Abstract
This paper presents a significant improvement on the previous conference paper known as DefSent. The prior study seeks to improve sentence embeddings of language models by projecting definition sentences into the vector space of dictionary entries. We discover that this approach is not fully explored due to the methodological limitation of using word embeddings of language models to represent dictionary entries. This leads to two hindrances. First, dictionary entries are constrained by the single-word vocabulary, and thus cannot be fully exploited. Second, semantic representations of language models are known to be anisotropic, but pre-processing word embeddings for DefSent is not allowed because its weight is frozen during training and tied to the prediction layer. In this paper, we propose a novel method to progressively build entry embeddings not subject to the limitations. As a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗RyuKT/DefSentPlus-bert-base-uncasedmodel· 1 dl1 dl
- 🤗RyuKT/DefSentPlus-simcse-bert-base-uncasedmodel· 1 dl1 dl
- 🤗RyuKT/DefSentPlus-sncse-bert-base-uncasedmodel
- 🤗RyuKT/DefSentPlus-bert-large-uncasedmodel
- 🤗RyuKT/DefSentPlus-simcse-bert-large-uncasedmodel· 2 dl2 dl
- 🤗RyuKT/DefSentPlus-sncse-bert-large-uncasedmodel
- 🤗RyuKT/DefSentPlus-roberta-basemodel
- 🤗RyuKT/DefSentPlus-simcse-roberta-basemodel
- 🤗RyuKT/DefSentPlus-sncse-roberta-basemodel
- 🤗RyuKT/DefSentPlus-syncse-partial-roberta-basemodel· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsSimCSE
