Ranking-Enhanced Unsupervised Sentence Representation Learning
Yeon Seonwoo, Guoyin Wang, Changmin Seo, Sajal Choudhary, Jiwei Li,, Xiang Li, Puyang Xu, Sunghyun Park, Alice Oh

TL;DR
This paper introduces RankEncoder, an unsupervised sentence encoder that improves semantic representations by incorporating relationships with nearest-neighbor sentences, leading to better performance on semantic benchmarks.
Contribution
The paper proposes a novel ranking-based approach, RankEncoder, which leverages external corpus relationships to enhance unsupervised sentence embeddings.
Findings
Achieves 80.07% Spearman's correlation, surpassing previous state-of-the-art.
Universally applicable to existing unsupervised methods.
Effective in predicting similarity scores of similar sentence pairs.
Abstract
Unsupervised sentence representation learning has progressed through contrastive learning and data augmentation methods such as dropout masking. Despite this progress, sentence encoders are still limited to using only an input sentence when predicting its semantic vector. In this work, we show that the semantic meaning of a sentence is also determined by nearest-neighbor sentences that are similar to the input sentence. Based on this finding, we propose a novel unsupervised sentence encoder, RankEncoder. RankEncoder predicts the semantic vector of an input sentence by leveraging its relationship with other sentences in an external corpus, as well as the input sentence itself. We evaluate RankEncoder on semantic textual benchmark datasets. From the experimental results, we verify that 1) RankEncoder achieves 80.07% Spearman's correlation, a 1.1% absolute improvement compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsContrastive Learning · Dropout
