An Unsupervised Sentence Embedding Method by Mutual Information Maximization
Yan Zhang, Ruidan He, Zuozhu Liu, Kwan Hui Lim, Lidong Bing

TL;DR
This paper introduces an unsupervised sentence embedding method based on mutual information maximization, enabling effective semantic representations without labeled data, outperforming existing unsupervised models and rivaling supervised approaches.
Contribution
The paper presents a novel self-supervised learning approach that extends BERT for unsupervised sentence embedding using mutual information maximization, applicable across various domains.
Findings
Outperforms other unsupervised sentence embedding methods on STS tasks
Outperforms SBERT when labeled data is unavailable
Achieves competitive results with supervised methods
Abstract
BERT is inefficient for sentence-pair tasks such as clustering or semantic search as it needs to evaluate combinatorially many sentence pairs which is very time-consuming. Sentence BERT (SBERT) attempted to solve this challenge by learning semantically meaningful representations of single sentences, such that similarity comparison can be easily accessed. However, SBERT is trained on corpus with high-quality labeled sentence pairs, which limits its application to tasks where labeled data is extremely scarce. In this paper, we propose a lightweight extension on top of BERT and a novel self-supervised learning objective based on mutual information maximization strategies to derive meaningful sentence embeddings in an unsupervised manner. Unlike SBERT, our method is not restricted by the availability of labeled data, such that it can be applied on different domain-specific corpus.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
MethodsSentence-BERT · Linear Layer · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Dropout · Linear Warmup With Linear Decay · Layer Normalization · Attention Dropout · WordPiece
