Whitening Sentence Representations for Better Semantics and Faster Retrieval
Jianlin Su, Jiarun Cao, Weijie Liu, Yangyiwen Ou

TL;DR
This paper introduces a whitening technique for sentence representations derived from pre-trained models like BERT, which improves semantic quality, enhances isotropy, reduces storage, and speeds up retrieval in NLP tasks.
Contribution
The paper demonstrates that applying whitening to sentence embeddings improves isotropy, reduces dimensionality, and enhances retrieval efficiency, offering a simple yet effective alternative to flow-based methods.
Findings
Whitening improves the isotropy of sentence representations.
Whitening reduces storage costs and accelerates retrieval.
Competitive performance achieved with the whitening technique.
Abstract
Pre-training models such as BERT have achieved great success in many natural language processing tasks. However, how to obtain better sentence representation through these pre-training models is still worthy to exploit. Previous work has shown that the anisotropy problem is an critical bottleneck for BERT-based sentence representation which hinders the model to fully utilize the underlying semantic features. Therefore, some attempts of boosting the isotropy of sentence distribution, such as flow-based model, have been applied to sentence representations and achieved some improvement. In this paper, we find that the whitening operation in traditional machine learning can similarly enhance the isotropy of sentence representations and achieve competitive results. Furthermore, the whitening technique is also capable of reducing the dimensionality of the sentence representation. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsLinear Layer · Weight Decay · WordPiece · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam · Softmax · Dense Connections · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout
