The Daunting Dilemma with Sentence Encoders: Success on Standard Benchmarks, Failure in Capturing Basic Semantic Properties
Yash Mahajan, Naman Bansal, Shubhra Kanti Karmaker ("Santu")

TL;DR
This paper critically examines popular sentence encoders, revealing that while they excel on standard benchmarks, they fail to capture fundamental semantic properties, highlighting a significant challenge in NLP.
Contribution
The study introduces new semantic evaluation criteria and provides a comprehensive comparison of five popular sentence encoders, exposing their limitations in capturing basic semantics.
Findings
Sentence-BERT and USE pass paraphrasing tests, with SBERT performing better.
LASER excels in synonym replacement tasks.
All encoders fail in antonym replacement and sentence jumbling tests.
Abstract
In this paper, we adopted a retrospective approach to examine and compare five existing popular sentence encoders, i.e., Sentence-BERT, Universal Sentence Encoder (USE), LASER, InferSent, and Doc2vec, in terms of their performance on downstream tasks versus their capability to capture basic semantic properties. Initially, we evaluated all five sentence encoders on the popular SentEval benchmark and found that multiple sentence encoders perform quite well on a variety of popular downstream tasks. However, being unable to find a single winner in all cases, we designed further experiments to gain a deeper understanding of their behavior. Specifically, we proposed four semantic evaluation criteria, i.e., Paraphrasing, Synonym Replacement, Antonym Replacement, and Sentence Jumbling, and evaluated the same five sentence encoders using these criteria. We found that the Sentence-Bert and USE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling
MethodsSentence-BERT · Multilingual Universal Sentence Encoder
