The Daunting Dilemma with Sentence Encoders: Success on Standard   Benchmarks, Failure in Capturing Basic Semantic Properties

Yash Mahajan; Naman Bansal; Shubhra Kanti Karmaker ("Santu")

arXiv:2309.03747·cs.CL·September 8, 2023·1 cites

The Daunting Dilemma with Sentence Encoders: Success on Standard Benchmarks, Failure in Capturing Basic Semantic Properties

Yash Mahajan, Naman Bansal, Shubhra Kanti Karmaker ("Santu")

PDF

Open Access

TL;DR

This paper critically examines popular sentence encoders, revealing that while they excel on standard benchmarks, they fail to capture fundamental semantic properties, highlighting a significant challenge in NLP.

Contribution

The study introduces new semantic evaluation criteria and provides a comprehensive comparison of five popular sentence encoders, exposing their limitations in capturing basic semantics.

Findings

01

Sentence-BERT and USE pass paraphrasing tests, with SBERT performing better.

02

LASER excels in synonym replacement tasks.

03

All encoders fail in antonym replacement and sentence jumbling tests.

Abstract

In this paper, we adopted a retrospective approach to examine and compare five existing popular sentence encoders, i.e., Sentence-BERT, Universal Sentence Encoder (USE), LASER, InferSent, and Doc2vec, in terms of their performance on downstream tasks versus their capability to capture basic semantic properties. Initially, we evaluated all five sentence encoders on the popular SentEval benchmark and found that multiple sentence encoders perform quite well on a variety of popular downstream tasks. However, being unable to find a single winner in all cases, we designed further experiments to gain a deeper understanding of their behavior. Specifically, we proposed four semantic evaluation criteria, i.e., Paraphrasing, Synonym Replacement, Antonym Replacement, and Sentence Jumbling, and evaluated the same five sentence encoders using these criteria. We found that the Sentence-Bert and USE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling

MethodsSentence-BERT · Multilingual Universal Sentence Encoder