SenTest: Evaluating Robustness of Sentence Encoders

Tanmay Chavan; Shantanu Patankar; Aditya Kane; Omkar Gokhale,; Geetanjali Kale; Raviraj Joshi

arXiv:2311.17722·cs.CL·November 30, 2023·1 cites

SenTest: Evaluating Robustness of Sentence Encoders

Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale,, Geetanjali Kale, Raviraj Joshi

PDF

Open Access

TL;DR

This paper evaluates the robustness of sentence transformers against various adversarial attacks, revealing significant vulnerabilities and highlighting that current models do not effectively utilize semantic and syntactic information.

Contribution

It introduces a comprehensive adversarial evaluation framework for sentence encoders, demonstrating their fragility and analyzing their ability to capture sentence structure.

Findings

01

Models' accuracy drops up to 15% on perturbed data

02

Sentence embeddings capture semantic and syntactic structure

03

Supervised classifiers act as n-gram detectors

Abstract

Contrastive learning has proven to be an effective method for pre-training models using weakly labeled data in the vision domain. Sentence transformers are the NLP counterparts to this architecture, and have been growing in popularity due to their rich and effective sentence representations. Having effective sentence representations is paramount in multiple tasks, such as information retrieval, retrieval augmented generation (RAG), and sentence comparison. Keeping in mind the deployability factor of transformers, evaluating the robustness of sentence transformers is of utmost importance. This work focuses on evaluating the robustness of the sentence encoders. We employ several adversarial attacks to evaluate its robustness. This system uses character-level attacks in the form of random character substitution, word-level attacks in the form of synonym replacement, and sentence-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification