ShufText: A Simple Black Box Approach to Evaluate the Fragility of Text   Classification Models

Rutuja Taware; Shraddha Varat; Gaurav Salunke; Chaitanya Gawande,; Geetanjali Kale; Rahul Khengare; Raviraj Joshi

arXiv:2102.00238·cs.CL·February 4, 2022

ShufText: A Simple Black Box Approach to Evaluate the Fragility of Text Classification Models

Rutuja Taware, Shraddha Varat, Gaurav Salunke, Chaitanya Gawande,, Geetanjali Kale, Rahul Khengare, Raviraj Joshi

PDF

TL;DR

This paper introduces ShufText, a black box method to evaluate text classification models' reliance on keywords by shuffling words and assessing accuracy, revealing many models' limited semantic understanding.

Contribution

The paper presents ShufText, a simple technique to expose over-reliance on keywords in text classifiers, highlighting their limited semantic comprehension and robustness issues.

Findings

01

Models often maintain accuracy after word shuffling, indicating reliance on keywords.

02

Pretraining with language models does not significantly improve semantic understanding.

03

Both simple and complex models show limited robustness to out-of-domain sentences.

Abstract

Text classification is the most basic natural language processing task. It has a wide range of applications ranging from sentiment analysis to topic classification. Recently, deep learning approaches based on CNN, LSTM, and Transformers have been the de facto approach for text classification. In this work, we highlight a common issue associated with these approaches. We show that these systems are over-reliant on the important words present in the text that are useful for classification. With limited training data and discriminative training strategy, these approaches tend to ignore the semantic meaning of the sentence and rather just focus on keywords or important n-grams. We propose a simple black box technique ShutText to present the shortcomings of the model and identify the over-reliance of the model on keywords. This involves randomly shuffling the words in a sentence and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Softmax · Attention Is All You Need · Dense Connections · Residual Connection · WordPiece · Attention Dropout · Adam · Linear Warmup With Linear Decay