Investigating the Working of Text Classifiers
Devendra Singh Sachan, Manzil Zaheer, Ruslan Salakhutdinov

TL;DR
This paper investigates whether neural text classifiers truly learn sentence composition or rely on keywords, revealing that models often depend on lexicons and that regularization can improve their generalization.
Contribution
The study introduces datasets designed to test compositional understanding and proposes regularization techniques that reduce lexicon dependence, enhancing classifier robustness.
Findings
Models perform poorly when lexicons are absent in test data.
Regularization techniques improve accuracy on lexicon-independent datasets.
Neural classifiers often rely on keywords rather than compositional meaning.
Abstract
Text classification is one of the most widely studied tasks in natural language processing. Motivated by the principle of compositionality, large multilayer neural network models have been employed for this task in an attempt to effectively utilize the constituent expressions. Almost all of the reported work train large networks using discriminative approaches, which come with a caveat of no proper capacity control, as they tend to latch on to any signal that may not generalize. Using various recent state-of-the-art approaches for text classification, we explore whether these models actually learn to compose the meaning of the sentences or still just focus on some keywords or lexicons for classifying the document. To test our hypothesis, we carefully construct datasets where the training and test splits have no direct overlap of such lexicons, but overall language structure would be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text and Document Classification Technologies · Natural Language Processing Techniques
