Sequential Learning of Convolutional Features for Effective Text Classification
Avinash Madasu, Vijjini Anvesh Rao

TL;DR
This paper critically examines CNN components for text classification, introduces the SCARN model combining recurrent and convolutional features, and demonstrates its superior performance across multiple NLP tasks with fewer parameters.
Contribution
The paper presents SCARN, a novel model that effectively integrates recurrent and convolutional structures, addressing limitations of existing CNNs for text classification.
Findings
SCARN outperforms existing recurrent convolutional models.
SCARN achieves better results than large CNN and LSTM architectures.
SCARN requires significantly fewer parameters.
Abstract
Text classification has been one of the major problems in natural language processing. With the advent of deep learning, convolutional neural network (CNN) has been a popular solution to this task. However, CNNs which were first proposed for images, face many crucial challenges in the context of text processing, namely in their elementary blocks: convolution filters and max pooling. These challenges have largely been overlooked by the most existing CNN models proposed for text classification. In this paper, we present an experimental study on the fundamental blocks of CNNs in text categorization. Based on this critique, we propose Sequential Convolutional Attentive Recurrent Network (SCARN). The proposed SCARN model utilizes both the advantages of recurrent and convolutional structures efficiently in comparison to previously proposed recurrent convolutional models. We test our model on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Text and Document Classification Technologies
MethodsSigmoid Activation · Tanh Activation · Convolution · Long Short-Term Memory
