Character-Based Text Classification using Top Down Semantic Model for Sentence Representation
Zhenzhou Wu, Xin Zheng, Daniel Dahlmeier

TL;DR
This paper introduces TDSM, a hybrid deep learning model that combines word-level and sentence-level semantics for improved character-based text classification, outperforming existing models on multiple datasets.
Contribution
The paper proposes TDSM, a novel model integrating attention-based word semantics and BiLSTM sentence semantics, achieving superior performance with fewer parameters.
Findings
TDSM outperforms all compared CNN models across seven datasets.
TDSM uses only 1 extbackslash% of parameters of previous models.
TDSM surpasses traditional TF-IDF linear models on small datasets.
Abstract
Despite the success of deep learning on many fronts especially image and speech, its application in text classification often is still not as good as a simple linear SVM on n-gram TF-IDF representation especially for smaller datasets. Deep learning tends to emphasize on sentence level semantics when learning a representation with models like recurrent neural network or recursive neural network, however from the success of TF-IDF representation, it seems a bag-of-words type of representation has its strength. Taking advantage of both representions, we present a model known as TDSM (Top Down Semantic Model) for extracting a sentence representation that considers both the word-level semantics by linearly combining the words with attention weights and the sentence-level semantics with BiLSTM and use it on text classification. We apply the model on characters and our results show that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsSigmoid Activation · Tanh Activation · Support Vector Machine · Long Short-Term Memory · Bidirectional LSTM
