A multi-model-based deep learning framework for short text multiclass classification with the imbalanced and extremely small data set
Jiajun Tong, Zhixiao Wang, Xiaobin Rui

TL;DR
This paper introduces a lightweight, multi-model deep learning framework combining DISTILBERT and bidirectional LSTM for effective short text classification on extremely small, imbalanced datasets, suitable for mobile deployment.
Contribution
It proposes a novel multi-model deep learning framework that effectively handles small, imbalanced datasets with a compressed model suitable for mobile devices.
Findings
Achieves state-of-the-art performance on benchmark datasets.
Retains high precision, recall, accuracy, and F1 score.
Offers faster training and smaller model size for mobile deployment.
Abstract
Text classification plays an important role in many practical applications. In the real world, there are extremely small datasets. Most existing methods adopt pre-trained neural network models to handle this kind of dataset. However, these methods are either difficult to deploy on mobile devices because of their large output size or cannot fully extract the deep semantic information between phrases and clauses. This paper proposes a multimodel-based deep learning framework for short-text multiclass classification with an imbalanced and extremely small data set. Our framework mainly includes five layers: The encoder layer uses DISTILBERT to obtain context-sensitive dynamic word vectors that are difficult to represent in traditional feature engineering methods. Since the transformer part of this layer is distilled, our framework is compressed. Then, we use the next two layers to extract…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Sentiment Analysis and Opinion Mining · Spam and Phishing Detection
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Adam · Attention Dropout · Layer Normalization · Linear Warmup With Linear Decay · Dropout
