ClassiNet -- Predicting Missing Features for Short-Text Classification
Danushka Bollegala, Vincent Atanasov, Takanori Maehara, Ken-ichi, Kawarabayashi

TL;DR
ClassiNet is a novel network of classifiers designed to predict missing features in short texts, effectively addressing feature sparseness and improving classification accuracy without external resources.
Contribution
The paper introduces ClassiNet, a new method that models implicit feature co-occurrences and predicts missing features to enhance short-text classification.
Findings
Significant accuracy improvements on benchmark datasets.
Effective feature prediction without external resources.
Generalizes word co-occurrence graphs through implicit feature modeling.
Abstract
The fundamental problem in short-text classification is \emph{feature sparseness} -- the lack of feature overlap between a trained model and a test instance to be classified. We propose \emph{ClassiNet} -- a network of classifiers trained for predicting missing features in a given instance, to overcome the feature sparseness problem. Using a set of unlabeled training instances, we first learn binary classifiers as feature predictors for predicting whether a particular feature occurs in a given instance. Next, each feature predictor is represented as a vertex in the ClassiNet where a one-to-one correspondence exists between feature predictors and vertices. The weight of the directed edge connecting a vertex to a vertex represents the conditional probability that given exists in an instance, also exists in the same instance. We show that ClassiNets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
