ClassiNet -- Predicting Missing Features for Short-Text Classification

Danushka Bollegala; Vincent Atanasov; Takanori Maehara; Ken-ichi Kawarabayashi

arXiv:1804.05260·cs.CL·April 17, 2018

TL;DR

ClassiNet is a network of classifiers that predicts missing features in short texts, addressing feature sparseness and improving classification accuracy without external resources.

Contribution

The paper introduces ClassiNet, a new method that models implicit feature co-occurrences and predicts missing features to enhance short-text classification.

Findings

01. Significant accuracy improvements on benchmark datasets.

02. Effective feature prediction without external resources.

03. Generalizes word co-occurrence graphs through implicit feature modeling.

Abstract

The fundamental problem in short-text classification is feature sparseness: the lack of feature overlap between a trained model and a test instance to be classified. We propose ClassiNet, a network of classifiers trained to predict missing features in a given instance, to overcome the feature sparseness problem. Using a set of unlabeled training instances, we first learn binary classifiers as feature predictors that predict whether a particular feature occurs in a given instance. Next, each feature predictor is represented as a vertex $v_{i}$ in the ClassiNet, with a one-to-one correspondence between feature predictors and vertices. The weight of the directed edge $e_{ij}$ connecting a vertex $v_{i}$ to a vertex $v_{j}$ represents the conditional probability that, given the feature for $v_{i}$ occurs in an instance, the feature for $v_{j}$ also occurs in the same instance. We show that ClassiNets…
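The abstract defines an edge weight as a conditional probability, but this excerpt does not say how that probability is estimated. As a rough illustration only, the sketch below assumes the simplest possible estimate, raw co-occurrence counts over unlabeled instances with whitespace tokens as features. This corresponds to the explicit word co-occurrence graph that ClassiNet is said to generalize, not to the learned feature predictors themselves. The function name `cooccurrence_edge_weights` and the toy instances are hypothetical.

```python
from collections import Counter
from itertools import permutations


def cooccurrence_edge_weights(instances):
    """Estimate w[(i, j)] ~ P(j occurs | i occurs) from raw counts.

    Assumption: features are lowercase whitespace tokens, and the
    probability is a plain relative frequency. The paper's learned
    predictors are not reproduced here.
    """
    feature_count = Counter()  # number of instances containing feature i
    pair_count = Counter()     # number of instances containing both i and j
    for text in instances:
        features = set(text.lower().split())
        feature_count.update(features)
        # Ordered pairs, so each (i, j) is a directed edge i -> j.
        pair_count.update(permutations(sorted(features), 2))
    return {(i, j): n / feature_count[i] for (i, j), n in pair_count.items()}


# Toy unlabeled instances (hypothetical).
unlabeled = [
    "cheap flights to tokyo",
    "cheap hotels in tokyo",
    "tokyo weather today",
    "cheap flights to paris",
]

weights = cooccurrence_edge_weights(unlabeled)
print(weights[("flights", "cheap")])  # every "flights" instance also has "cheap"
print(weights[("cheap", "flights")])  # only some "cheap" instances have "flights"
```

The two printed weights differ, which is why the edges are directed: knowing that "flights" occurs says more about "cheap" than the reverse. Pairs that never co-occur, such as "paris" and "tokyo" here, get no edge at all under this count-based estimate.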
