Learning from Incomplete Features by Simultaneous Training of Neural Networks and Sparse Coding
Cesar F. Caiafa, Ziyao Wang, Jordi Sol\'e-Casals, Qibin Zhao

TL;DR
This paper introduces a novel supervised learning approach that trains classifiers directly on incomplete feature data by leveraging sparse representations, avoiding traditional imputation methods.
Contribution
The authors propose a new method combining neural networks and sparse coding to effectively classify incomplete data without imputing missing features.
Findings
The method outperforms traditional imputation-based approaches.
Theoretical conditions guarantee correct classification of original data.
Validated on synthetic and real datasets.
Abstract
In this paper, the problem of training a classifier on a dataset with incomplete features is addressed. We assume that different subsets of features (random or structured) are available at each data instance. This situation typically occurs in the applications when not all the features are collected for every data sample. A new supervised learning method is developed to train a general classifier, such as a logistic regression or a deep neural network, using only a subset of features per sample, while assuming sparse representations of data vectors on an unknown dictionary. Sufficient conditions are identified, such that, if it is possible to train a classifier on incomplete observations so that their reconstructions are well separated by a hyperplane, then the same classifier also correctly separates the original (unobserved) data samples. Extensive simulation results on synthetic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
MethodsLogistic Regression
