Classification with Low Rank and Missing Data

Elad Hazan; Roi Livni; Yishay Mansour

arXiv:1501.03273·cs.LG·January 15, 2015·19 cites

Classification with Low Rank and Missing Data

Elad Hazan, Roi Livni, Yishay Mansour

PDF

Open Access

TL;DR

This paper introduces an efficient algorithm for classification with missing data, assuming data lies in a low-rank subspace, achieving performance comparable to classifiers with complete data.

Contribution

It presents a novel non-proper formulation and an agnostic algorithm that effectively classifies missing data by leveraging low-rank assumptions.

Findings

01

Algorithm classifies as well as the best linear classifier with full data

02

Works with both linear and kernel methods

03

Handles missing data efficiently

Abstract

We consider classification and regression tasks where we have missing data and assume that the (clean) data resides in a low rank subspace. Finding a hidden subspace is known to be computationally hard. Nevertheless, using a non-proper formulation we give an efficient agnostic algorithm that classifies as good as the best linear classifier coupled with the best low-dimensional subspace in which the data resides. A direct implication is that our algorithm can linearly (and non-linearly through kernels) classify provably as well as the best classifier that has access to the full data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Face and Expression Recognition