Optimized Linear Imputation

Yehezkel S. Resheff; Daphna Weinshall

arXiv:1511.05309·stat.ML·December 8, 2016·ICPRAM

Optimized Linear Imputation

Yehezkel S. Resheff, Daphna Weinshall

PDF

Open Access

TL;DR

This paper introduces a new linear imputation method formulated as an optimization problem with guaranteed convergence, outperforming IRMI and other methods in handling missing data in high-dimensional datasets.

Contribution

The paper presents a convergent, optimization-based linear imputation method that improves upon IRMI's iterative approach for missing data imputation.

Findings

01

Method guarantees convergence to a local minimum.

02

Performance is superior to IRMI in non-converging cases.

03

Results are comparable to IRMI when IRMI converges.

Abstract

Often in real-world datasets, especially in high dimensional data, some feature values are missing. Since most data analysis and statistical methods do not handle gracefully missing values, the first step in the analysis requires the imputation of missing values. Indeed, there has been a long standing interest in methods for the imputation of missing values as a pre-processing step. One recent and effective approach, the IRMI stepwise regression imputation method, uses a linear regression model for each real-valued feature on the basis of all other features in the dataset. However, the proposed iterative formulation lacks convergence guarantee. Here we propose a closely related method, stated as a single optimization problem and a block coordinate-descent solution which is guaranteed to converge to a local minimum. Experiments show results on both synthetic and benchmark datasets, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Gaussian Processes and Bayesian Inference · Face and Expression Recognition

MethodsLinear Regression