Blockwise Principal Component Analysis for monotone missing data imputation and dimensionality reduction
Tu T. Do, Mai Anh Vu, Tuan L. Vo, Hoang Thien Ly, Thu Nguyen, Steven, A. Hicks, Michael A. Riegler, P{\aa}l Halvorsen, and Binh T. Nguyen

TL;DR
This paper introduces a Blockwise PCA Imputation framework that efficiently reduces dimensionality and imputes monotone missing data, significantly speeding up the process and improving convergence with certain imputation techniques.
Contribution
The proposed BPI framework enables faster dimensionality reduction and imputation for monotone missing data by applying PCA blockwise before imputation, adaptable to various imputation methods.
Findings
BPI significantly reduces imputation time.
BPI with MICE can lead to convergence issues in direct imputation.
Experiments validate BPI's efficiency and effectiveness.
Abstract
Monotone missing data is a common problem in data analysis. However, imputation combined with dimensionality reduction can be computationally expensive, especially with the increasing size of datasets. To address this issue, we propose a Blockwise principal component analysis Imputation (BPI) framework for dimensionality reduction and imputation of monotone missing data. The framework conducts Principal Component Analysis (PCA) on the observed part of each monotone block of the data and then imputes on merging the obtained principal components using a chosen imputation technique. BPI can work with various imputation techniques and can significantly reduce imputation time compared to conducting dimensionality reduction after imputation. This makes it a practical and efficient approach for large datasets with monotone missing data. Our experiments validate the improvement in speed. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Neuroimaging Techniques and Applications · Dementia and Cognitive Impairment Research
