A primer on linear classification with missing data
Angel D Reyero Lobo (UT3), Alexis Ayme (LPSM (UMR\_8001)), Claire, Boyer (LMO), Erwan Scornet (LPSM (UMR\_8001))

TL;DR
This paper analyzes how classical linear classifiers perform with missing data under different strategies, revealing limitations of imputation and pattern-by-pattern methods, and highlighting the effectiveness of P-b-P LDA.
Contribution
The paper provides a theoretical analysis of linear classifiers with missing data, showing limitations of common strategies and proposing bounds for P-b-P LDA performance.
Findings
Imputation and P-b-P approaches are ill-specified for logistic regression.
P-b-P LDA methods are most effective for missing data classification.
Finite-sample bounds are provided for P-b-P LDA in high-dimensional and MNAR settings.
Abstract
Supervised learning with missing data aims at building the best prediction of a target output based on partially-observed inputs. Major approaches to address this problem can be decomposed into impute-then-predict strategies, which first fill in the empty input components and then apply a unique predictor and Pattern-by-Pattern (P-b-P) approaches, where a predictor is built on each missing pattern. In this paper, we theoretically analyze how three classical linear classifiers, namely perceptron, logistic regression and linear discriminant analysis (LDA), behave with Missing Completely At Random (MCAR) data, depending on the strategy (imputation or P-b-P) used to handle missing values. We prove that both imputation and P-b-P approaches are ill-specified in a logistic regression framework, thus questioning the relevance of such approaches to handle missing data. The most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Statistical Methods and Inference
