Debiasing Stochastic Gradient Descent to handle missing values

Julie Josse (CMAP); Aude Sportisse (LPSM (UMR\_8001)); Claire Boyer; (LPSM UMR 8001; DMA); Aymeric Dieuleveut (CMAP)

arXiv:2002.09338·math.ST·June 9, 2020·1 cites

Debiasing Stochastic Gradient Descent to handle missing values

Julie Josse (CMAP), Aude Sportisse (LPSM (UMR\_8001)), Claire Boyer, (LPSM UMR 8001, DMA), Aymeric Dieuleveut (CMAP)

PDF

Open Access

TL;DR

This paper introduces a new averaged stochastic gradient algorithm that effectively handles missing data in linear models, achieving optimal convergence rates without requiring data distribution assumptions, applicable to large-scale and real-world datasets.

Contribution

The paper presents a novel stochastic gradient method that manages missing values without distribution modeling, maintaining optimal convergence rates in both streaming and finite-sample scenarios.

Findings

01

Achieves convergence rate of O(1/n) despite missing data

02

Effective on both synthetic and real medical datasets

03

Does not require prior data distribution assumptions

Abstract

Stochastic gradient algorithm is a key ingredient of many machine learning methods, particularly appropriate for large-scale learning.However, a major caveat of large data is their incompleteness.We propose an averaged stochastic gradient algorithm handling missing values in linear models. This approach has the merit to be free from the need of any data distribution modeling and to account for heterogeneous missing proportion.In both streaming and finite-sample settings, we prove that this algorithm achieves convergence rate of $O (\frac{1}{n})$ at the iteration $n$ , the same as without missing values. We show the convergence behavior and the relevance of the algorithm not only on synthetic data but also on real data sets, including those collected from medical register.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Statistical Methods and Inference