Handling Missing Data with Graph Representation Learning

Jiaxuan You; Xiaobai Ma; Daisy Yi Ding; Mykel Kochenderfer; Jure; Leskovec

arXiv:2010.16418·cs.LG·November 2, 2020·94 cites

Handling Missing Data with Graph Representation Learning

Jiaxuan You, Xiaobai Ma, Daisy Yi Ding, Mykel Kochenderfer, Jure, Leskovec

PDF

Open Access 1 Repo 1 Video

TL;DR

GRAPE introduces a graph neural network framework for missing data handling, enabling effective feature imputation and label prediction by modeling data as a bipartite graph, outperforming existing methods on benchmark datasets.

Contribution

This paper presents GRAPE, a novel graph-based framework that unifies feature imputation and label prediction for missing data using bipartite graph modeling and GNNs.

Findings

01

20% lower mean absolute error for imputation

02

10% lower error for label prediction

03

Outperforms state-of-the-art methods on nine datasets

Abstract

Machine learning with missing data has been approached in two different ways, including feature imputation where missing feature values are estimated based on observed values, and label prediction where downstream labels are learned directly from incomplete data. However, existing imputation models tend to have strong prior assumptions and cannot learn from downstream tasks, while models targeting label prediction often involve heuristics and can encounter scalability issues. Here we propose GRAPE, a graph-based framework for feature imputation as well as label prediction. GRAPE tackles the missing data problem using a graph representation, where the observations and features are viewed as two types of nodes in a bipartite graph, and the observed feature values as edges. Under the GRAPE framework, the feature imputation is formulated as an edge-level prediction task and the label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maxiaoba/GRAPE
pytorchOfficial

Videos

Handling Missing Data with Graph Representation Learning· slideslive

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Recommender Systems and Techniques