Identification and Estimation for Nonignorable Missing Data: A Data   Fusion Approach

Zixiao Wang; AmirEmad Ghassami; Ilya Shpitser

arXiv:2311.09015·stat.ME·February 29, 2024·1 cites

Identification and Estimation for Nonignorable Missing Data: A Data Fusion Approach

Zixiao Wang, AmirEmad Ghassami, Ilya Shpitser

PDF

Open Access

TL;DR

This paper introduces a data fusion method to identify and estimate parameters in nonignorable missing data scenarios by combining MNAR and MAR datasets, enabling identification under certain assumptions.

Contribution

It proposes a novel data fusion approach that allows identification and estimation of parameters in MNAR data using auxiliary MAR data, under new assumptions.

Findings

01

The IPW estimator performs well in simulations.

02

Data fusion enables identification where individual datasets cannot.

03

Application demonstrates practical utility of the method.

Abstract

We consider the task of identifying and estimating a parameter of interest in settings where data is missing not at random (MNAR). In general, such parameters are not identified without strong assumptions on the missing data model. In this paper, we take an alternative approach and introduce a method inspired by data fusion, where information in an MNAR dataset is augmented by information in an auxiliary dataset subject to missingness at random (MAR). We show that even if the parameter of interest cannot be identified given either dataset alone, it can be identified given pooled data, under two complementary sets of assumptions. We derive an inverse probability weighted (IPW) estimator for identified parameters, and evaluate the performance of our estimation strategies via simulation studies, and a data application.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTarget Tracking and Data Fusion in Sensor Networks · Distributed Sensor Networks and Detection Algorithms · Bayesian Modeling and Causal Inference