CrowdMI: Multiple Imputation via Crowdsourcing

Lovedeep Gondara

arXiv:1612.02707·cs.LG·February 26, 2018

CrowdMI: Multiple Imputation via Crowdsourcing

Lovedeep Gondara

PDF

Open Access

TL;DR

This paper introduces CrowdMI, a crowdsourcing-based method for imputing missing data by converting data into surveys, demonstrating comparable accuracy to traditional statistical models for both qualitative and quantitative data.

Contribution

The paper proposes a novel crowdsourcing approach for data imputation that replicates multiple imputation frameworks, offering an alternative to complex statistical models.

Findings

01

CrowdMI produces valid imputations for qualitative data.

02

CrowdMI achieves results comparable to statistical models.

03

The method is effective for both qualitative and quantitative missing data.

Abstract

Can humans impute missing data with similar proficiency as machines? This is the question we aim to answer in this paper. We present a novel idea of converting observations with missing data in to a survey questionnaire, which is presented to crowdworkers for completion. We replicate a multiple imputation framework by having multiple unique crowdworkers complete our questionnaire. Experimental results demonstrate that using our method, it is possible to generate valid imputations for qualitative and quantitative missing data, with results comparable to imputations generated by complex statistical models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Privacy-Preserving Technologies in Data · Human Mobility and Location-Based Analysis