Discordance Minimization-based Imputation Algorithms for Missing Values in Rating Data

Young Woong Park; Jinhak Kim; Dan Zhu

arXiv:2311.04035 · stat.ML · December 5, 2023 · 2 cites

Open Access

TL;DR

This paper introduces novel imputation algorithms that minimize rating discordance to accurately fill missing values in combined rating datasets, outperforming existing methods.

Contribution

The paper develops optimization models based on rating discordance minimization, tailored to specific data structures, and demonstrates higher imputation accuracy than current methods.

Findings

01. Proposed algorithms outperform state-of-the-art imputation methods.

02. Algorithms effectively handle various real-world missing data patterns.

03. Imputation accuracy is validated through experiments on real and synthetic datasets.

Abstract

Ratings are frequently used to evaluate and compare subjects in various applications, from education to healthcare, because they provide succinct yet credible measures for comparing subjects. However, when multiple rating lists are combined or considered together, subjects often have missing ratings, because most rating lists do not rate every subject in the combined list. In this study, we analyze missing value patterns using six real-world data sets from various applications, as well as the conditions under which imputation algorithms are applicable. Based on the special structures and properties derived from the analyses, we propose optimization models and algorithms that minimize the total rating discordance across rating providers to impute missing ratings in the combined rating lists, using only the known rating information. The total rating discordance is defined as the…
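
The setting described in the abstract can be illustrated with a small sketch. It shows how combining rating lists creates missing entries, and how missing ratings might be chosen to minimize a discordance measure using only the known ratings. The abstract is cut off before the paper's definition of total rating discordance, so the measure used here (the sum of absolute rating differences between every pair of providers for each subject) is a hypothetical stand-in and not the authors' definition. The data, the 1 to 5 rating scale, the function names, and the brute-force search are likewise assumptions made for illustration; the paper proposes its own optimization models and algorithms.

```python
from itertools import product

# Hypothetical toy data: three rating providers, each rating only some subjects.
rating_lists = {
    "provider_a": {"s1": 5, "s2": 3, "s3": 1},
    "provider_b": {"s1": 4, "s3": 2, "s4": 5},
    "provider_c": {"s2": 3, "s4": 4},
}


def combine(lists):
    """Build the combined list: one row per subject, None where a provider gave no rating."""
    subjects = sorted({s for ratings in lists.values() for s in ratings})
    return {s: {p: lists[p].get(s) for p in lists} for s in subjects}


def toy_discordance(filled):
    """Assumed stand-in measure (not the paper's definition): for each subject,
    sum the absolute rating differences over every pair of providers."""
    total = 0
    for row in filled.values():
        vals = list(row.values())
        for i in range(len(vals)):
            for j in range(i + 1, len(vals)):
                total += abs(vals[i] - vals[j])
    return total


def impute_brute_force(combined, scale=(1, 2, 3, 4, 5)):
    """Try every assignment of scale values to the missing cells and keep one
    with the lowest toy discordance. Only feasible for tiny examples."""
    missing = [(s, p) for s, row in combined.items()
               for p, v in row.items() if v is None]
    best, best_cost = None, None
    for candidate in product(scale, repeat=len(missing)):
        filled = {s: dict(row) for s, row in combined.items()}
        for (s, p), value in zip(missing, candidate):
            filled[s][p] = value
        cost = toy_discordance(filled)
        if best_cost is None or cost < best_cost:
            best, best_cost = filled, cost
    return best, best_cost


combined = combine(rating_lists)
filled, cost = impute_brute_force(combined)
```

In this toy example the combined list has 12 cells, 4 of which are missing. Several imputations tie at the minimum under the assumed measure, so the brute-force search returns only one of them; exhaustive search also grows exponentially with the number of missing cells, which is why dedicated optimization models are needed at realistic scale.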

Taxonomy

Topics: Advanced Causal Inference Techniques