A Comparison of Clustering and Missing Data Methods for Health Sciences
Ran Zhao, Deanna Needell, Christopher Johansen, Jerry L. Grenard

TL;DR
This paper compares clustering methods for health data with missing values, demonstrating that spectral clustering combined with compressive sensing outperforms traditional approaches in accuracy and data completion.
Contribution
It introduces the use of spectral clustering with matrix completion via compressive sensing for health data, showing improved performance over standard methods.
Findings
Lower misclassification rates with proposed methods
Better matrix completion performance
Spectral clustering leverages high-dimensional data properties
Abstract
In this paper, we compare and analyze clustering methods with missing data in health behavior research. In particular, we propose and analyze the use of compressive sensing's matrix completion along with spectral clustering to cluster health related data. The empirical tests and real data results show that these methods can outperform standard methods like LPA and FIML, in terms of lower misclassification rates in clustering and better matrix completion performance in missing data problems. According to our examination, a possible explanation of these improvements is that spectral clustering takes advantage of high data dimension and compressive sensing methods utilize the near-to-low-rank property of health data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Distributed Sensor Networks and Detection Algorithms · Face and Expression Recognition
MethodsSpectral Clustering
