Clustering functional data with measurement errors: a simulation-based approach
Tingyu Zhu, Lan Xue, Carmen Tekwe, Keith Diaz, Mark Benden, Roger Zoh

TL;DR
This paper introduces a simulation-based method for clustering functional data contaminated with measurement errors, improving accuracy over naive methods by accounting for error distributions through repeated measurements.
Contribution
The paper presents a novel simulation-based approach that estimates measurement error distribution and adjusts clustering of functional data accordingly.
Findings
Improved clustering accuracy over naive methods
Effective in handling measurement errors in functional data
Applied successfully to childhood obesity data
Abstract
Clustering analysis of functional data, which comprises observations that evolve continuously over time or space, has gained increasing attention across various scientific disciplines. Practical applications often involve functional data that are contaminated with measurement errors arising from imprecise instruments, sampling errors, or other sources. These errors can significantly distort the inherent data structure, resulting in erroneous clustering outcomes. In this paper, we propose a simulation-based approach designed to mitigate the impact of measurement errors. Our proposed method estimates the distribution of functional measurement errors through repeated measurements. Subsequently, the clustering algorithm is applied to simulated data generated from the conditional distribution of the unobserved true functional data given the observed contaminated functional data, accounting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications
