Semi-supervised Classification for Functional Data with Application to Astronomical Spectra Analysis
Ruoxu Tan, Mingjie Jian, Yiming Zang

TL;DR
This paper extends semi-supervised classification methods to functional data, introducing new classifiers based on the Fermat distance, with theoretical guarantees and application to astronomical spectra.
Contribution
It develops novel semi-supervised classifiers for functional data using the Fermat distance, with efficient estimation procedures and theoretical convergence guarantees.
Findings
Unlabeled data improves accuracy only with sufficiently high sampling rate.
Proposed classifiers outperform existing supervised methods on simulated and real astronomical data.
Established exponential convergence rates for the $k$-NN classifier and Fermat distance consistency.
Abstract
Despite its extensive development for multivariate data, semi-supervised learning remains underdeveloped for functional data. To address this challenge, we extend the Fermat distance, a density-sensitive metric aligning with the semi-supervised setting, to the functional domain. Leveraging the Fermat distance, we propose novel semi-supervised classifiers, including the weighted -nearest neighbors (NN) classifier and multidimensional scaling (MDS)-induced classifiers. To accommodate massive datasets commonly seen in semi-supervised applications, we design a computationally efficient estimation procedure tailored for discrete and noisy functional observations. Theoretically, we establish exponentially decaying convergence rates of the -NN classifier and the consistency of the estimated Fermat distance. Crucially, our results reveal a phenomenon unique to error-contaminated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
