Predictions Based on the Clustering of Heterogeneous Functions via Shape and Subject-Specific Covariates
Garritt L. Page, Fernando A. Quintana

TL;DR
This paper introduces a Bayesian clustering method for functional data that leverages curve shape heterogeneity and covariates to improve predictions of career performance in NBA players.
Contribution
It develops a hierarchical Bayesian approach combining penalized B-splines and covariate-guided clustering for functional data analysis.
Findings
Effective clustering of player career curves based on smoothness.
Improved prediction accuracy for incomplete and future career curves.
Method successfully incorporates covariate information into clustering.
Abstract
We consider a study of players employed by teams who are members of the National Basketball Association where units of observation are functional curves that are realizations of production measurements taken through the course of one's career. The observed functional output displays large amounts of between player heterogeneity in the sense that some individuals produce curves that are fairly smooth while others are (much) more erratic. We argue that this variability in curve shape is a feature that can be exploited to guide decision making, learn about processes under study and improve prediction. In this paper we develop a methodology that takes advantage of this feature when clustering functional curves. Individual curves are flexibly modeled using Bayesian penalized B-splines while a hierarchical structure allows the clustering to be guided by the smoothness of individual curves. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
