Functional clustering methods for binary longitudinal data with temporal heterogeneity
Jinwon Sohn, Seonghyun Jeong, Young Min Cho, Taeyoung Park

TL;DR
This paper introduces a novel generalized varying-coefficient model for binary longitudinal data that simultaneously clusters time-dependent effects, effectively capturing heterogeneity without predefining cluster numbers.
Contribution
The paper presents a new clustering method for varying-coefficient functions in binary longitudinal data, handling unknown number of clusters and avoiding overfitting.
Findings
Successfully identified three distinct clusters in GSOEP data
Accurately specified cluster-specific varying-coefficients in simulations
Demonstrated model's ability to handle heterogeneity and unknown cluster count
Abstract
In the analysis of binary longitudinal data, it is of interest to model a dynamic relationship between a response and covariates as a function of time, while also investigating similar patterns of time-dependent interactions. We present a novel generalized varying-coefficient model that accounts for within-subject variability and simultaneously clusters varying-coefficient functions, without restricting the number of clusters nor overfitting the data. In the analysis of a heterogeneous series of binary data, the model extracts population-level fixed effects, cluster-level varying effects, and subject-level random effects. Various simulation studies show the validity and utility of the proposed method to correctly specify cluster-specific varying-coefficients when the number of clusters is unknown. The proposed method is applied to a heterogeneous series of binary data in the German…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Spatial and Panel Data Analysis
