Imputation of Longitudinal Data Using GANs: Challenges and Implications for Classification
Sharon Torao Pingi, Md Abul Bashar, Richi Nayak

TL;DR
This paper reviews how GANs are used for imputing missing data in longitudinal studies, discussing challenges, current methods, and future research directions to improve classification accuracy.
Contribution
It categorizes existing GAN-based imputation methods, analyzes their strengths and limitations, and highlights key challenges and future research directions in longitudinal data imputation.
Findings
GANs have potential to improve longitudinal data quality.
Current methods face challenges with data heterogeneity and missingness.
Identifies gaps and proposes future research directions.
Abstract
Longitudinal data is commonly utilised across various domains, such as health, biomedical, education and survey studies. This ubiquity has led to a rise in statistical, machine and deep learning-based methods for Longitudinal Data Classification (LDC). However, the intricate nature of the data, characterised by its multi-dimensionality, causes instance-level heterogeneity and temporal correlations that add to the complexity of longitudinal data analysis. Additionally, LDC accuracy is often hampered by the pervasiveness of missing values in longitudinal data. Despite ongoing research that draw on the generative power and utility of Generative Adversarial Networks (GANs) to address the missing data problem, critical considerations include statistical assumptions surrounding longitudinal data and missingness within it, as well as other data-level challenges like class imbalance and mixed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
