Improving Cardiac Risk Prediction Using Data Generation Techniques
Alexandre Cabodevila, Pedro Gamallo-Fernandez, Juan C. Vidal, Manuel Lama

TL;DR
This paper introduces a CVAE-based method to generate realistic synthetic clinical data, improving cardiac risk prediction accuracy and addressing data scarcity issues in medical datasets.
Contribution
It presents a novel architecture using CVAE for generating coherent synthetic clinical records to enhance cardiac risk prediction models.
Findings
Synthetic data improves classifier accuracy for risk detection.
The proposed method outperforms existing deep learning approaches.
Generated data is coherent and realistic.
Abstract
Cardiac rehabilitation constitutes a structured clinical process involving multiple interdependent phases, individualized medical decisions, and the coordinated participation of diverse healthcare professionals. This sequential and adaptive nature enables the program to be modeled as a business process, thereby facilitating its analysis. Nevertheless, studies in this context face significant limitations inherent to real-world medical databases: data are often scarce due to both economic costs and the time required for collection; many existing records are not suitable for specific analytical purposes; and, finally, there is a high prevalence of missing values, as not all patients undergo the same diagnostic tests. To address these limitations, this work proposes an architecture based on a Conditional Variational Autoencoder (CVAE) for the synthesis of realistic clinical records that are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · ECG Monitoring and Analysis
