TL;DR
This paper introduces a semi-supervised GAN model called gGAN that generates synthetic genetic data to predict disease outcomes, reducing the need for large labeled datasets and handling diverse populations.
Contribution
The paper presents a novel semi-supervised GAN architecture tailored for genetic data, enabling effective disease outcome prediction with limited labeled data.
Findings
Achieved satisfactory prediction results on real genetic datasets.
Capable of assessing compatibility of new genetic profiles for prediction.
Effective across different populations with varying genetic profiles.
Abstract
For most diseases, building large databases of labeled genetic data is an expensive and time-demanding task. To address this, we introduce genetic Generative Adversarial Networks (gGAN), a semi-supervised approach based on an innovative GAN architecture to create large synthetic genetic data sets starting with a small amount of labeled data and a large amount of unlabeled data. Our goal is to determine the propensity of a new individual to develop the severe form of the illness from their genetic profile alone. The proposed model achieved satisfactory results using real genetic data from different datasets and populations, in which the test populations may not have the same genetic profiles. The proposed model is self-aware and capable of determining whether a new genetic profile has enough compatibility with the data on which the network was trained and is thus suitable for prediction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
