Method for Generating Synthetic Data Combining Chest Radiography Images with Tabular Clinical Information Using Dual Generative Models
Tomohiro Kikuchi, Shouhei Hanaoka, Takahiro Nakao, Tomomi Takenaga,, Yukihiro Nomura, Harushi Mori, Takeharu Yoshikawa

TL;DR
This paper presents a novel method to generate synthetic hybrid medical records combining chest X-ray images and clinical data using dual GANs, enabling privacy-preserving data sharing with comparable analytical utility.
Contribution
The paper introduces a combined auto-encoding GAN and tabular GAN approach to create realistic synthetic medical records with both image and non-image data.
Findings
Synthetic data enabled training models with comparable performance to real data.
The method successfully integrates image features with clinical variables.
Synthetic datasets can be generated five times larger than original data.
Abstract
The generation of synthetic medical records using Generative Adversarial Networks (GANs) is becoming crucial for addressing privacy concerns and facilitating data sharing in the medical domain. In this paper, we introduce a novel method to create synthetic hybrid medical records that combine both image and non-image data, utilizing an auto-encoding GAN (alphaGAN) and a conditional tabular GAN (CTGAN). Our methodology encompasses three primary steps: I) Dimensional reduction of images in a private dataset (pDS) using the pretrained encoder of the {\alpha}GAN, followed by integration with the remaining non-image clinical data to form tabular representations; II) Training the CTGAN on the encoded pDS to produce a synthetic dataset (sDS) which amalgamates encoded image features with non-image clinical data; and III) Reconstructing synthetic images from the image features using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Generative Adversarial Networks and Image Synthesis · COVID-19 diagnosis using AI
