Generation of synthetic data using breast cancer dataset and classification with resnet18
Dilsat Berin Aytar, Semra Gunduc

TL;DR
This paper explores generating synthetic breast cancer histopathology images using MSG-GAN and evaluates their effectiveness in training ResNet18 classifiers, aiming to improve cancer detection while addressing data privacy issues.
Contribution
The study introduces the use of MSG-GAN to generate synthetic breast cancer images and assesses their utility in training ResNet18 for classification tasks.
Findings
Synthetic images closely resemble real data.
ResNet18 trained on synthetic data achieves comparable accuracy.
Synthetic data helps address privacy and data scarcity issues.
Abstract
Since technology is advancing so quickly in the modern era of information, data is becoming an essential resource in many fields. Correct data collection, organization, and analysis make it a potent tool for successful decision-making, process improvement, and success across a wide range of sectors. Synthetic data is required for a number of reasons, including the constraints of real data, the expense of collecting labeled data, and privacy and security problems in specific situations and domains. For a variety of reasons, including security, ethics, legal restrictions, sensitivity and privacy issues, and ethics, synthetic data is a valuable tool, particularly in the health sector. A deep learning model called GAN (Generative Adversarial Networks) has been developed with the intention of generating synthetic data. In this study, the Breast Histopathology dataset was used to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection
