Aiding Medical Diagnosis through Image Synthesis and Classification
Kanishk Choudhary

TL;DR
This paper introduces a system that generates realistic medical images from text descriptions and validates them with a classifier, enhancing medical training and diagnostic support through high-quality synthetic images.
Contribution
It presents a novel method combining image synthesis and classification validation using fine-tuned diffusion models and a high-accuracy classifier for medical image generation.
Findings
Achieved an F1 score of 0.6727 for generated images.
Classifier accuracy of 99.76% on the dataset.
Some tissue types reached perfect classification scores.
Abstract
Medical professionals, especially those in training, often depend on visual reference materials to support an accurate diagnosis and develop pattern recognition skills. However, existing resources may lack the diversity and accessibility needed for broad and effective clinical learning. This paper presents a system designed to generate realistic medical images from textual descriptions and validate their accuracy through a classification model. A pretrained stable diffusion model was fine-tuned using Low-Rank Adaptation (LoRA) on the PathMNIST dataset, consisting of nine colorectal histopathology tissue types. The generative model was trained multiple times using different training parameter configurations, guided by domain-specific prompts to capture meaningful features. To ensure quality control, a ResNet-18 classification model was trained on the same dataset, achieving 99.76%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Brain Tumor Detection and Classification · Digital Imaging for Blood Diseases
