Deep Learning Based Model for Breast Cancer Subtype Classification
Sheetal Rajpal, Virendra Kumar, Manoj Agarwal, Naveen Kumar

TL;DR
This paper presents a two-stage deep learning model combining an autoencoder and neural network to classify breast cancer into four subtypes with high accuracy using gene expression data.
Contribution
It introduces a novel deep learning framework that reduces dimensionality and improves classification accuracy for breast cancer subtypes.
Findings
Achieved 90.7% mean 10-fold test accuracy on TCGA dataset.
Autoencoder effectively reduces gene expression features from 20,530 to 500.
Model demonstrates robustness across multiple runs.
Abstract
Breast cancer has long been a prominent cause of mortality among women. Diagnosis, therapy, and prognosis are now possible, thanks to the availability of RNA sequencing tools capable of recording gene expression data. Molecular subtyping being closely related to devising clinical strategy and prognosis, this paper focuses on the use of gene expression data for the classification of breast cancer into four subtypes, namely, Basal, Her2, LumA, and LumB. In stage 1, we suggested a deep learning-based model that uses an autoencoder to reduce dimensionality. The size of the feature set is reduced from 20,530 gene expression values to 500 by using an autoencoder. This encoded representation is passed to the deep neural network of the second stage for the classification of patients into four molecular subtypes of breast cancer. By deploying the combined network of stages 1 and 2, we have been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · AI in cancer detection · Molecular Biology Techniques and Applications
