Machine Learning Against Cancer: Accurate Diagnosis of Cancer by Machine Learning Classification of the Whole Genome Sequencing Data
Arash Hooshmand

TL;DR
This paper presents a novel machine learning approach that achieves perfect accuracy in classifying cancerous versus healthy tissues using whole genome sequencing data, including early-stage cancers, with privacy-preserving normalized data.
Contribution
The authors introduce MLAC, a new machine learning framework that attains perfect classification metrics across multiple cancer types using genomic data, even with limited samples.
Findings
Achieved perfect precision, sensitivity, and specificity for most tumor types.
Effective classification and clustering of cancerous and healthy samples.
Works well with normalized RNA sequencing data, preserving privacy.
Abstract
Machine learning can precisely identify different cancer tumors at any stage by classifying cancerous and healthy samples based on their genomic profile. We have developed novel methods of MLAC (Machine Learning Against Cancer) achieving perfect results with perfect precision, sensitivity, and specificity. We have used the whole genome sequencing data acquired by next-generation RNA sequencing techniques in The Cancer Genome Atlas and Genotype-Tissue Expression projects for cancerous and healthy tissues respectively. Moreover, we have shown that unsupervised machine learning clustering has great potential to be used for cancer diagnosis. Indeed, a creative way to work with data and general algorithms has resulted in perfect classification i.e. all precision, sensitivity, and specificity are equal to 1 for most of the different tumor types even with a modest amount of data, and the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · AI in cancer detection · Genetics, Bioinformatics, and Biomedical Research
