Predicting Molecular Phenotypes with Single Cell RNA Sequencing Data: an Assessment of Unsupervised Machine Learning Models
Anastasia Dunca, Frederick R. Adler

TL;DR
This study evaluates various unsupervised machine learning models applied to single cell RNA sequencing data to classify treatment-resistant tumor phenotypes, identifying effective clustering approaches for potential clinical applications.
Contribution
It provides a comprehensive assessment of multiple clustering algorithms on scRNAseq data for tumor phenotype classification, proposing an optimized pipeline for clinical research.
Findings
K-Means, Ward, and BIRCH achieved ~80% accuracy in tumor vs. non-tumor classification.
The pipeline effectively distinguishes molecular subtypes and patient IDs.
Optimized models can aid understanding of cancer cell behavior in clinical settings.
Abstract
According to the National Cancer Institute, there were 9.5 million cancer-related deaths in 2018. A challenge in improving treatment is resistance in genetically unstable cells. The purpose of this study is to evaluate unsupervised machine learning on classifying treatment-resistant phenotypes in heterogeneous tumors through analysis of single cell RNA sequencing(scRNAseq) data with a pipeline and evaluation metrics. scRNAseq quantifies mRNA in cells and characterizes cell phenotypes. One scRNAseq dataset was analyzed (tumor/non-tumor cells of different molecular subtypes and patient identifications). The pipeline consisted of data filtering, dimensionality reduction with Principal Component Analysis, projection with Uniform Manifold Approximation and Projection, clustering with nine approaches (Ward, BIRCH, Gaussian Mixture Model, DBSCAN, Spectral, Affinity Propagation, Agglomerative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Genomics and Diagnostics · Gene expression and cancer classification · Molecular Biology Techniques and Applications
