A Pan-cancer Classification Model using Multi-view Feature Selection Method and Ensemble Classifier
Tareque Mohmud Chowdhury, Farzana Tabassum, Sabrina Islam, and Abu, Raihan Mostofa Kamal

TL;DR
This paper introduces a novel multi-view feature selection and ensemble classification framework that significantly improves the accuracy of cancer sample classification across 33 types, especially in challenging subgroups.
Contribution
It proposes a new feature selection method tailored for transcriptome data and combines multiple classifiers into an ensemble, achieving state-of-the-art accuracy in pan-cancer classification.
Findings
97.11% classification accuracy
Over 90% accuracy on 12 challenging cancer types
Enriched pathways related to cancer development
Abstract
Accurately identifying cancer samples is crucial for precise diagnosis and effective patient treatment. Traditional methods falter with high-dimensional and high feature-to-sample count ratios, which are critical for classifying cancer samples. This study aims to develop a novel feature selection framework specifically for transcriptome data and propose two ensemble classifiers. For feature selection, we partition the transcriptome dataset vertically based on feature types. Then apply the Boruta feature selection process on each of the partitions, combine the results, and apply Boruta again on the combined result. We repeat the process with different parameters of Boruta and prepare the final feature set. Finally, we constructed two ensemble ML models based on LR, SVM and XGBoost classifiers with max voting and averaging probability approach. We used 10-fold cross-validation to ensure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Feature Selection · Support Vector Machine
