Provably Robust Pre-Trained Ensembles for Biomarker-Based Cancer Classification
Chongmin Lee, Jihie Kim

TL;DR
This paper introduces a robust ensemble approach using pre-trained Hyperfast models for cancer classification, achieving high accuracy and resilience to class imbalance with fewer features and minimal tuning.
Contribution
It presents a novel ensemble combining Hyperfast, XGBoost, and LightGBM for multi-class cancer classification, demonstrating improved accuracy and robustness over existing methods.
Findings
Achieved highest AUC of 0.9929 on binary cancer classification.
Demonstrated robustness under class imbalance with balanced accuracy and recall.
Reduced feature usage to 500 PCA components while maintaining high accuracy.
Abstract
Certain cancer types, notably pancreatic cancer, are difficult to detect at an early stage, motivating robust biomarker-based screening. Liquid biopsies enable non-invasive monitoring of circulating biomarkers, but typical machine learning pipelines for high-dimensional tabular data (e.g., random forests, SVMs) rely on expensive hyperparameter tuning and can be brittle under class imbalance. We leverage a meta-trained Hyperfast model for classifying cancer, accomplishing the highest AUC of 0.9929 and simultaneously achieving robustness especially on highly imbalanced datasets compared to other ML algorithms in several binary classification tasks (e.g. breast invasive carcinoma; BRCA vs. non-BRCA). We also propose a novel ensemble model combining pre-trained Hyperfast model, XGBoost, and LightGBM for multi-class classification tasks, achieving an incremental increase in accuracy (0.9464)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection
MethodsPrincipal Components Analysis · Support Vector Machine
