Genetic Features for Drug Responses in Cancer -- Investigating an Ensemble-Feature-Selection Approach

Johannes Schlüter; Alexander Schönhuth

arXiv:2507.02818·q-bio.GN·July 4, 2025·Comput. Biol. Medicine

TL;DR

This study uses ensemble machine learning to identify key genetic features, especially CNVs, that predict drug responses in cancer, proposing a reduced feature set for improved biomarker discovery and personalized therapy.

Contribution

Introduces an ensemble-feature-selection approach that reduces an original pool of 38,977 features to a critical set of 421, highlighting the predictive power of CNVs over mutations for drug response.

Findings

01

Copy number variations are more predictive than mutations.

02

A reduced set of 421 features effectively predicts drug responses.

03

IC50 values are used as the metric of drug efficacy against which features are correlated.

Abstract

Predicting drug responses using genetic and transcriptomic features is crucial for enhancing personalized medicine. In this study, we implemented an ensemble of machine learning algorithms to analyze the correlation between genetic and transcriptomic features of cancer cell lines and IC50 values, a reliable metric for drug efficacy. Our analysis involved a reduction of the feature set from an original pool of 38,977 features, demonstrating a strong linear relationship between genetic features and drug responses across various algorithms, including SVR, Linear Regression, and Ridge Regression. Notably, copy number variations (CNVs) emerged as more predictive than mutations, suggesting a significant reevaluation of biomarkers for drug response prediction. Through rigorous statistical methods, we identified a highly reduced set of 421 critical features. This set offers a novel perspective…
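To make the idea of ensemble feature selection concrete, here is a minimal sketch of one common way to combine several models: rank features by absolute coefficient within each model, average the ranks, and keep the top k. This is an illustration under stated assumptions, not the authors' procedure. The mean-rank rule, the function names `rank_features` and `ensemble_select`, and all feature names and scores are hypothetical.

```python
from statistics import mean


def rank_features(scores):
    """Map feature name -> rank (1 = most important) by absolute score."""
    ordered = sorted(scores, key=lambda name: abs(scores[name]), reverse=True)
    return {name: position for position, name in enumerate(ordered, start=1)}


def ensemble_select(scores_by_model, k):
    """Return the k features with the lowest mean rank across all models."""
    per_model_ranks = [rank_features(s) for s in scores_by_model.values()]
    # Assumes every model scored the same set of features.
    features = sorted(per_model_ranks[0])
    mean_rank = {f: mean(r[f] for r in per_model_ranks) for f in features}
    # Sort by mean rank; ties are broken alphabetically for determinism.
    return sorted(features, key=lambda f: (mean_rank[f], f))[:k]


# Hypothetical per-model coefficients; names and values are invented.
scores_by_model = {
    "svr": {"cnv_A": 0.9, "cnv_B": -0.7, "mut_C": 0.1, "expr_D": 0.3},
    "linear": {"cnv_A": 0.8, "cnv_B": 0.2, "mut_C": -0.1, "expr_D": 0.6},
    "ridge": {"cnv_A": 0.7, "cnv_B": -0.5, "mut_C": 0.05, "expr_D": 0.4},
}

selected = ensemble_select(scores_by_model, k=2)
print(selected)
```

Averaging ranks rather than raw coefficients keeps models with differently scaled weights comparable. Other aggregation rules (voting, stability selection) would fit the same interface.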
