# Structured penalized regression for drug sensitivity prediction

**Authors:** Zhi Zhao, Manuela Zucknick

arXiv: 1902.04996 · 2020-03-10

## TL;DR

This paper introduces a novel structured penalized regression method, IPF-tree-lasso, that leverages correlation structures and heterogeneity in multi-omics data to improve drug sensitivity prediction in cancer cell lines.

## Contribution

The paper develops the IPF-tree-lasso method, combining structured penalties and efficient parameter optimization, to enhance multivariate drug sensitivity prediction from heterogeneous multi-omics data.

## Key findings

- IPF-tree-lasso outperforms other lasso-type methods in simulations.
- The method effectively utilizes correlation structures between drugs.
- Application to cancer data demonstrates improved prediction accuracy.

## Abstract

Large-scale {\it in vitro} drug sensitivity screens are an important tool in personalized oncology to predict the effectiveness of potential cancer drugs. The prediction of the sensitivity of cancer cell lines to a panel of drugs is a multivariate regression problem with high-dimensional heterogeneous multi-omics data as input data and with potentially strong correlations between the outcome variables which represent the sensitivity to the different drugs. We propose a joint penalized regression approach with structured penalty terms which allow us to utilize the correlation structure between drugs with group-lasso-type penalties and at the same time address the heterogeneity between omics data sources by introducing data-source-specific penalty factors to penalize different data sources differently. By combining integrative penalty factors (IPF) with tree-guided group lasso, we create the IPF-tree-lasso method. We present a unified framework to transform more general IPF-type methods to the original penalized method. Because the structured penalty terms have multiple parameters, we demonstrate how the interval-search Efficient Parameter Selection via Global Optimization (EPSGO) algorithm can be used to optimize multiple penalty parameters efficiently. Simulation studies show that IPF-tree-lasso can improve the prediction performance compared to other lasso-type methods, in particular for heterogenous data sources. Finally, we employ the new methods to analyse data from the Genomics of Drug Sensitivity in Cancer project.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.04996/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1902.04996/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1902.04996/full.md

---
Source: https://tomesphere.com/paper/1902.04996