Evaluating Deep Regression Models for WSI-Based Gene-Expression   Prediction

Fredrik K. Gustafsson; Mattias Rantalainen

arXiv:2410.00945·q-bio.GN·October 3, 2024

Evaluating Deep Regression Models for WSI-Based Gene-Expression Prediction

Fredrik K. Gustafsson, Mattias Rantalainen

PDF

Open Access 1 Repo

TL;DR

This paper evaluates deep learning regression models for predicting gene expression from whole-slide images, providing insights and recommendations for model training strategies in computational pathology.

Contribution

It offers a comprehensive analysis of training strategies for deep regression models in WSI-based gene-expression prediction, including the effectiveness of a single multi-gene model.

Findings

01

Training a single model for all genes is computationally efficient.

02

Deep regression models can effectively predict high-dimensional gene expression.

03

Recommendations for model training improve prediction performance.

Abstract

Prediction of mRNA gene-expression profiles directly from routine whole-slide images (WSIs) using deep learning models could potentially offer cost-effective and widely accessible molecular phenotyping. While such WSI-based gene-expression prediction models have recently emerged within computational pathology, the high-dimensional nature of the corresponding regression problem offers numerous design choices which remain to be analyzed in detail. This study provides recommendations on how deep regression models should be trained for WSI-based gene-expression prediction. For example, we conclude that training a single model to simultaneously regress all 20530 genes is a computationally efficient yet very strong baseline.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mahmoodlab/uni
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Machine Learning and Data Classification · Machine Learning in Bioinformatics