Representing high throughput expression profiles via perturbation barcodes reveals compound targets
Tracey Filzen, Peter Kutchukian, Jeffrey Hermes, Jing Li, Matthew, Tudor

TL;DR
This paper introduces a deep learning-based perturbation barcode method for high throughput gene expression data that improves biological signal detection, reveals compound targets, and enhances functional annotation of unknown compounds.
Contribution
The study presents a novel deep learning approach to convert gene expression profiles into perturbation barcodes, outperforming raw data in biological insight extraction and compound target prediction.
Findings
Barcode captures compound structure and target information.
Better than raw data at predicting compound promiscuity.
Enables more sensitive functional assignment of compounds.
Abstract
High throughput mRNA expression profiling can be used to characterize the response of cell culture models to perturbations such as pharmacologic modulators and genetic perturbations. As profiling campaigns expand in scope, it is important to homogenize, summarize, and analyze the resulting data in a manner that captures significant biological signals in spite of various noise sources such as batch effects and stochastic variation. We used the L1000 platform for large-scale profiling of 978 genes, chosen to be representative of the genome as whole, across thousands of compound treatments. Here, a method is described that uses deep learning techniques to convert the expression changes of the landmark genes into a perturbation barcode that reveals important features of the underlying data, performing better than the raw data in revealing important biological insights. The barcode captures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
