Pathway-based Bayesian factor models for 'omics data
Lorenzo Mauri, Federica Stolf, Amy H. Herring, Cameron Miller, David B. Dunson

TL;DR
BASIL is a scalable Bayesian framework that integrates pathway knowledge into gene expression analysis, improving interpretability and stability of transcriptomic data modeling without heavy computational costs.
Contribution
It introduces a novel pre-training based Bayesian factor model that incorporates annotated gene sets, enhancing interpretability and reproducibility in transcriptomic data analysis.
Findings
Uncovered key host-response modules in fever transcriptomics
Identified phosphoinositide signaling as a driver of gene variability
Demonstrated improved interpretability over standard models
Abstract
Interpreting RNA-sequencing data requires identifying coordinated gene expression patterns that correspond to biological pathways. Standard factor models provide useful dimension reduction but typically ignore existing pathway knowledge or incorporate it through restrictive assumptions, limiting interpretability, and reproducibility. Here, we develop Bayesian Analysis with gene-Sets Informed Latent space (BASIL), a scalable framework for analyzing transcriptomic data that integrates annotated gene sets into latent variable inference. BASIL places structured priors on factor loadings, shrinking them toward combinations of annotated gene sets, enhancing biological interpretability and stability, while simultaneously learning new unstructured components. BASIL provides accurate covariance estimates and uncertainty quantification, without resorting to computationally expensive Markov chain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene expression and cancer classification · Single-cell and spatial transcriptomics
