Bayesian Hybrid Matrix Factorisation for Data Integration
Thomas Brouwer, Pietro Li\'o

TL;DR
This paper presents a Bayesian hybrid matrix factorisation model that effectively integrates diverse datasets for accurate in- and out-of-matrix predictions, outperforming existing methods especially with sparse data.
Contribution
The paper introduces a flexible Bayesian hybrid matrix factorisation model capable of integrating various data types for improved prediction accuracy.
Findings
Better performance on drug sensitivity datasets, especially with increased sparsity.
Achieved top results in out-of-matrix predictions on methylation and gene expression datasets.
Outperformed state-of-the-art models in multiple biological data integration tasks.
Abstract
We introduce a novel Bayesian hybrid matrix factorisation model (HMF) for data integration, based on combining multiple matrix factorisation methods, that can be used for in- and out-of-matrix prediction of missing values. The model is very general and can be used to integrate many datasets across different entity types, including repeated experiments, similarity matrices, and very sparse datasets. We apply our method on two biological applications, and extensively compare it to state-of-the-art machine learning and matrix factorisation models. For in-matrix predictions on drug sensitivity datasets we obtain consistently better performances than existing methods. This is especially the case when we increase the sparsity of the datasets. Furthermore, we perform out-of-matrix predictions on methylation and gene expression datasets, and obtain the best results on two of the three datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Tensor decomposition and applications
