A Bayesian model for microarray datasets merging
Marie-Christine Roubaud (LATP), Bruno Torr\'esani (LATP)

TL;DR
This paper introduces a Bayesian model for merging microarray datasets by explicitly modeling interstudy variability, enabling more accurate data aggregation and biological effect detection across studies.
Contribution
It presents a novel Bayesian algorithm that models study-dependent nonlinear transformations and noise, improving dataset merging and biological signal recovery.
Findings
Successfully corrected nonlinear distortions in E. Coli data
Enhanced detection of differentially expressed genes in prostate cancer datasets
Validated method improves data integration accuracy
Abstract
The aggregation of microarray datasets originating from different studies is still a difficult open problem. Currently, best results are generally obtained by the so-called meta-analysis approach, which aggregates results from individual datasets, instead of analyzing aggre-gated datasets. In order to tackle such aggregation problems, it is necessary to correct for interstudy variability prior to aggregation. The goal of this paper is to present a new approach for microarray datasets merging, based upon explicit modeling of interstudy variability and gene variability. We develop and demonstrate a new algorithm for microarray datasets merging. The underlying model assumes normally distributed intrinsic gene expressions, distorted by a study-dependent nonlinear transformation, and study dependent (normally distributed) observation noise. The algorithm addresses both parameter estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Gene Regulatory Network Analysis
