Detecting signal from science:The structure of research communities and prior knowledge improves prediction of genetic regulatory experiments
Alexander V. Belikov, Andrey Rzhetsky, James Evans

TL;DR
This paper introduces a Bayesian framework that predicts the robustness and replicability of scientific claims in gene research by analyzing publication metadata and community structure, aiding navigation and funding decisions.
Contribution
It presents a novel Bayesian calculus leveraging publication metadata and community analysis to predict the reproducibility of scientific findings from literature and high-throughput experiments.
Findings
Scientifically focused but independent research is more likely to replicate.
Dispersing research funding broadly enhances the likelihood of reproducible findings.
The approach can identify and counteract biases in scientific literature.
Abstract
The explosive growth of scientists, scientific journals, articles and findings in recent years exponentially increases the difficulty scientists face in navigating prior knowledge. This challenge is exacerbated by uncertainty about the reproducibility of published findings. The availability of massive digital archives, machine reading and extraction tools on the one hand, and automated high-throughput experiments on the other, allow us to evaluate these challenges at scale and identify novel opportunities for accelerating scientific advance. Here we demonstrate a Bayesian calculus that enables the positive prediction of robust, replicable scientific claims with findings automatically extracted from published literature on gene interactions. We matched these findings, filtered by science, with unfiltered gene interactions measured by the massive LINCS L1000 high-throughput experiment to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Scientific Computing and Data Management · Biomedical Text Mining and Ontologies
