False discovery rates in somatic mutation studies of cancer
Lorenzo Trippa, Giovanni Parmigiani

TL;DR
This paper evaluates statistical methods for controlling false discovery rates in cancer genome mutation studies, introducing a Bayesian model to improve accuracy and assess existing approaches.
Contribution
It develops a semiparametric Bayesian model for more accurate false discovery rate estimation and compares it with existing methods using realistic simulated data.
Findings
Benjamini-Hochberg method is conservative in FDR control.
Empirical Bayes approach performs well in simulations.
Sj"oblom et al.'s methodology shows negligible FDR deviation.
Abstract
The purpose of cancer genome sequencing studies is to determine the nature and types of alterations present in a typical cancer and to discover genes mutated at high frequencies. In this article we discuss statistical methods for the analysis of somatic mutation frequency data generated in these studies. We place special emphasis on a two-stage study design introduced by Sj\"{o}blom et al. [Science 314 (2006) 268--274]. In this context, we describe and compare statistical methods for constructing scores that can be used to prioritize candidate genes for further investigation and to assess the statistical significance of the candidates thus identified. Controversy has surrounded the reliability of the false discovery rates estimates provided by the approximations used in early cancer genome studies. To address these, we develop a semiparametric Bayesian model that provides an accurate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
