Combining Bayesian and Frequentist Inference for Laboratory-Specific Performance Guarantees in Copy Number Variation Detection
Austin Talbot, Alex V. Kotlar, Yue Ke

TL;DR
This paper introduces a hybrid Bayesian-Frequentist framework for CNV detection in oncology diagnostics, providing reliable performance guarantees despite small sample sizes and process heterogeneity.
Contribution
It presents a novel method combining Bayesian posteriors with frequentist coverage modeling, improving calibration and accuracy in CNV detection.
Findings
Achieves single-digit mean absolute coverage error across genes.
Outperforms Bayesian comparators with errors exceeding 60%.
Effective under both process-matched and unmatched conditions.
Abstract
Targeted amplicon panels are widely used in oncology diagnostics, but providing per-gene performance guarantees for copy number variant (CNV) detection remains challenging due to amplification artifacts, process-mismatch heterogeneity, and limited validation sample sizes. While Bayesian CNV callers naturally quantify per-sample uncertainty, translating this into the frequentist population-level guarantees required for clinical validation, coverage rates, false-positive bounds, and minimum detectable copy-number changes, is a fundamentally different inferential problem. We show empirically that even robust Bayesian credible intervals, including coarsened posteriors and sandwich-adjusted intervals, are severely miscalibrated on panels with small amplicon counts per gene. To address this, we propose a hybrid framework that evaluates Bayesian posterior functionals on validation samples and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
