Combining Bayesian and Frequentist Inference for Laboratory-Specific Performance Guarantees in Copy Number Variation Detection

Austin Talbot; Alex V. Kotlar; Yue Ke

arXiv:2604.14305·stat.ME·April 17, 2026

Combining Bayesian and Frequentist Inference for Laboratory-Specific Performance Guarantees in Copy Number Variation Detection

Austin Talbot, Alex V. Kotlar, Yue Ke

PDF

TL;DR

This paper introduces a hybrid Bayesian-Frequentist framework for CNV detection in oncology diagnostics, providing reliable performance guarantees despite small sample sizes and process heterogeneity.

Contribution

It presents a novel method combining Bayesian posteriors with frequentist coverage modeling, improving calibration and accuracy in CNV detection.

Findings

01

Achieves single-digit mean absolute coverage error across genes.

02

Outperforms Bayesian comparators with errors exceeding 60%.

03

Effective under both process-matched and unmatched conditions.

Abstract

Targeted amplicon panels are widely used in oncology diagnostics, but providing per-gene performance guarantees for copy number variant (CNV) detection remains challenging due to amplification artifacts, process-mismatch heterogeneity, and limited validation sample sizes. While Bayesian CNV callers naturally quantify per-sample uncertainty, translating this into the frequentist population-level guarantees required for clinical validation, coverage rates, false-positive bounds, and minimum detectable copy-number changes, is a fundamentally different inferential problem. We show empirically that even robust Bayesian credible intervals, including coarsened posteriors and sandwich-adjusted intervals, are severely miscalibrated on panels with small amplicon counts per gene. To address this, we propose a hybrid framework that evaluates Bayesian posterior functionals on validation samples and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.