General Coverage Models: Structure, Monotonicity, and Shotgun Sequencing
Yitzchak Grunbaum, Eitan Yaakobi

TL;DR
This paper introduces a combinatorial approach to analyze coverage times in models inspired by shotgun DNA sequencing, providing exact formulas, asymptotic analysis, and bounds for various sampling schemes.
Contribution
It develops a unifying combinatorial framework for coverage problems, enabling exact calculations and asymptotic analysis for window-based models and uniform regular models.
Findings
Exact expressions for coverage times in window models.
Asymptotic behavior characterized for cyclic and non-cyclic models.
Bounds established for uniform -regular models and batch sampling.
Abstract
We study coverage processes in which each draw reveals a subset of , and the goal is to determine the expected number of draws until all items are seen at least once. A classical example is the Coupon Collector's Problem, where each draw reveals exactly one item. Motivated by shotgun DNA sequencing, we introduce a model where each draw is a contiguous window of fixed length, in both cyclic and non-cyclic variants. We develop a unifying combinatorial tool that shifts the task of finding coverage time from probability, to a counting problem over families of subsets of that together contain all items, enabling exact calculation. Using this result, we obtain exact expressions for the window models. We then leverage past results on a continuous analogue of the cyclic window model to analyze the asymptotic behavior of both models. We further study what we call uniform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
