Another view of sequential sampling in the birth process with immigration
Poly H. da Silva, Arash Jamshidpey, Simon Tavar\'e

TL;DR
This paper investigates properties of a continuous-time birth process with immigration model for counts-of-counts data, analyzing correlations, variances, and asymptotics, with applications to biological data and Fisher's multi-sampling problem.
Contribution
It introduces a continuous-time framework for counts-of-counts data in a birth process with immigration, simplifying asymptotic analysis and connecting to Chinese Restaurant Process samples.
Findings
Derived expected sample variance and asymptotics for sequential samples
Connected continuous-time model to Chinese Restaurant Process sampling
Reinterpreted Fisher's 1943 multi-sampling problem in this context
Abstract
Models of counts-of-counts data have been extensively used in the biological sciences, for example in cancer, population genetics, sampling theory and ecology. In this paper we explore properties of one model that is embedded into a continuous-time process and can describe the appearance of certain biological data such as covid DNA sequences in a database. More specifically, we consider an evolving model of counts-of-counts data that arises as the family size counts of samples taken sequentially from a Birth process with Immigration (BI). Here, each family represents a type or species, and the family size counts represent the type or species frequency spectrum in the population. We study the correlation of and , the number of families observed in two disjoint time intervals and . We find the expected sample variance and its asymptotics for consecutive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Fractal and DNA sequence analysis · Stochastic processes and statistical mechanics
