Distributions of positive signals in pyrosequencing
Yong Kong

TL;DR
This paper derives the statistical distribution of positive signals in pyrosequencing, providing formulas for mean and variance, which aid in instrument and software development for this sequencing technology.
Contribution
It introduces explicit formulas for the distribution of positive signals in pyrosequencing, including mean and variance, based on flow cycles and nucleotide probabilities.
Findings
Mean number of positive signals is approximately twice the flow cycles.
Derived explicit formulas for distribution, mean, and variance.
Distribution depends on flow cycle number and nucleotide probabilities.
Abstract
Pyrosequencing is one of the important next-generation sequencing technologies. We derive the distribution of the number of positive signals in pyrograms of this sequencing technology as a function of flow cycle numbers and nucleotide probabilities of the target sequences. As for the distribution of sequence length, we also derive the distribution of positive signals for the fixed flow cycle model. Explicit formulas are derived for the mean and variance of the distributions. A simple result for the mean of the distribution is that the mean number of positive signals in a pyrogram is approximately twice the number of flow cycles, regardless of nucleotide probabilities. The statistical distributions will be useful for instrument and software development for pyrosequencing and other related platforms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
