The number of distinct adjacent pairs in geometrically distributed words
Margaret Archibald, Aubrey Blecher, Charlotte Brennan, Arnold, Knopfmacher, Stephan Wagner, Mark Ward

TL;DR
This paper investigates the expected number of distinct adjacent pairs in sequences of independent geometric random variables, providing exact counts for short sequences and asymptotic behavior for large sequences.
Contribution
It derives an explicit expression for the expected number of distinct adjacent pairs in geometric words and analyzes its asymptotic growth as sequence length increases.
Findings
Exact counts for short sequence lengths
Asymptotic expression for large n
Growth rate of distinct pairs as sequence length increases
Abstract
A sequence of geometric random variables of length is a sequence of independent and identically distributed geometric random variables () where for with We study the number of distinct adjacent two letter patterns in such sequences. Initially we directly count the number of distinct pairs in words of short length. Because of the rapid growth of the number of word patterns we change our approach to this problem by obtaining an expression for the expected number of distinct pairs in words of length . We also obtain the asymptotics for the expected number as .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Algorithms and Data Compression · Authorship Attribution and Profiling
