On the number of words with restrictions on the number of symbols
Ver\'onica Becher, Eda Cesaratto

TL;DR
This paper demonstrates that in an n-symbol alphabet, the count of words of length n with a number of distinct symbols significantly different from the expected (1-1/e)n decreases exponentially with n, using advanced combinatorial methods.
Contribution
It provides a new exponential decay bound for the number of words with atypical symbol diversity, employing Laplace's method and Stirling number bounds.
Findings
Number of words with atypical symbol counts decays exponentially.
Quantitative bounds expressed via inequalities.
Analysis based on Laplace's method and Stirling numbers.
Abstract
We show that, in an alphabet of symbols, the number of words of length whose number of different symbols is away from , which is the value expected by the Poisson distribution, has exponential decay in . We use Laplace's method for sums and known bounds of Stirling numbers of the second kind. We express our result in terms of inequalities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
