Partial fillup and search time in LC tries
Svante Janson, Wojciech Szpankowski

TL;DR
This paper provides a theoretical analysis of partial fillup in LC tries, showing that it slightly improves search times compared to original LC tries, with search depth typically proportional to loglog n.
Contribution
It offers a rigorous justification for experimental results on partial fillup in LC tries, quantifying the typical search depth and its dependence on parameters.
Findings
Partial fillup levels are concentrated on two values with high probability.
Typical search depth in alpha-LC tries is C loglog n, with C depending on p.
Search time in alpha-LC tries is smaller but of the same order as in original LC tries.
Abstract
Andersson and Nilsson introduced in 1993 a level-compressed trie (in short: LC trie) in which a full subtree of a node is compressed to a single node of degree being the size of the subtree. Recent experimental results indicated a 'dramatic improvement' when full subtrees are replaced by partially filled subtrees. In this paper, we provide a theoretical justification of these experimental results showing, among others, a rather moderate improvement of the search time over the original LC tries. For such an analysis, we assume that n strings are generated independently by a binary memoryless source with p denoting the probability of emitting a 1. We first prove that the so called alpha-fillup level (i.e., the largest level in a trie with alpha fraction of nodes present at this level) is concentrated on two values with high probability. We give these values explicitly up to O(1), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · DNA and Biological Computing · Cellular Automata and Applications
