Partial fillup and search time in LC tries

Svante Janson; Wojciech Szpankowski

arXiv:cs/0510017·cs.DS·May 23, 2007

TL;DR

This paper gives a theoretical analysis of partial fillup in LC tries. It shows that partial fillup improves search time only moderately over the original LC tries, and that the typical search depth is proportional to log log n.

Contribution

It offers a rigorous justification for experimental results on partial fillup in LC tries, quantifying the typical search depth and its dependence on parameters.

Findings

01. The alpha-fillup level is concentrated on two values with high probability.

02. The typical search depth in alpha-LC tries is C log log n, where the constant C depends on p.

03. Search time in alpha-LC tries is smaller than in the original LC tries, but of the same order.
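The findings above can be explored with a small simulation. The following is a minimal sketch, not the authors' construction: it assumes that a prefix counts as a present node whenever at least one string starts with it, that every node advances by at least one bit (the path compression used in practical LC tries is omitted), and that all strings have the same length. The names `fillup` and `lc_depths` are hypothetical.

```python
import random


def fillup(strings, offset, alpha):
    """Alpha-fillup level of the subtrie that starts at bit `offset`.

    Assumption: a node at relative level k is "present" if at least one
    string has the corresponding k-bit prefix.
    """
    k = 0
    while offset + k + 1 <= len(strings[0]):
        present = len({s[offset:offset + k + 1] for s in strings})
        if present < alpha * 2 ** (k + 1):
            break
        k += 1
    return k


def lc_depths(strings, alpha, offset=0, depth=0, out=None):
    """Depth of every string in a simplified alpha-LC trie.

    Each node replaces the top k levels of its subtrie by one node of
    degree 2**k, where k is the alpha-fillup level (at least 1 here).
    """
    if out is None:
        out = []
    # A single string, or no bits left to branch on: stop here.
    if len(strings) <= 1 or offset >= len(strings[0]):
        out.extend([depth] * len(strings))
        return out
    k = max(1, fillup(strings, offset, alpha))
    groups = {}
    for s in strings:
        groups.setdefault(s[offset:offset + k], []).append(s)
    for group in groups.values():
        lc_depths(group, alpha, offset + k, depth + 1, out)
    return out


if __name__ == "__main__":
    rng = random.Random(1)  # fixed seed, chosen arbitrarily
    n, p, length = 2000, 0.5, 64
    data = [tuple(int(rng.random() < p) for _ in range(length)) for _ in range(n)]
    for alpha in (1.0, 0.5):
        depths = lc_depths(data, alpha)
        print(alpha, sum(depths) / len(depths))
```

Setting `alpha = 1.0` corresponds to compressing only full subtrees, as in the original LC trie; smaller values allow partially filled subtrees. Comparing average depths for several values of n gives a rough empirical feel for the growth rate, although a simulation at this scale cannot confirm an asymptotic statement.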

Abstract

In 1993, Andersson and Nilsson introduced the level-compressed trie (LC trie for short), in which a full subtree of a node is compressed into a single node whose degree equals the size of the subtree. Recent experimental results indicated a 'dramatic improvement' when full subtrees are replaced by partially filled subtrees. In this paper, we provide a theoretical justification of these experimental results, showing, among other things, a rather moderate improvement of the search time over the original LC tries. For this analysis, we assume that n strings are generated independently by a binary memoryless source, with p denoting the probability of emitting a 1. We first prove that the so-called alpha-fillup level (i.e., the largest level in a trie with an alpha fraction of nodes present at this level) is concentrated on two values with high probability. We give these values explicitly up to O(1), and…
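To make the definition of the alpha-fillup level concrete, here is a minimal sketch that generates n strings from a binary memoryless source and computes the level directly. It rests on one assumption that the abstract does not spell out: a node at level k is counted as present when at least one string has the corresponding k-bit prefix. The function names `random_strings` and `alpha_fillup_level` are illustrative only.

```python
import random


def random_strings(n, p, length, rng):
    """n independent binary strings; each bit is 1 with probability p."""
    return [tuple(int(rng.random() < p) for _ in range(length)) for _ in range(n)]


def alpha_fillup_level(strings, alpha):
    """Largest k such that at least alpha * 2**k level-k nodes are present.

    Assumption: a level-k node is present if some string has that k-bit
    prefix. Under this reading the filled fraction never increases with k,
    so the scan can stop at the first level that fails.
    """
    if not strings:
        return 0
    max_len = min(len(s) for s in strings)
    best = 0
    for k in range(1, max_len + 1):
        present = len({s[:k] for s in strings})
        if present < alpha * 2 ** k:
            break
        best = k
    return best


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, chosen arbitrarily
    data = random_strings(n=2000, p=0.5, length=64, rng=rng)
    for alpha in (1.0, 0.9, 0.5):
        print(alpha, alpha_fillup_level(data, alpha))
```

Repeating the run with different seeds shows how little the level varies from sample to sample, which is the behavior the two-value concentration result describes.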

Taxonomy

Topics: Algorithms and Data Compression · DNA and Biological Computing · Cellular Automata and Applications