Partial-Match Queries with Random Wildcards: In Tries and Distributed Hash Tables
Junichiro Fukuyama

TL;DR
This paper analyzes the average complexity of partial-match queries with random wildcards in tries and distributed hash tables, showing significant reductions from naive bounds and providing probabilistic correctness guarantees.
Contribution
It introduces tight average-case bounds for wildcard queries in tries and extends these results to distributed hash tables, including probabilistic correctness analysis for Chord.
Findings
Average number of steps in tries is O(2^w m / w)
Distributed hash tables require O(2^w m / w) hops on average for wildcard queries
Lookup in Chord is correct with high probability in O(m) hops
Abstract
Consider an -bit query to a bitwise trie . A wildcard is an unspecified bit in for which the query asks the membership for both cases and . It is common that such partial-match queries with wildcards are issued in tries. With uniformly random occurrences of wildcards in assumed, the obvious upper bound on the average number of traversal steps in is . We show that the average does not exceed \[ \frac{m+1}{w+1} \left( 2^{w+2} - 2 w - 4 \right) + m = O \left( \frac{2^w m}{w} \right), \] and equals the value exactly when includes all the -bit keys as the worst case. Here the query performs with the naive backtracking algorithm in . It is similarly shown that the average is in a general trie of maximum out-degree . Our analysis for tries is extended to a distributed hash table (DHT), which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Peer-to-Peer Network Technologies · Distributed systems and fault tolerance
