Bounds on the Number of Huffman and Binary-Ternary Trees
Angeline Rao, Ying Liu, Yezhou Feng, and Jian Shen

TL;DR
This paper establishes bounds on the number of n-ary Huffman trees and introduces Binary-Ternary trees, expanding the theoretical framework for data compression and biological data analysis.
Contribution
It provides recursive bounds for Huffman trees and introduces Binary-Ternary trees, offering new insights for data compression and biological sequence analysis.
Findings
Derived bounds for the number of Huffman trees matching previous best results.
Proved recursive formulas for Binary-Ternary trees.
Demonstrated applications in data compression and genome analysis.
Abstract
Huffman coding is a widely used method for lossless data compression because it optimally stores data based on how often the characters occur in Huffman trees. An -ary Huffman tree is a connected, cycle-lacking graph where each vertex can have either "children" vertices connecting to it, or 0 children. Vertices with 0 children are called \textit{leaves}. We let represent the total number of -ary Huffman trees with leaves. In this paper, we use a recursive method to generate upper and lower bounds on and get for . This matches the best results achieved by Elsholtz, Heuberger, and Prodinger in August 2011. Our approach reveals patterns in Huffman trees that we used in our analysis of the Binary-Ternary (BT) trees we created. Our research opens a completely new door in data compression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Error Correcting Code Techniques · Machine Learning in Bioinformatics
