Algorithmic counting of nonequivalent compact Huffman codes

Christian Elsholtz; Clemens Heuberger; Daniel Krenn

arXiv:1901.11343·math.CO·October 17, 2024

Algorithmic counting of nonequivalent compact Huffman codes

Christian Elsholtz, Clemens Heuberger, Daniel Krenn

PDF

1 Repo

TL;DR

This paper presents an efficient algorithm for counting nonequivalent compact Huffman codes and related structures, significantly improving previous computational bounds by using power series division.

Contribution

It introduces a method to compute the sequence for all n < N with nearly linear complexity in N, surpassing earlier cubic and quartic bounds.

Findings

01

Efficient computation of the sequence for all n < N

02

Reduction of complexity from O(N^3) to approximately N^{1+ε}

03

Applicable to various combinatorial structures related to Huffman codes

Abstract

It is known that the following five counting problems lead to the same integer sequence~ $f_{t} (n)$ : the number of nonequivalent compact Huffman codes of length~ $n$ over an alphabet of $t$ letters, the number of `nonequivalent' canonical rooted $t$ -ary trees (level-greedy trees) with $n$ ~leaves, the number of `proper' words, the number of bounded degree sequences, and the number of ways of writing $1 = \frac{1}{t ^{x_{1}}} + \dots + \frac{1}{t ^{x_{n}}}$ with integers $0 \leq x_{1} \leq x_{2} \leq \dots \leq x_{n}$ . In this work, we show that one can compute this sequence for \textbf{all} $n < N$ with essentially one power series division. In total we need at most $N^{1 + ε}$ additions and multiplications of integers of $c N$ bits, $c < 1$ , or $N^{2 + ε}$ bit operations, respectively. This improves an earlier bound by Even and Lempel who needed $O (N^{3})$ operations in the integer ring or…

Tables1

Table 1. Table 2.1. Cost of operations of power series with precision N 𝑁 N and coefficients with bit size M 𝑀 M . (We assume M = 𝑂 ( N ) 𝑀 𝑂 𝑁 M=\mathop{{O}{}}(N) and state a simpler expression for the bit operations for division.)

Task	Ring operations	Bit operations
addition	$N$	$N M$
multiplication	$N \log N 2^{𝑂 (\log^{*} N)}$	$N \log N 2^{𝑂 (\log^{*} N)} \cdot M \log M$
division	$N \log N 2^{𝑂 (\log^{*} N)}$	$N^{2} M {(\log N)}^{2} 2^{𝑂 (\log^{*} N)}$

Equations71

1 = \frac{1}{t ^{x_{1}}} + \dots + \frac{1}{t ^{x_{n}}}

1 = \frac{1}{t ^{x_{1}}} + \dots + \frac{1}{t ^{x_{n}}}

f_{t}(r)\colonequals\bigg{\lvert}\bigg{\{}(x_{1},\ldots,x_{r})\in\mathbb{Z}^{r}\colon\mathopen{}0\leq x_{1}\leq\cdots\leq x_{r},\,\sum_{i=1}^{r}\frac{1}{t^{x_{i}}}=1\bigg{\}}\bigg{\rvert},

f_{t}(r)\colonequals\bigg{\lvert}\bigg{\{}(x_{1},\ldots,x_{r})\in\mathbb{Z}^{r}\colon\mathopen{}0\leq x_{1}\leq\cdots\leq x_{r},\,\sum_{i=1}^{r}\frac{1}{t^{x_{i}}}=1\bigg{\}}\bigg{\rvert},

1 = \frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{16} = \frac{1}{2} + \frac{1}{8} + \frac{1}{8} + \frac{1}{8} + \frac{1}{8} = \frac{1}{4} + \frac{1}{4} + \frac{1}{4} + \frac{1}{8} + \frac{1}{8} .

1 = \frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{16} = \frac{1}{2} + \frac{1}{8} + \frac{1}{8} + \frac{1}{8} + \frac{1}{8} = \frac{1}{4} + \frac{1}{4} + \frac{1}{4} + \frac{1}{8} + \frac{1}{8} .

1, 1, 1, 2, 3, 5, 9, 16, 28, 50, 89, 159, \dots,

1, 1, 1, 2, 3, 5, 9, 16, 28, 50, 89, 159, \dots,

1, 1, 1, 2, 4, 7, 13, 25, 48, 92, 176, 338, \dots,

1, 1, 1, 2, 4, 7, 13, 25, 48, 92, 176, 338, \dots,

1, 1, 1, 2, 4, 8, 15, 29, 57, 112, 220, 432, \dots .

1, 1, 1, 2, 4, 8, 15, 29, 57, 112, 220, 432, \dots .

\mathop{{#1}{}}(n)=R\rho^{n}+R_{2}\rho_{2}^{n}+\mathop{{O}{}}\big{(}r_{3}^{n}\big{)},

\mathop{{#1}{}}(n)=R\rho^{n}+R_{2}\rho_{2}^{n}+\mathop{{O}{}}\big{(}r_{3}^{n}\big{)},

ρ

ρ

N\log N\,2^{\mathop{{O}{}}(\log^{*}N)}\cdot NM\log\bigl{(}NM\bigr{)}=N^{2}M(\log N)^{2}2^{\mathop{{O}{}}(\log^{*}N)}

N\log N\,2^{\mathop{{O}{}}(\log^{*}N)}\cdot NM\log\bigl{(}NM\bigr{)}=N^{2}M(\log N)^{2}2^{\mathop{{O}{}}(\log^{*}N)}

N lo g_{2} ρ + O (1)

N lo g_{2} ρ + O (1)

\mathbf{D}+\bigl{(}\log_{t}N+\mathop{{O}{}}(1)\bigr{)}\mathbf{M}+2\bigl{(}\log_{t}N+\mathop{{O}{}}(1)\bigr{)}\mathbf{A}+\mathop{{O}{}}(\log N)\mathbf{S}+\mathop{{O}{}}(\log N)\mathbf{O}

\mathbf{D}+\bigl{(}\log_{t}N+\mathop{{O}{}}(1)\bigr{)}\mathbf{M}+2\bigl{(}\log_{t}N+\mathop{{O}{}}(1)\bigr{)}\mathbf{A}+\mathop{{O}{}}(\log N)\mathbf{S}+\mathop{{O}{}}(\log N)\mathbf{O}

N (lo g N)^{2} 2^{O (l o g^{*} N)}

N (lo g N)^{2} 2^{O (l o g^{*} N)}

N^{2} (lo g N)^{4} 2^{O (l o g^{*} N)}

N^{2} (lo g N)^{4} 2^{O (l o g^{*} N)}

H (q) = n = 0 \sum \infty g_{t} (n) q^{n} = \frac{\sum _{j = 0}^{\infty} q ^{[j]} ( - 1 ) ^{j} \prod _{i = 1}^{j} \frac{q ^{[i]}}{1 - q ^{[i]}}}{\sum _{j = 0}^{\infty} ( - 1 ) ^{j} \prod _{i = 1}^{j} \frac{q ^{[i]}}{1 - q ^{[i]}}}

H (q) = n = 0 \sum \infty g_{t} (n) q^{n} = \frac{\sum _{j = 0}^{\infty} q ^{[j]} ( - 1 ) ^{j} \prod _{i = 1}^{j} \frac{q ^{[i]}}{1 - q ^{[i]}}}{\sum _{j = 0}^{\infty} ( - 1 ) ^{j} \prod _{i = 1}^{j} \frac{q ^{[i]}}{1 - q ^{[i]}}}

[j] : = 1 + t + \dots + t^{j - 1} .

[j] : = 1 + t + \dots + t^{j - 1} .

j \leq J = lo g_{t} N + O (1) .

j \leq J = lo g_{t} N + O (1) .

\sigma_{j}=\sum_{i=1}^{j}[i]=\frac{t^{j+1}-t}{(t-1)^{2}}-\frac{j}{t-1}=\frac{t^{j+1}}{(t-1)^{2}}\Bigl{(}1-\frac{j(t-1)}{t^{j+1}}-\frac{1}{t^{j}}\Bigr{)}

\sigma_{j}=\sum_{i=1}^{j}[i]=\frac{t^{j+1}-t}{(t-1)^{2}}-\frac{j}{t-1}=\frac{t^{j+1}}{(t-1)^{2}}\Bigl{(}1-\frac{j(t-1)}{t^{j+1}}-\frac{1}{t^{j}}\Bigr{)}

j-1+\log_{t}\Bigl{(}1-\frac{j(t-1)}{t^{j+1}}-\frac{1}{t^{j}}\Bigr{)}-2\log_{t}\Bigl{(}1-\frac{1}{t}\Bigr{)}<\log_{t}N.

j-1+\log_{t}\Bigl{(}1-\frac{j(t-1)}{t^{j+1}}-\frac{1}{t^{j}}\Bigr{)}-2\log_{t}\Bigl{(}1-\frac{1}{t}\Bigr{)}<\log_{t}N.

\frac{1}{1 - q ^{[1]}} \frac{1}{1 - q ^{[2]}} \frac{1}{1 - q ^{[3]}} \dots \frac{1}{1 - q ^{[j]}}

\frac{1}{1 - q ^{[1]}} \frac{1}{1 - q ^{[2]}} \frac{1}{1 - q ^{[3]}} \dots \frac{1}{1 - q ^{[j]}}

\frac{( lo g N ) ^{2}}{2 ( lo g 2 ) ( lo g t )} + O (lo g N)

\frac{( lo g N ) ^{2}}{2 ( lo g 2 ) ( lo g t )} + O (lo g N)

\bigg{\{}(a_{1},a_{2},\dots,a_{J})\in\mathbb{N}_{0}^{J}\colon\mathopen{}\sum_{i=1}^{J}a_{i}[i]=n\bigg{\}}.

\bigg{\{}(a_{1},a_{2},\dots,a_{J})\in\mathbb{N}_{0}^{J}\colon\mathopen{}\sum_{i=1}^{J}a_{i}[i]=n\bigg{\}}.

\frac{2 ^{J} N ^{J}}{[ 1 ] [ 2 ] \dots [ J ]} \leq \frac{2 ^{J} N ^{J}}{1 \cdot t \cdot t ^{2} \dots t ^{J - 1}} = \frac{2 ^{J} N ^{J}}{t ^{(J - 1) J /2}} .

\frac{2 ^{J} N ^{J}}{[ 1 ] [ 2 ] \dots [ J ]} \leq \frac{2 ^{J} N ^{J}}{1 \cdot t \cdot t ^{2} \dots t ^{J - 1}} = \frac{2 ^{J} N ^{J}}{t ^{(J - 1) J /2}} .

\frac{2 ^{J} N ^{J}}{t ^{(J - 1) J /2}}

\frac{2 ^{J} N ^{J}}{t ^{(J - 1) J /2}}

\displaystyle\leq\exp\Big{(}\frac{(\log N)^{2}}{\log t}-\frac{(\log N)^{2}}{2\log t}+\mathop{{O}{}}(\log N)\Big{)}

\exp\Big{(}\frac{(\log N)^{2}}{2\log t}+\mathop{{O}{}}(\log N)\Big{)}.

\exp\Big{(}\frac{(\log N)^{2}}{2\log t}+\mathop{{O}{}}(\log N)\Big{)}.

i = 1 \prod j \frac{q ^{[i]}}{1 - q ^{[i]}},

i = 1 \prod j \frac{q ^{[i]}}{1 - q ^{[i]}},

i = 1 \prod j \frac{q ^{[i]}}{1 - q ^{[i]}},

i = 1 \prod j \frac{q ^{[i]}}{1 - q ^{[i]}},

\frac{( lo g N ) ^{2}}{2 ( lo g 2 ) ( lo g t )} + O (lo g N)

\frac{( lo g N ) ^{2}}{2 ( lo g 2 ) ( lo g t )} + O (lo g N)

N lo g N 2^{O (l o g^{*} N)} \cdot (lo g N)^{2} lo g lo g N = N (lo g N)^{3} lo g lo g N 2^{O (l o g^{*} N)}

N lo g N 2^{O (l o g^{*} N)} \cdot (lo g N)^{2} lo g lo g N = N (lo g N)^{3} lo g lo g N 2^{O (l o g^{*} N)}

\mathop{{O}{}}\big{(}N\big{)}\mathop{{O}{}}\big{(}(\log N)^{2}\big{)}=\mathop{{O}{}}\big{(}N(\log N)^{2}\big{)}

\mathop{{O}{}}\big{(}N\big{)}\mathop{{O}{}}\big{(}(\log N)^{2}\big{)}=\mathop{{O}{}}\big{(}N(\log N)^{2}\big{)}

J\bigl{(}\mathbf{M}+2\mathbf{A}+\mathop{{O}{}}(1)\mathbf{S}+\mathop{{O}{}}(1)\mathbf{O}\bigr{)}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.com/dakrenn/count-nonequivalent-compact-huffman-codes
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Algorithmic counting of nonequivalent compact Huffman codes

Christian Elsholtz

Institute of Analysis and Number Theory

Graz University of Technology

Kopernikusgasse 24, A-8010 Graz, Austria

[email protected]

,

Clemens Heuberger

Department of Mathematics

Alpen-Adria-Universität Klagenfurt

Universitätsstraße 65–67, A-9020 Klagenfurt am Wörthersee, Austria

[email protected]

and

Daniel Krenn

Department of Mathematics

Paris Lodron University of Salzburg

Hellbrunnerstraße 34, A-5020 Salzburg, Austria

[email protected]or[email protected]

Abstract.

It is known that the following five counting problems lead to the same integer sequence $\mathop{{#1}{}}(n)$ :

(1)

the number of nonequivalent compact Huffman codes of length $n$ over an alphabet of $t$ letters, 2. (2)

the number of “nonequivalent” complete rooted $t$ -ary trees (level-greedy trees) with $n$ leaves, 3. (3)

the number of “proper” words (in the sense of Even and Lempel), 4. (4)

the number of bounded degree sequences (in the sense of Komlós, Moser, and Nemetz), and 5. (5)

the number of ways of writing

[TABLE]

with integers $0\leq x_{1}\leq x_{2}\leq\dots\leq x_{n}$ .

In this work, we show that one can compute this sequence for all $n<N$ with essentially one power series division. In total we need at most $N^{1+\varepsilon}$ additions and multiplications of integers of $cN$ bits (for a positive constant $c<1$ depending on $t$ only) or $N^{2+\varepsilon}$ bit operations, respectively, for any $\varepsilon>0$ . This improves an earlier bound by Even and Lempel who needed $\mathop{{O}{}}(N^{3})$ operations in the integer ring or $\mathop{{O}{}}(N^{4})$ bit operations, respectively.

Key words and phrases:

unit fractions, Huffman codes, $t$ -ary trees, counting, generating function

2020 Mathematics Subject Classification:

05A15; 05C05, 05C30, 11D68, 68P30

C. Elsholtz is supported by the Austrian Science Fund (FWF): W1230 and by Project Arithrand of the Austrian Science Fund (FWF): I 4945-N and of ANR-20-CE91-0006. C. Heuberger and D. Krenn are supported by the Austrian Science Fund (FWF): P28466-N35.

1. Introduction

Motivation

The purpose of this paper is to study the complexity of a counting problem, namely determining the number of nonequivalent compact Huffman codes of length $n$ over an alphabet of $t$ letters, and several equivalent combinatorial or number theoretic objects; see below and in particular (1.1) for a precise definition. The fastest algorithm in the published literature is due to Even and Lempel [10] (1972) and has a complexity of $\mathop{{O}{}}(N^{3})$ operations in the ring of integers.

When actually computing the number of such compact Huffman codes, we experimentally observed that an approach of evaluating a generating function—this generating function was first studied by Flajolet and Prodinger [11] (1987)—appears to be very fast. A detailed analysis (see Theorem 1) shows that the complexity is indeed only $\mathop{{O}{}}(N^{1+\varepsilon})$ (for any $\varepsilon>0$ ) additions and multiplications of integers of size $cN$ bits (see (3.1)), where $c<1$ is a positive constant depending on $t$ only.

In this paper, we will first describe the different but equivalent objects that we count and then present a quite detailed analysis of computing the number of these objects.

Codes, unit fractions and more

For a fixed integer $t\geq 2$ , Elsholtz, Heuberger and Prodinger [9] studied the number

[TABLE]

$r\geq 0$ , i. e. the number of partitions of $1$ into nonpositive powers of $t$ . It is known that this counting problem is equivalent to several other counting problems, namely the number of “nonequivalent” complete rooted $t$ -ary trees (also called “level-greedy trees”; see [9, 11, 18]), the number of “proper words” (in the sense of Even and Lempel [10]), the number of bounded degree sequences (in the sense of Komlós, Moser, and Nemetz [27]), and the number of nonequivalent compact Huffman codes111A Huffman code over an alphabet of $t$ letters is a prefix-free subset (the set of “code words”) of the set of finite words over this alphabet, i.e., no code word is a prefix of another code word. It is said to be compact if no further code word can be added without violating the prefix-freeness condition. Two compact Huffman codes are considered to be equivalent if the multisets of the lengths of the code words are equal, and one can choose a representative where shorter words are lexicographically smaller than longer words.

of length $r$ over an alphabet of $t$ letters. For a detailed survey on the existing results, applications and literature on these sequences; see [9]. As a small concrete example, we note that for $t=2$ , $r=5$ we have $f_{2}(5)=3$ , as can be seen from working out the following:

[TABLE]

As discussed in [9], $\mathop{{#1}{}}(r)$ is positive only when $r=1+n(t-1)$ , so it is more convenient to study $\mathop{{#1}{}}(n)=f_{t}(1+n(t-1))$ instead. For $t=2$ the values of $\mathop{{#1}{}}(n)$ start with

[TABLE]

for $t=3$ with

[TABLE]

and the first terms of $\mathop{{#1}{}}(n)$ are

[TABLE]

These are sequences A002572, A176485 and A176503 in the On-Line Encyclopedia of Integer Sequences [30].

Asymptotics

It has been proved (see Elsholtz, Heuberger, Prodinger [9]) that for fixed $t$ , the asymptotic growth of these sequences can be described by two main terms and an error term as

[TABLE]

where $1<r_{3}<\rho_{2}<\rho<2$ . Here all constants depend on $t$ . In particular, if $t=2$ , then

[TABLE]

Moreover, the authors of [9] also show that $\rho=2-2^{-t-1}+\mathop{{O}{}}\big{(}t\,4^{-t}\big{)}$ as $t\to\infty$ .

Beside the enumeration of all these objects, probabilistic questions concerning many different parameters have been studied asymptotically in [18, 19].

Algorithmic counting

As this family of sequences appears in many different contexts and as the sequences’ growth rates have been studied in detail (see the section above and the introduction of [9] for full details), it is somehow surprising that the current record on the algorithmic complexity of determining the members of the sequence (in the case $t=2$ ) appears to be a 50 years old paper by Even and Lempel [10]. Hence it seemed worthwhile to study this complexity from a new point of view and we thus succeeded to improve the upper bound complexity considerably; see section below.

The algorithm of Even and Lempel [10] produces the sequence $\mathop{{#1}{}}(n)$ for $n<N$ . It takes $\mathop{{O}{}}(N^{3})$ additions of integers bounded by $\mathop{{O}{}}(\rho^{N})$ (with $\rho<2$ ; so integers with roughly $N$ bits in size), which are $\mathop{{O}{}}(N^{4})$ bit operations. They only studied the case $t=2$ in detail, but mention that their result can be generalized to arbitrary $t$ .

Main result

In this paper we take an entirely new approach to the problem of evaluating $g_{t}(n)$ . Rather than thinking about an algorithm itself, as Even and Lempel [10] did, we think about how to evaluate the generating function (3.4) of $g_{t}(n)$ established in [9] efficiently. As it turns out the cost essentially comes from one division of power series of precision222We say that a power series $H$ has precision $N$ if we can write it as $H(q)=\sum_{n=0}^{N-1}h_{n}q^{n}+\mathop{{O}{}}(q^{N})$ with explicit coefficients $h_{n}$ . $N$ whose coefficients are integers bounded by $\mathop{{O}{}}(\rho^{N})$ (with $\rho<2$ ).

Estimating the cost of this evaluation strategy leads to tremendous improvement—to be precise, by a factor $N^{2}$ in both ring operations and bit operations—of the cost of using [10]. It is not obvious that the cost for evaluation of numerator and denominator of the generating function are asymptotically (much) smaller than the total cost; see Theorem 1 for details and also Section 5 providing even more details during the proof of this theorem. We in particular show that the cost for evaluating numerator and denominator are asymptotically almost (neglecting logarithmic factors) by a factor $N$ smaller.

Using the multiplication algorithms of Schönhage and Strassen [32], of Fürer [13, 14], or of Harvey and van der Hoeven [17] (see Section 2 for an overview) our algorithm leads to $N(\log N)^{2}\,2^{\mathop{{O}{}}(\log^{*}N)}$ operations in the integer ring and consequently $N^{2}(\log N)^{4}\,2^{\mathop{{O}{}}(\log^{*}N)}$ bit operations, where $\log^{*}N$ denotes the iterated logarithm.333The iterated logarithm (also called log star) gives the number of applications of the logarithm so that the result is at most $1$ . For example, we can define it recursively by $\log^{*}M=1+\log^{*}(\log M)$ if $M>1$ and $\log^{*}M=0$ otherwise. In Remark 6.2 a discussion on the memory requirements can be found. An implementation of this algorithm, based on FLINT [12, 16] (which is, for example, included in the SageMath mathematics software [31]) is also available;444The code accompanying this article can be downloaded from https://gitlab.com/dakrenn/count-nonequivalent-compact-huffman-codes. see also Appendix A for the relevant lines of code and remarks related to the implementation. In Appendix B, we discuss the running times of this implementation.

The literature describes a number of algorithms constructing the complete list of $t$ -ary Huffman codes of length $r=1+n(t-1)$ ; see [20, 24, 28, 29]. There is no performance analysis given. But, as the number of such codes grows exponentially in $r$ it is clear that listing all codes is not a fast method to determine the number of such codes only. The algorithm by Even and Lempel [10] computes the number $f_{2}(n)$ without listing all codes, and is to the best of our knowledge the fastest algorithm previously known. Our algorithm relies on calculations involving power series with large integer coefficients.

It should also be emphasized that the output $f_{t}(n)$ of the algorithm grows exponentially in $n$ (this was mentioned above), therefore the number of bits to represent $f_{t}(n)$ is linear in $n$ whereas the input is only logarithmic in $n$ . The quite general survey paper by Klazar [25] studies classes of problems where the output needs at most a polynomial number of steps, in terms of the combined size of input and output. As we can compute $f_{t}(n)$ efficiently, this problem falls into the class considered by Klazar.

Notes

It should be pointed out that in this article, we derive and compare upper bounds. It might be that the actual cost are smaller. However, as we compute the first $N$ coefficients all at the same time and the coefficients grow exponentially in $N$ , a lower bound for the number of bit operations necessarily contains a factor $N^{2}$ . Moreover, as multiplication of some sort is involved, lower order factors (growing with $N$ ) are expected as well.

We also mention that the following is open: How fast can a single coefficient $\mathop{{#1}{}}(n)$ (in contrast to all coefficients with $n<N$ ) be computed?

2. Cost of the underlying operations

In this section, we give a brief overview on the time requirements for performing addition and multiplication of two integers and for performing multiplication and division of power series. The current state of the art is also summarized in Table 2.1.

Addition and multiplication

First, assume that we want to perform addition of two numbers bounded by $2^{M}$ , i.e., numbers with $M$ bits. We have to look at each bit of the numbers exactly once and add those (maybe with a carry). Therefore, we need $\mathop{{O}{}}(M)$ bit operations.

Next, we look at multiplication of two numbers bounded by $2^{M}$ . It is clear that this can be achieved with $\mathop{{O}{}}(M^{2})$ operations, but it can be done better. An overview is given in the survey article by Karatsuba [21]. The Karatsuba multiplication algorithm [22, 23] has a complexity of $\mathop{{O}{}}(M^{\log_{2}3})$ . A faster generalisation of it is the Toom–Cook-algorithm [4]. Combining Karatsuba multiplication with the Fast Fourier Transform algorithm (see Cooley and Tukey [5]) gives an algorithm with bit complexity $\mathop{{O}{}}(M(\log M)^{(2+\varepsilon)})$ ; see [1, 2, 3, 26].

The multiplication algorithm given by Schönhage and Strassen (see [32]) takes $\mathop{{O}{}}(M\log M\log\log M)$ time. It also uses fast Fourier transform. An asymptotically even faster multiplication algorithm is given by Fürer [13, 14]. It has computational complexity $M\log M\,2^{\mathop{{O}{}}(\log^{*}M)}$ , where we again denote the iterated logarithm by $\log^{*}M$ . Fürer’s algorithm uses complex arithmetic. A related algorithm of the same complexity but using modular arithmetic is due to De, Kurur, Saha and Saptharishi [7, 8].

The asymptotically fastest known multiplication algorithm is due to Harvey and van der Hoeven [17]; it has a computational complexity of $\mathop{{O}{}}(M\log M)$ .

Power series operations

Let us also summarize the complexity of power series computations; for references see the books of Cormen, Leiserson, Rivest and Stein [6] or Knuth [26]. The multiplication can, again, be speeded up by using fast Fourier transform. We can use the algorithms for integer multiplication presented above; see von zur Gathen and Gerhard [33]. Also, the computational complexity can be improved: Given power series with precision $N$ (i.e., the first $N$ terms) over a ring, we can perform multiplication with $N\log N\,2^{\mathop{{O}{}}(\log^{*}N)}$ ring operations using Fürer’s algorithm.

In order to perform division (inversion) of power series with precision $N$ , we can use the Newton–Raphson-method. We need at most $4\mathop{{#1}{}}(N)+N$ ring operations, where $\mathop{{#1}{}}(N)$ denotes the number of operations needed to multiply two power series with precision $N$ ; see von zur Gathen and Gerhard [33, Theorem 9.4] for details; the additional summand $\mathop{{#1}{}}(N)$ in comparison to that theorem comes from the multiplication with the numerator. Therefore, by using Fürer’s algorithm, we can invert/divide with $N\log N\,2^{\mathop{{O}{}}(\log^{*}N)}$ ring operations.

The bit size occuring in the ring operations for a division of power series with precision $N$ and coefficients of bit size $M$ is $NM$ by the remarks after [33, Theorem 9.6]. Therefore and by assuming $M=\mathop{{O}{}}(N)$ for simpler expressions with respect to the logarithms, we end up with

[TABLE]

bit operations.

3. Cost for extracting coefficients

Our main result gives the number of operations needed for extracting the coefficients $\mathop{{#1}{}}(n)$ for all $n<N$ . It reflects three different aspects: First, we count operations on a high level, for example power series multiplications. (Below we will denote this operation by $\mathbf{M}$ .) Second, we count operations in the ring of integers. There, to stick with the example on power series multiplication, the precision of the power series is taken into account, but not the actual size of the integer. Finally and third, we count bit operations, where also the size of the coefficients (which are integers) is taken into account.

Let us make this more precise and start with the high level operations. We denote

•

an addition (or a subtraction) of two power series by $\mathbf{A}$ ,

•

a multiplication of two power series by $\mathbf{M}$ , and

•

a division of two power series by $\mathbf{D}$ .

As we compute the first $N$ terms, we may assume that all power series are of precision $N$ .

An overview and summary of the number of ring operations and bit operations of these high level operations is provided in Section 2. Clearly, we have to deal with the size of the coefficients. We first note that for $n<N$ each coefficient $\mathop{{#1}{}}(n)$ can be written with $M\colonequals\lfloor\log_{2}\mathop{{#1}{}}(N)\rfloor+1$ bits and that by using the asymptotics (1.2) we can bound this by

[TABLE]

when $N$ tends to $\infty$ . Here the constant $\rho<2$ depends on $t$ ; see [9] for details on $\rho$ .

Summarizing, all the operations $\mathbf{A}$ , $\mathbf{M}$ and $\mathbf{D}$ are performed on power series of precision $N$ with coefficients written by $M$ bits (numbers bounded by $2^{M}$ ), and the cost (number of bit operations) are stated in Section 2. There is one important remark at this point, namely, we will see during our main proof (Section 5) that the coefficients appearing in power series additions and multiplications are actually much smaller than coefficients written by $M$ bits; we will take this into account for counting bit operations.

Beside these main power series operations, we additionally denote

•

other power series operations of precision $N$ (for example, memory allocation or writing initial values) by $\mathbf{S}$ , and

•

other operations, more precisely operations of numbers with less than $\log_{2}N$ bits (for example additions of indices) by $\mathbf{O}$ .

Thus, an operation $\mathbf{O}$ is performed on numbers bounded by $N$ only (in contrast to the bounded-by- $2^{M}$ -operations).

With these notions and by collecting operations as formal sums of $\mathbf{A}$ , $\mathbf{M}$ , $\mathbf{D}$ , $\mathbf{S}$ and $\mathbf{O}$ , we can write down the precise formulation of our main theorem.

Theorem 1.

Calculating the first $N$ terms of $\mathop{{#1}{}}(n)$ can be done with

[TABLE]

power series operations,

[TABLE]

operations in the ring of integers, and with

[TABLE]

bit operations.

In order to prove Theorem 1—the complete proof can be found in Section 5,—we look at the cost of calculating the first $N$ terms, which is done by extracting coefficients of the power series

[TABLE]

with

[TABLE]

This generating function (3.4) can be found in Flajolet and Prodinger [11, Theorem 2] for $t=2$ and in Elsholtz, Heuberger and Prodinger [9, Theorem 6] for general $t$ . It is derived from the equivalent formulation as counting problem on trees, which was mentioned in the introduction.

4. Auxiliary results

When extracting the first $N$ coefficients, we do not need the “full” generating function, i.e., the infinite sums in the numerator and denominator of (3.4) can be truncated to finite sums. The following lemma tells us how many coefficients we need. We use this asymptotic result in our analysis of the algorithm; for the actual computer programme, we can check indices and exponents by a direct computation.

Lemma 4.1.

To calculate numerator and denominator of the generating function (3.4) with precision $N$ , we need only summands with

[TABLE]

Proof.

Because of an additional factor $q^{[j]}$ in each summand of the numerator, it is sufficient that the largest index of the denominator is less than $N$ . Therefore, we will only look at the indices of the denominator.

Consider the summand of the denominator with index $j$ . The lowest index of a non-zero coefficient of the denominator is

[TABLE]

where the notation $[i]$ is defined in Equation (3.5). We only need summands with $\sigma_{j}<N$ . Taking the logarithm yields

[TABLE]

As the first logarithm tends to [math] as $j\to\infty$ and the second is bounded, the error term $\mathop{{O}{}}(1)$ is large enough and the result follows. ∎

While the bit size of the coefficients $\mathop{{#1}{}}(n)$ is linear in $N$ , the size of the coefficients of numerator and denominator of (3.4) is much smaller. We make this precise by using the following lemma.

Lemma 4.2.

For $n\leq N$ , the $n$ th coefficient of

[TABLE]

with $j\leq J$ and $J$ of Lemma 4.1 as well as the $n$ th coefficients of numerator and denominator of (3.4) can be written with

[TABLE]

bits.

Proof.

We start proving the claimed result for (4.1) and postpone handling numerator and denominator of (3.4) to the end of this proof.

Each factor of (4.1) is a geometric series whose coefficients are either [math] or $1$ and whose constant coefficient is $1$ . In particular, these coefficients are nonnegative. Therefore, it suffices to show the result for $j=J$ .

As the coefficients are either [math] or $1$ , the $n$ th coefficient of the product equals the cardinality of the set

[TABLE]

By using the crude estimate $a_{i}\leq n/[i]$ , we see that we have at most $2N/[i]$ choices for $a_{i}$ because $n<N$ and $[i]<N$ by construction. Thus we can bound the cardinality of the set above by

[TABLE]

We use $J=\log_{t}N+\mathop{{O}{}}(1)$ of Lemma 4.1 to obtain

[TABLE]

from which follows that the $n$ th coefficient of (4.1) is bounded by

[TABLE]

The result in terms of bit size follows by taking the logarithm.

Numerator and denominator are sums where $J$ summands are added up (or subtracted). This corresponds to an additional factor $J$ in the bound (4.3) or an additional summand $\log_{2}J$ in the formula (4.2), respectively. As $J=\mathop{{O}{}}(\log_{t}N)$ by Lemma 4.1, this is absorbed by the error term, so the same formula holds. ∎

5. Proof of Theorem 1

We start with an overview of our strategy. For computing the first $N$ coefficients of the generating function $H(q)$ (see (3.4)), we only need the summands of the numerator and the denominator with $j<J$ according to Lemma 4.1.

First, consider the denominator of $H(q)$ . We compute the products

[TABLE]

iteratively by expanding the $J$ different terms $q^{[i]}/(1-q^{[i]})$ as geometric series and perform power series multiplications. After each multiplication, we accumulate the result by using one power series addition.

We deal with the numerator in the same fashion. However, by performing the computation of numerator and denominator simultaneously, the above products only need to be evaluated once.

Finally, to obtain the first $N$ coefficients of $H(q)$ , we need one power series division of numerator and denominator.

Pseudocode for our algorithm is given in Algorithm 1; an efficient implementation using the FLINT library is presented in Appendix A. The actual analysis of this algorithm is done by counting the operations needed, in particular the power series operations, and providing bounds for the bit sizes of the variables.

Let us come to the actual proof.

Proof of Theorem 1.

We analyse the code of Algorithm 1; see Appendix A for the details. It starts by initialising variables (memory allocation and initial values) for the power series operations, which contributes $\mathop{{O}{}}(1)\mathbf{S}$ . Further initialisation is done by $\mathop{{O}{}}(1)\mathbf{O}$ operations.

For computing the first $N$ coefficients of $H(q)$ (see (3.4)), we only need the summands of numerator and denominator with $j<J=\log_{t}N+\mathop{{O}{}}(1)$ according to Lemma 4.1. Speaking in terms of our computer programme, our outer loop needs $J$ passes. We now describe what happens in each of these passes; the final cost needs then to be multiplied by $J$ .

Suppose we are in step $j$ . After some update of auxiliary variables (needing $\mathop{{O}{}}(1)\mathbf{O}$ operations), we compute the product

[TABLE]

out of the product with factors up to index $i=j-1$ . Expanding $q^{[j]}/(1-q^{[j]})$ as geometric series contributes at most $\mathbf{S}$ and performing a power series multiplication contributes $\mathbf{M}$ and additionally one swap $\mathop{{O}{}}(1)\mathbf{S}$ . For obtaining the number of bit operations, we need estimates of the coefficients appearing in the multiplication. Lemma 4.2 bounds their value by

[TABLE]

bits. Therefore each of our power series multiplications $\mathbf{M}$ needs

[TABLE]

bit operations by the results of Fürer [13, 14] and Harvey and van der Hoeven [17]; see also Section 2.

After each multiplication, we accumulate the results for numerator and denominator by using one power series addition $\mathbf{A}$ for each of the two. For the numerator, we additionally need $\mathop{{O}{}}(1)\mathbf{S}$ operations for the multiplication by $q^{[j]}$ performed by shifting. Concerning bit operations, we use the bound of the coefficients for numerator and denominator provided by Lemma 4.2. In terms of bit size, this leads to the number of bits given in (5.1). Therefore a power series addition needs

[TABLE]

bit operations.

In total, we end up with

[TABLE]

operations to evaluate the outer loop; these operations translate to

[TABLE]

operations in the ring of integers and to

[TABLE]

bit operations.

We are now ready to collect all costs for proving the first part of Theorem 1. Additionally to the above, we divide the numerator by the denominator and need one power series division $\mathbf{D}$ . The clean-up accounts to $\mathop{{O}{}}(1)\mathbf{S}$ . This yields (3.2).

Using the Newton–Raphson-method and Fürer’s algorithm (see Section 2 and Table 2.1) a power series division $\mathbf{D}$ results in

[TABLE]

operations in the ring. Its operands555The actual bit size during the division is $\frac{N(\log N)^{2}}{2(\log 2)(\log t)}+\mathop{{O}{}}(N\log N)$ ; see the end of Section 2 for details. have bit size

[TABLE]

which results in

[TABLE]

bit operations for our computations.

We note that the number of bit operations of a power series operation $\mathbf{S}$ is linear in $N$ as the coefficients are bounded and that $\mathbf{O}$ is an operation on numbers with $\mathop{{O}{}}(\log N)$ bits. The error term includes all these. Collecting all bit operation results gives the upper bound (3.3). ∎

6. Remarks

In this last section, we provide some remarks related to the above proof and coefficient extraction algorithm.

*Remark 6.1**.*

In the proof above, we have seen that the cost (bit operations) of the power series division is asymptotically roughly (not taking logarithms and smaller factors into account) a factor $N$ larger than the cost for computing numerator and denominator, and all the overhead cost.

Moreover, only focusing on the computation of numerator and denominator, the costs (again bit operations) for computing these two are asymptotically dominated by power series multiplication, albeit only by roughly (again not taking into account logarithmically smaller factors) a factor $\log N$ compared to addition and other power series operations.

Note that when only considering operations in the integer ring, then the multiplications performed in the evaluation of numerator and denominator take the asymptotically leading role by $N(\log N)^{2}\,2^{\mathop{{O}{}}(\log^{*}N)}$ operations compared to $N(\log N)\,2^{\mathop{{O}{}}(\log^{*}N)}$ ring operations of the power series division.

At the end of this article we make a short remark on the memory requirements for the presented coefficient extraction algorithm.

*Remark 6.2**.*

Our algorithm needs $\mathop{{O}{}}(N)$ units of memory—a unit stands for the memory requirements of storing a number bounded by $\rho^{N}$ —plus the memory needed for the power series multiplication and division.666We have been unable to find a reference for the memory requirements of, for example, the Schönhage–Strassen-algorithm. It seems that the GNU Multiple Precision Arithmetic Library (GMP) can do this with $12N$ units of memory; see [15] for a comment of one of its authors. The above means that we can bound the memory requirements by $\mathop{{O}{}}(N^{2})$ bits.

Appendix A Code

Below are the relevant lines of a programme written in C for computing the coefficients $\mathop{{#1}{}}(n)$ with $n<N$ . The code can be found at https://gitlab.com/dakrenn/count-nonequivalent-compact-huffman-codes. The programme uses FLINT [12, 16]. Note that we do not use aliasing of input and output arguments in multiplication because providing our own auxiliary polynomial brings tiny performance improvements.

Appendix B Timing

The table below contains timings (in seconds) for computing the first $N$ coefficients with $t=2$ .

[TABLE]

Here, $t_{\mathrm{n\&d}}$ is the time for generating numerator and denominator, $t_{\mathrm{division}}$ for the one power series division and $t_{\mathrm{total}}=t_{\mathrm{n\&d}}+t_{\mathrm{division}}$ .

The benchmark was executed on an Intel(R) Xeon(R) CPU E5-2630 v3 at 2.40GHz. The limiting factor for our computations is the memory requirement; it is the reason computing at most $N=2^{17}=131072$ coefficients.

The timings in the table and the theoretical result of this article fit together; we can see the $\mathop{{O}{}}(N^{2+\varepsilon})$ running time of the algorithm in our implementation.

Bibliography33

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Allan Borodin and Ian Munro, The computational complexity of algebraic and numeric problems , American Elsevier Publishing Co., Inc., New York-London-Amsterdam, 1975, Elsevier Computer Science Library; Theory of Computation Series, No. 1.
2[2] Jonathan M. Borwein, Peter B. Borwein, and David H. Bailey, Ramanujan, modular equations, and approximations to pi, or how to compute one billion digits of pi , Amer. Math. Monthly 96 (1989), 201–219.
3[3] E. Oran Brigham, The Fast Fourier transform , Prentice-Hall, Englewood Cliffs, NJ, 1974.
4[4] Stephen A. Cook, On the minimum computation time of functions , Ph.D. thesis, Harvard University, 1966.
5[5] James William Cooley and Tukey John Wilder, An algorithm for the machine calculation of complex Fourier series , Math. Comput. 19 (1965), 297–301.
6[6] Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein, Introduction to algorithms , second ed., The MIT Press, 2001.
7[7] Anindya De, Piyush P. Kurur, Chandan Saha, and Ramprasad Saptharishi, Fast integer multiplication using modular arithmetic , STOC’08: Proceedings of the fortieth annual ACM symposium on Theory of computing, ACM, New York, 2008, pp. 499–505. · doi ↗
8[8] Anindya De, Piyush P. Kurur, Chandan Saha, and Ramprasad Saptharishi, Fast integer multiplication using modular arithmetic , SIAM J. Comput. 42 (2013), no. 2, 685–699. · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Code & Models

Videos

Algorithmic counting of nonequivalent compact Huffman codes

Abstract.

Key words and phrases:

2020 Mathematics Subject Classification:

1. Introduction

Motivation

Codes, unit fractions and more

Asymptotics

Algorithmic counting

Main result

Notes

2. Cost of the underlying operations

Addition and multiplication

Power series operations

3. Cost for extracting coefficients

Theorem 1**.**

4. Auxiliary results

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

5. Proof of Theorem 1

Proof of Theorem 1.

6. Remarks

Remark 6.1*.*

Remark 6.2*.*

Appendix A Code

Appendix B Timing

Theorem 1.

Lemma 4.1.

Lemma 4.2.

*Remark 6.1**.*

*Remark 6.2**.*