On multisets, interpolated multiple zeta values and limit laws
Markus Kuba

TL;DR
This paper explores a parameter on weighted multisets, connecting it to multiple zeta values and deriving new identities, while also analyzing the distribution and limit laws of associated random variables.
Contribution
It introduces a new parameter on multisets, extends identities for interpolated multiple zeta values, and studies their probabilistic limit laws using symbolic combinatorics.
Findings
Derived new identities for interpolated multiple zeta values.
Established distributional properties of random variables on multisets.
Proved limit laws as parameters grow large.
Abstract
In this work we discuss a parameter on weighted -element multisets of . The sums of weighted -multisets are related to -subsets, -multisets, as well as special instances of truncated interpolated multiple zeta values. We study properties of this parameter using symbolic combinatorics. We rederive and extend certain identities for . Moreover, we introduce random variables on the -element multisets and derive their distributions, as well as limit laws for or tending to infinity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Identities · Analytic Number Theory Research · Advanced Combinatorial Mathematics
On multisets, interpolated multiple zeta values and limit laws
Markus Kuba
Abstract.
In this work we discuss a parameter on weighted -element multisets of . The sums of weighted -multisets are related to -subsets, -multisets, as well as special instances of truncated interpolated multiple zeta values. We study properties of this parameter using symbolic combinatorics. We (re)derive and extend certain identities for . Moreover, we introduce random variables on the -element multisets and derive their distributions, as well as limit laws for or tending to infinity.
Key words and phrases:
-multisets, -subsets, truncated multiple zeta values, interpolated multiple zeta values, harmonic numbers, limit laws.
2010 Mathematics Subject Classification:
60C05, 11M32.
1. Introduction
The multiple zeta values are defined as
[TABLE]
with admissible indices satisfying , for , see Hoffman [12] or Zagier [39]. Their truncated counterpart, sometimes called multiple harmonic sum, is given by
[TABLE]
We refer to as the weight of this multiple zeta value, and as its depth. For a comprehensive overview as well as a great many pointers to the literature we refer to the survey of Zudilin [40]. An important variant of the (truncated) multiple zeta values are the so-called multiple zeta star values, where equality is allowed:
[TABLE]
their truncated counterpart is defined analogous. On the combinatorial side, it is well known that two special cases, namely the series and occur in a multitude of different places. See for example [28] and the references therein for different representation of the two series. The value is closely related to the Stirling numbers of the first kind, and is related to alternating binomial sums. Here and throughout this work means repeated times. For non-truncated series and , Yamamoto [38] introduced a generalization of both versions called interpolated multiple zeta values. Noting that,
[TABLE]
let the parameter denote the number of plus in the expression . Yamamoto defines
[TABLE]
Thus, the series interpolates between multiple zeta values, case , and multiple zeta star values, case . It turned out that the interpolated series satisfies many identities generalizing or unifying earlier result for multiple zeta and zeta star values, see for example Yamamoto [38], Hoffman and Ihara [16] or Hoffman [13, 15]. In particular, a so-called quasi-shuffle product [13, 15, 16, 38], sometimes also called stuffle product or harmonic shuffle product, can be defined for the interpolated zeta values. Hoffman and Ihara used an algebra framework, which leads amongst others to expressions for interpolated multiple zeta values in terms of Bell polynomials and ordinary single argument zeta values, . We discuss and (re)derive in this work the results for and using symbolic combinatorics. Moreover, we relate and extend certain multiple harmonic sums identities [33, 34] to .
We study in particular the interpolated truncated multiple zeta values: we are interested in the truncated series
[TABLE]
and their properties. Note that interpolates between and , and , respectively.
On the other hand, classical combinatorial problems are the enumeration of -element multisets, short -multisets, of , leading to the multiset coefficients, and the enumeration of -element sets, short -subsets, of leading to the binomial coefficients, .
The goal of this work is to unify these two topics, namely interpolated (truncated) multiple zeta values and the enumeration of -subsets and -multisets. We study -multisets of and introduce a weighted enumeration of -multisets of in terms of a given weight sequence . Our weighted enumeration also takes into account an additional parameter . It is defined similarly to Yamamoto [38]: counts the number of equalities in the nested sum representation, corresponding to the number of plus signs in (1) or (2). The weighted enumeration also has an algebraic interpretation, connecting elementary symmetric functions and complete symmetric functions ; moreover power sum also appear in an alternative representation.
In the next section we introduce our weighted enumeration of -multisets, given an arbitrary sequence of positive reals, . Closely related ideas have been considered by Vignat and Takhare [37], who considered (non-truncated) generalized multiple zeta values and infinity products, with given by the zeros of certain special functions. Moreover, Bachmann very briefly discussed related ideas at the end of his article [1]. Also related are the partition zeta functions introduced by Schneider [35].
We study two aspects of the weighted enumerations: first, its relation to -sets and -multisets and to the interpolated (truncated) multiple zeta function, leading to our main result in Theorems 2, and second, the probabilistic aspects of . While the algebraic and number theoretic aspects of interpolated multiple zeta values have been studied in the literature, no studies so far have been conducted on its stochastic properties concerning structures of large size and its distribution, to the best of the author’s knowledge. Questions such as ”what is the expected value of the parameter ” or ”what is the limit law of ” have not been considered before.
Introducing suitably defined random variables , we study the distribution of , as well as limit laws for or tending to infinity. In particular, we show that the parameter in -multisets follows a hypergeometric distribution and derive Poisson and Gaussian limit laws. For we show that also different limit laws occur. In a later section we introduce refinements of the parameter an also discuss related random variables. It turns out that the random variables , its refinements and its limit laws are closely related to many quantities studied earlier in combinatorial probability theory: maxima in hypercubes [2, 22], unsuccessfull search in binary search tree [6], descendants in increasing trees [24], edge-weighted increasing trees [25, 29], distances in increasing trees [5, 6], leaf-isolation procedures in random trees [26], as well as asymptotics of the Poisson distribution [6, 21].
Throughout this work use the notation for the th harmonic number, for the th generalized harmonic number of order . We denote with a standard normal distributed random variable, with a Poisson distributed random variable with parameter , and with a hypergeometric distributed random variable with parameters . For the reader’s convenience a few basic facts about the probability distributions, appearing later in our analysis, are collected in the appendix of this work. We denote with the th falling factorial, , , with . Furthermore, we use the abbreviation for the evaluation operator at . Moreover, we denote by the equality in distribution of the random variables and , and by convergence in distribution of the sequence of random variables to a random variable . We denote with the set of ordered partitions of the integer , often also called compositions of the integer , and the length of a partition , defined as the number of its summands. E.g., for we have and
[TABLE]
Concerning length of partitions, we have and .
2. Interpolated weighted Multisets and parameter sigma
2.1. Parameter sigma
We consider -multisets of . Strictly speaking, we do not directly consider multisets. Instead, we induce an increasing order on the distinct elements in the multisets and consider the ordered sequences. For example, the unordered multiset corresponds to the increasing sequence . Let denote the set
[TABLE]
such that each corresponds to a -multiset of . In a slight abuse of notation, we denote with the length of , in other words the cardinality of the corresponding multiset. The cardinality of is given by the multiset coefficients, counting the number of -multisets of :
[TABLE]
Closely related is the set
[TABLE]
whose cardinality is given by binomial coefficients, counting the number of -sets of :
[TABLE]
In the following we introduce the parameter for elements .
Definition 1**.**
Let (3). Then is defined as the number of elements of equal to the preceding one in , or in other words
[TABLE]
Example 1**.**
Given then . Given then .
Example 2**.**
The parameter relates and , as defined in (3), (4), respectively:
[TABLE]
Next we define multiplicative weights for and the weighted enumerations of (3).
Definition 2** (Interpolated weighted multiset sums).**
Given an the weight sequence of positive real numbers . Then, the weight of is defined in terms of as and is defined as
[TABLE]
Note that we use the convention in the boundary case .
Remark 1**.**
The construction is also well defined for
[TABLE]
as long as the weight sequence is chosen in such a way that converges. This construction is very closely related to the generalized multiple zeta values of Vignat and Wakhare [37].
Remark 2** (Interpolated complete symmetric polynomials).**
Regarding the weight sequence as variables, the values can also be interpreted in an algebraic way. This is no surprise, due to the well known relationship between the multiple zeta values and quasi-symmetric functions [13, 15, 16, 37, 39]. They relate complete elementary symmetric polynomials and elementary symmetric polynomials : at or is given by
[TABLE]
and
[TABLE]
We will see later that the power sums on variables,
[TABLE]
also appear in an alternative representation of .
Example 3** (Interpolated -sets and multisets).**
Given the weight sequence we obtain
[TABLE]
interpolating between -sets and multisets of . The special values and enumerate -sets (4) and multisets (3) of :
[TABLE]
Example 4**.**
Given the weight sequence we obtain
[TABLE]
The special values are given in terms of the Stirling numbers of the first and second kind [10]:
[TABLE]
Example 5** (Interpolated truncated multiple zeta values).**
Given the weight sequence , we obtain a special instance of the truncated interpolated multiple zeta values,
[TABLE]
such that and .
2.2. Ordered partitions and Interpolated multiple zeta values
Let (3). We can associate an ordered partition to as follows: there exist integers and with multiplicities with such that . Here, we use that shorthand notation for appearing exactly times. We refer to the ordered partition as the shape of . The shape, as a map, is a surjection from to . Note that
[TABLE]
We may write
[TABLE]
Thus, for we get
[TABLE]
3. Generating functions and Bell polynomials
In this section we use symbolic combinatorics to obtain the generating function . This leads then to several expressions for , collected in our main result, Theorem 2, generalizing several results in the literature [3, 4, 16, 33, 28]. To the best of the author’s knowledge, our general results have not appeared before in literature, although related ideas have been developed earlier or in parallel, as mentioned in the introduction. Let denote all multisets of , , with as stated in (3). In order to gain more insight into we use symbolic combinatorics and study the generating function
[TABLE]
Theorem 1** (Generating functions and power sums).**
The generating function is given by
[TABLE]
as well as
[TABLE]
with as in (7). Moreover, assume that and that the are all distinct. Then,
[TABLE]
Remark 3**.**
Very similar results also hold for and infinite multisets (6). In special cases it is possible to derive the complete generating function . For we get
[TABLE]
For we get an ordinary hypergeometric function
[TABLE]
Similar, but more involved, partial fraction decompositions exist when the are not all distinct.
Before we state the proof of Theorem 1 we collect a result on partial fraction decomposition.
Lemma 1** (Partial fraction decomposition).**
Let and denote monic polynomials of degree with distinct zeros given by and , , respectively, such that . The rational function has the partial fraction decomposition
[TABLE]
Proof of Theorem 1.
First, we derive (8) using symbolic constructions from analytic combinatorics, see Flajolet and Sedgewick [8]. Let be a combinatorial class of size one, . Due to the sequence construction we can describe the class of multisets of as follows
[TABLE]
Thus, the generating function
[TABLE]
is given by
[TABLE]
All multisets of can be combinatorially described by
[TABLE]
Hence, the generating function is given by
[TABLE]
Next, we use the representation to obtain the expression (9) for .
[TABLE]
We use (7) and expand the two logarithm functions:
[TABLE]
Finally, we turn to (10). We assume that and that the are all distinct. Then, we can write as follows:
[TABLE]
Consequently, applying partial fraction decomposition (see Lemma 1) with and gives
[TABLE]
Multiplying the sum with directly gives the stated expression (10). ∎
As a consequence of the theorem before, we obtain alternative expressions for the values (5). In the following we use the complete Bell polynomials , which are defined via the identity
[TABLE]
Theorem 2**.**
The values (5) can be expressed in various ways:
- •
in terms of values and :
[TABLE]
- •
Bell polynomials and the power sums (7):
[TABLE]
- •
Determinantal expression: let , it holds
[TABLE]
- •
Combinatorial sum: assume that and that the values are all distinct. Then,
[TABLE]
Example 6**.**
For we solely sum over the odd indices , since only lead to a positive contribution. We obtain
[TABLE]
A direct byproduct of this result is an expression for . We state the formula for , generalizing the already known results for , and , ; see for example [28] and the references therein.
Example 7**.**
Given the weight sequence we obtain for the results
[TABLE]
as well as
[TABLE]
Moreover, we obtain for the identity
[TABLE]
For we obtain as a special case a truncated analog of the formula of Hoffman and Ihara [16, eqn. ]:
[TABLE]
The combinatorial sum extends a result of Prodinger for [33] (see also [32, 34, 36]), originally stated solely in terms of the combinatorial sum and the expression in terms of Bell polynomials (11):
[TABLE]
Finally, we mention that for the combinatorial sum turns into a well-known and often rediscovered formula [3, 4, 9, 28]:
[TABLE]
Proof of Theorem 2.
First, we turn to the expression (12) for . From Theorem 1 and (8) we obtain
[TABLE]
Thus, extraction of coefficients gives the stated result:
[TABLE]
For (13) we use (9) of Theorem 1. The definition of the Bell polynomials (11) and extraction of coefficients, , then directly gives the stated results, with the Bell polynomials (11) evaluated at x_{\ell}=(\ell-1)!A_{n}(\ell)\big{(}t^{\ell}-(t-1)^{\ell}\big{)}. Following Hoffman [14], the determinant (14) can be obtained using the theory of symmetric functions. Let denote variables of degree one. As in Remark 2 let denote the complete symmetric functions and the power sums. There exists polynomials such that
[TABLE]
Let and denote the generating functions of and :
[TABLE]
Then,
[TABLE]
This gives the Bell polynomial type expression for the :
[TABLE]
On the other hand, MacDonald [30] gives the determinant
[TABLE]
which proves the stated result. Finally, the combinatorial expression (15) is readily obtained by extraction of coefficients from the partial fraction decomposition (10) of in Theorem 1. ∎
4. Further generalizations
4.1. Refinements of the parameter
Definition 3** (Refined parameter sigma).**
Let (3). Then is defined as the number equal signs in stemming from numbers . Then, as functions acting from to :
[TABLE]
Example 8**.**
Given then , and , .
In the following we use the vector notation .
Definition 4** (Multi-interpolated weighted multiset sums).**
Given the weight sequence . Let be defined as
[TABLE]
The generating function can readily be obtained using the symbolic methods.
Theorem 3** (Generating functions and power sums).**
The generating function is given by
[TABLE]
Moreover, assume that for we have and distinct. Then,
[TABLE]
The proof of the theorem is very similar to the proof of Theorem 2 and therefore omitted.
Example 9** (Even-odd interpolated truncated multiple zeta functions).**
For even indices let , whereas for odd indices let . Then,
[TABLE]
The generating function is
[TABLE]
We consider for example , and assume . Then,
[TABLE]
and
[TABLE]
Thus,
[TABLE]
4.2. Sums of multisets: partitions
Yet another direction would be to introduce a new parameter on (3), with the one-norm of . It measures the numbers the partitions induce: . Then,
[TABLE]
Note that this can alternatively be achieved by a change of weights . Then, the generating function satisfies
[TABLE]
It is then possible to study the distribution of , or the joint distribution of and on .
Example 10** (Partitions - distributions).**
Setting we the generating function simplifies to \prod_{m=1}^{n}\Big{(}\frac{1}{1-a_{m}q^{m}z}\Big{)}.
Example 11** (Partitions and infinite multisets).**
Moreover, setting and letting in leads us directly to the generating function of the partition function:
[TABLE]
5. Distributions and limit laws
Before, we treated the variable as a parameter, mainly between zero and one, interpolating between weighted -subsets and -multisets or truncated multiple zeta values and their star counterparts. Here, we are interested in the distribution of and introduce a random variable . In the following we denote with the extraction of coefficients operator, , .
Definition 5** (Random variable ).**
Given the weight sequence with positive real weights . The random variable counts the number of equal signs in an element (3). Its probability mass function is defined in terms of (5):
[TABLE]
equivalently, the probability generating function is given by
[TABLE]
Note the the actual support of is the range , since for at most elements out of a total of can be unequal.
The expected value
[TABLE]
can be obtained by extraction of coefficients:
[TABLE]
Here denotes the evaluation operator at . The variance is determined via the second factorial moment ,
[TABLE]
with
[TABLE]
5.1. Interpolated -element multisets
We choose the weight sequence and obtain the distribution of
Theorem 4** (Distribution - -multisets).**
The probability mass function of the random variable in random multisets of , is given by
[TABLE]
Thus, follows a hypergeometric distribution. Its expected value and variance are given by
[TABLE]
Moreover, the factorial moments are given by
[TABLE]
Proof.
The probability mass function is readily obtained by extraction of coefficients. First, we note that
[TABLE]
Then,
[TABLE]
Alternatively, observe that equal signs are distributed amongst places, leading to , and that the remaining sums give a factor
[TABLE]
The expected value, the variance and the factorial moments of the hypergeometric distribution are well known, or directly obtained using the Vandermonde identity. ∎
Theorem 5** (Limit laws - -multisets).**
The limit laws for are given by three different distributions:
- (1)
Degenerate case: for and : ; similarly, for and : . 2. (2)
Poisson range: for and : ; similarly, for and with : . 3. (3)
Normal range: for and :
[TABLE]
Proof.
First we note that the expected value of the random variable , counting the number of equal signs in random multisets of , has the following behavior:
[TABLE]
The Poisson limit law for follows directly from the asymptotic expansion of the factorial moments:
[TABLE]
Thus, by the method of moments the (factorial) moments converge to the (factorial) moments of a Poisson distributed random variable with parameter . In the range and the Poisson limit law is indicated by the asymptotic expansions of and . We obtain elementarily
[TABLE]
By our assumption so . We use the asymptotic expansion of the factorials,
[TABLE]
as well as a precise expansion of the terms
[TABLE]
using the representation. This gives the desired result
[TABLE]
The normal limit laws for the hypergeometric distribution are classical and well known, see Nicholson [31] or Feller [7] and the discussion in the introduction; the proofs are omitted. ∎
5.2. Interpolated truncated multiple zeta values
We consider the weight sequence and the distribution of , defined by the probability generating function
[TABLE]
For example, the boundary values are given by
[TABLE]
We derive the expected value
[TABLE]
using (16).
Theorem 6**.**
The expected value is given by the following exact expression:
[TABLE]
Note that the variance can be obtained using the second factorial moment (17), but it is more involved. The limit laws for are technically much more involved compared to the case . The cases of either fixed and or and fixed are not too difficult to analyze. In contrast, for both the asymptotic expansions of the denominator require a more precise analytic combinatorial analysis; see the works of Hwang [18, 20, 22] and Bai et al. [2]. The case of and indicates that a normal limit should appear. Note that the limit law, given by the sum of independent Bernoulli random variables, also appeared in the analysis of algorithms literature; amongst others, in the the unsuccessful search in binary search trees [6], the degree of the root in increasing trees [6], distances in increasing trees [5, 6] or edge-weights in increasing trees [25, 29].
Proposition 1**.**
For and fixed,
[TABLE]
where denote independent Bernoulli-distributed random variables; hence the probability mass function of is given by
[TABLE]
For and fixed the random variable degenerates: .
Remark 4**.**
The growth range of the degenerate case can be extended and made more precise using the results of Hwang [19] for , stated in terms of the unsigned Stirling numbers of the first kind, and his results [22] (see also Bai et al. [2]) for , related to the expected number of maxima in hypercubes.
Remark 5**.**
It is well known that the sum of the limit law of , normalized and centralized, for is asymptotically normal:
[TABLE]
See for example Dobrow and Smythe [6] for an approach using Poisson approximation or Hwang [21]. Thus, we expect that the random variable is asymptotically normal at least in a certain range of both tending to infinity. We observe that , or , where is concentrated around . Thus should govern the behavior of and also its total mass . Note further, that even the asymptotic expansions of for derived earlier by Hwang [18, 20, 22] are more involved.
Proof. (Expected value).
Our starting point is the generating function
[TABLE]
Differentiating with respect to gives
[TABLE]
Evaluation at gives
[TABLE]
Extraction of coefficients leads to the stated result:
[TABLE]
noting that . ∎
Proof. (Limit laws).
For and fixed we study
[TABLE]
We use the representation given in Corollary 7 for and . We require the asymptotic expansion of the harmonic numbers for :
[TABLE]
Asymptotically, the summand dominates and gives for both and the same value . Consequently, for large .
For and fixed we use singularity analysis [9] and derive the asymptotic equivalent of , . We have
[TABLE]
Since for we get
[TABLE]
The product has a dominant singularity is at and can be written as
[TABLE]
with analytic inside a circle of radius . Consequently, for
[TABLE]
It remains to derive the asymptotic expansion of . We can use again singularity analysis as before; alternatively, the well known [9, 2, 28] binomial sum representation directly gives us the desired result:
[TABLE]
Finally, combining our results gives
[TABLE]
The product is exactly the probability generating function of the independent Bernoulli random variables with success probability . Extraction of coefficients directly leads to the stated result, using the representation of given in [28]. ∎
5.3. Interpolated truncated multiple zeta values - only twos
We consider the weight sequence and the distribution of , defined by the probability generating function
[TABLE]
For example, the boundary values are given by
[TABLE]
We obtain the following limit laws.
Theorem 7**.**
For the limit laws for are given by three different distributions:
- (1)
For and fixed: , with
[TABLE] 2. (2)
For and fixed: . The random variable is given by the sum of independent Bernoulli random variables:
[TABLE]
where denote independent Bernoulli-distributed random variables. Moreover, , . 3. (3)
For : ;
[TABLE]
Here , , denote independent Bernoulli-distributed random variables. Moreover, , .
Remark 6**.**
The random variable is exactly the limit law of the number of cuts in a recursive tree to isolate a leaf [26]; modified weight sequences lead to other families of increasing trees. Moreover, is closely related the the distribution of the number of cuts in a tree of size .
Proof.
Similar to (2) we have,
[TABLE]
By taking the limit , being fixed, and extraction of coefficients we directly obtain the stated result.
For and arbitrary we use singularity analysis [9] and derive the asymptotic equivalent of , . We have
[TABLE]
Setting as before , we get
[TABLE]
The product has a dominant singularity is at and can be written as
[TABLE]
with analytic inside a circle of radius . Consequently, for
[TABLE]
For the asymptotic expansion of we can use again singularity analysis and obtain . Finally, combining our results gives
[TABLE]
The product is exactly the probability generating function of independent Bernoulli random variables each with success probability for . ∎
5.4. Refined decomposition of sigma, random vectors and marginals
Similarly to the refinements of the parameter into , , the random variable can be decomposed into : . Given the weight sequence , the joint distribution of the random vector is determined by
[TABLE]
Another possibility is to distinguish only between odd and even indices, such that , and to study the bivariate probability generating function
[TABLE]
For truncated zeta values with the weight sequence , the random vector has the probability generating function
[TABLE]
Example 12** (Truncated zeta values - random vector).**
For truncated zeta values, weight sequence , the random vector has the probability generating function
[TABLE]
Example 13** (Truncated zeta values - marginals).**
In contrast to the multiset case, the marginals are not exchangeable anymore. The probability generating functions are given by
[TABLE]
It is expected that the marginals are asymptotically independent, at least in some growth range of and .
Theorem 8** (Marginals - interpolated multisets).**
The marginals are identically distributed, but not independent; the sequence is exchangeable. The probability mass function is given by
[TABLE]
The factorial moments, , are given by
[TABLE]
Theorem 9**.**
We obtain the following limit laws, depending on the growth of and :
- •
for and fixed the normalized random variable converges to a Beta-distributed random variable .
- •
for and the normalized random variable converges to a standard exponentially distributed random variable, .
- •
for and the random variable converges to a (modified) geometric distribution :
[TABLE]
- •
for and the random variable degenerates, .
Remark 7**.**
The random variable is closely related to the number of descendants of node labelled in a recursive trees of size [24]. Thus, can be described in terms of a Polya-urn model and the asymptotics for and fixed can be refined to almost sure convergence; moreover a classical result is available in terms of a martingale tail sum [11].
Proof of Theorem 8.
Let . The probability generating function of the marginal distribution is given by
[TABLE]
By the structure of the probability generating function, the random variables are identically distributed and the random vector is exchangable. Thus, we can directly obtain the expected value from .
[TABLE]
Additionally, the probability mass function is obtained readily by extraction of coefficients. All factorial moments, , can be obtained in a straightforward way:
[TABLE]
∎
Proof of Theorem 9.
We use the method of moments and derive asymptotic expansions of the factorial moments. For fixed and we get
[TABLE]
Consequently, the power moments of are asymptotically given by , which proves the Beta limiting distributions.
For we observe that
[TABLE]
Thus, the factorial moments are almost of mixed Poisson type [27] with standard exponential mixing distribution; the additional factor can be explained by the definition of the parameter , which influences the discrete limit case. This directly leads to the stated limit laws using Lemma 2 of [27]. Alternatively, the discrete limit for can be directly obtained as follows:
[TABLE]
The remaining case is treated in a similar way. For we observe that
[TABLE]
∎
5.5. Sum theorem for interpolated multiple zeta values
Yamamoto [38] established, amongst many other things, the sum theorem for interpolated multiple zeta values:
[TABLE]
with . Note that the case gives the sum theorem for multiple zeta star values
[TABLE]
whereas gives the ordinary sum theorem.
Using (18), we can study the distribution of the parameter on sums of interpolated multiple zeta values with the same depth and weight . Let denote the random variable with probability generating function
[TABLE]
Let denote the Bernstein polynomials
[TABLE]
We obtain the following result.
Theorem 10**.**
The probability generating function is given by a Bernstein form of degree with Bezier coefficients
[TABLE]
Let . The probability mass function , is given by
[TABLE]
setting with and . Consequently, , with , has the same distribution as the random variable , counting the number of descendants of node in a random recursive tree of size : .
As a byproduct of our identification of , we get the following limit laws from [24].
Corollary 1**.**
The limiting distribution behaviour of the random variable , with and , is, for and depending on the growth of , characterized as follows.
- •
for fixed,
- •
for small : such that : the normalized random variable is asymptotically Exponential distributed,
- •
for : such that , with . The shifted random variable is asymptotically negative binomial-distributed, ,
- •
for large : such that : , i.e. .
Proof.
We note that by definition, the summands are weighted Bernstein polynomials:
[TABLE]
Simplification gives the stated expression for the Bezier coefficients. The probability mass function is obtained by extraction of coefficients:
[TABLE]
Using an identity for binomial coefficients [10] then readily gives
[TABLE]
and thus we get the corresponding result for . We observe that for the probability mass functions and thus the distribution of and the random variable (case ) in [24] coincide and thus we can directly transfer the limit laws for . ∎
6. Summary and Outlook
We introduced a parameter on weighted -element multisets and studied properties of it using symbolic combinatorics. This allows to prove several relations for truncated interpolated multiple zeta values ( as well as reproving identities for truncated multiple zeta values ). Introducing refined enumeration leads to new refinements of previous identities. Interpreting the parameter as a random variable leads to several different limit laws, depending on the considered weight sequences and the growth of and . Interestingly, the limits laws are closely related to a great many results in combinatorial probability and analytic combinatorics.
It is of interest to complete the analysis of in the case ; we will report on this elsewhere. Similar to the random variable , defined in terms of the parameter , one may also study the distribution of the parameter or the joint distribution of and . Moreover, it is certainly of interest to study the distribution of for other interesting sequences, compare with Vignat and Wakhare [37] or Hoffman and Mező [17].
6.1. Acknowledgement
The author warmly thanks Paul Schreivogl for pointing out the connection to the Bernstein polynomials. Moreover, the author thanks the referees for their great help and very valuable suggestions, improving significantly the structure and presentation of this work, clarifying a few points, as well as providing additional references.
7. Appendix: Auxiliary results about probability distributions
A beta-distributed random variable with parameters has a probability density function given by , where denotes the Beta-function. The (power) moments of are given by
[TABLE]
The beta-distribution is uniquely determined by the sequence of its moments. In this work we will discuss a beta-distributed random variable , with moments given by , for .
An exponentially distributed random variable with parameter one has density , and power moments .
A Bernoulli distributed random variable with parameter has probability mass function determined by , . The sum of independent Bernoulli distributed random variables has probability generating function
[TABLE]
The factorial moments of are given by
[TABLE]
Under the assumption that the probabilities tend to zero fast enough, we may define as the sum of infinitely many Bernoulli random variable: , with factorial moments formally given by times a multiple series:
[TABLE]
A Poisson distributed random variable , , has probability mass function and factorial moments given by
[TABLE]
A hypergeometric distributed random variable with parameters has probability mass function and factorial moments given by
[TABLE]
In particular,
[TABLE]
Moreover, the following normal limit law can be deduced (see Nicholson [31] or Feller [7]).
Lemma 2**.**
Let denote a hypergeometric distributed random variable. Under the assumption and such that , the random variable , centered and normalized, is asymptotically standard normal distributed:
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Bachmann, Interpolated Schur multiple zeta values, J. Aust. Math. Soc. 104, 289–307, 2018.
- 2[2] Z.-D. Bai, L. Devroye, H. -K. Hwang, T. -H. Tsai, Maxima in hypercubes, Random Structures and Algorithms 27 , 290–309, 2005.
- 3[3] N. Batir, On some combinatorial identities and harmonic sums, Int. J. Number Theory 13 , 1695–1709, 2017.
- 4[4] K. Dilcher, Some q-series identities related to divisor functions, Discrete Math. 145 , 83–93, 1995.
- 5[5] R. Dobrow, On the distribution of distances in recursive trees, Journal of Applied Probability 33, 749-757, 1996.
- 6[6] R. Dobrow and R. Smythe, Poisson approximations for functionals of random trees, Random Structures & Algorithms 9, 79-92, 1996.
- 7[7] W. Feller, Intoduction to probability theory and its applications , John Wiley and Sons, Third Edition, 1968.
- 8[8] P. Flajolet and R. Sedgewick, Analytic Combinatorics . Cambridge Univ. Press, Cambridge, UK, 2009.
