A Nonlinear Transform for the Diagonalization of the Bernoulli-Laplace Diffusion Model and Orthogonal Polynomials
Chjan Lim, William Pickering

TL;DR
This paper introduces a novel method for diagonalizing the transition matrix of the Bernoulli-Laplace diffusion model, enabling explicit calculation of eigenvalues and eigenvectors through a mapping to a differential operator, simplifying analysis of such Markov processes.
Contribution
A new approach mapping the eigenproblem to a differential operator, making the diagonalization of Bernoulli-Laplace and similar models more straightforward and broadly applicable.
Findings
Explicit eigenvalues and eigenvectors derived for the Bernoulli-Laplace model.
Method applicable to other two-urn diffusion models.
Simplifies analysis compared to previous orthogonal polynomial methods.
Abstract
The Bernoulli-Laplace model describes a diffusion process of two types of particles between two urns. To analyze the finite-size dynamics of this process, and for other constructive results we diagonalize the corresponding transition matrix and calculate explicitly closed-form expressions for all eigenvalues and eigenvectors of the Markov transition matrix . This is done by a new method based on mapping the eigenproblem for to the associated problem for a linear partial differential operator acting on the vector space of homogeneous polynomials in three indeterminates. The method is applicable to other Two Urns models and is relatively easy to use compared to previous methods based on orthogonal polynomials or group representations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpinion Dynamics and Social Influence · Complex Network Analysis Techniques · Quantum chaos and dynamical systems
A Nonlinear Transform for the Diagonalization of the Bernoulli-Laplace Diffusion Model and Orthogonal Polynomials
Chjan C. Lim
William Pickering Department of Mathematical Sciences, Rensselaer Polytechnic Institute, 110 8th Street, Troy, New York 12180, USA
Abstract
The Bernoulli-Laplace model describes a diffusion process of two types of particles between two urns. To analyze the finite-size dynamics of this process, and for other constructive results we diagonalize the corresponding transition matrix and calculate explicitly closed-form expressions for all eigenvalues and eigenvectors of the Markov transition matrix . This is done by a new method based on mapping the eigenproblem for to the associated problem for a linear partial differential operator acting on the vector space of homogeneous polynomials in three indeterminates. The method is applicable to other Two Urns models and is relatively easy to use compared to previous methods based on orthogonal polynomials or group representations.
1 Introduction
The Bernoulli-Laplace (BL) model arise from diffusion theory and is related to the shuffling of cards [8]. Symmetries of the permutation group appear naturally in this model and other random walks on groups. Previous solutions of this model have appeared in Diaconis and Shashahani [7] and in the works of Karlin and MacGregor [10]. Group representations are used explicitly in the first; the derivation of a non-standard inner product or equivalently a measure for orthogonal polynomials which are related to the eigensolutions of the BL model appears in the second.
In this paper, we give a third way for deriving exact solutions of all the eigenvectors of the BL model, through a nonlinear transform that triangularizes and then diagonalizes the transition matrix . In brief, our method associates a specific linear partial differential operator (LPDO) that acts on the vector space of homogeneous polynomials, to the matrix . The inherits the symmetries of the BL model; it encodes the tri-diagonal singly-stochastic (column sums are equal to 1 ) and anti-symmetric structure of . The components of the (right) eigenvectors (in view of the equal column sums of of is encoded in the coefficients of the homogeneous polynomial A classical theory for the symmetries of such LPDOs have been formulated in terms of the Lie algebra of symmetry operators that commutes with (cf. [13]).
It turns out and we exploit in our method, that the symmetries of appear iu the form of suitable linear and nonlinear transformations on the independent variables etc. or indeterminates of The ease of use of this method resides in the transparent or explicit way to find these transformations Our algorithm is completed by associating the transformed LPDO, back to what turns out to be a triangular matrix ; in other words, the transformation for encodes a similarity transformation that triangularizes i.e., , which is then solved directly for its eigenvalues and right eigenvectors.
Here, we give a summary of the Urn models to which the BL model is related as an extension. The Ehrenfest model and the Polya Urn models are two of the early solvable models in the literature [9]. They appear as two of the exactly solved cases in Friedman’s formulation of Urn models where precisely one urn and balls of two colors are drawn and replaced with additions [9]. A dual formulation of Friedman’s Urn models was introduced in a series of recent papers [16, 14]: instead of balls of two colors and one urn, the dual formulation uses two urns and one-colored balls. The latter is more convenient for modelling of certain network science models [3], such as the Voter model where two balls are drawn and returned to the two urns with prescribed probabilities that depend on the order in which they are drawn. This is because many of the network science models are irreversible Markov chain models [4] which have absorbing states. Their transition matrices , unlike for the BL model, are not symmetric, in an essential sense, that is, there are no non-standard inner products for in which these matrices have a symmetric form.
Using a new method based on diagonalization of transition matrices [15], we solved exactly the eigenvectors of several well-known models, including the Ehrenfest model, the Voter model [18, 12, 11, 5, 17, 2], the Moran model for genetic drift, and the Naming game models [19], [20]. Most of these models are irreversible Markov models with absorbing states, and have essentially non-symmetric transition matrices in the sense just mentioned. The BL model however, is based on two urns and balls of two colors. Thus, it is not strictly in the class of Two Urns models to which we recently applied our method. In modifying this method so that it applies to the BL model, we will have shown that the new method is not only easy to use but also flexible in extension to new problems.
One of the main points here is the technical simplicity of uncovering the symmetries of the above LPDOs within our method, through the explicit appearance of the expressions in the coefficients of We give here the LPDO for the Voter model [16, 14] to indicate what we mean: first the propagation equation for the transition matrix is given by
[TABLE]
the eigen-problem for the associated LPDO acting on the homogeneous polynomial (which encodes the components of the right-eigenvector of is given
[TABLE]
which clearly suggests the transformation Indeed this triangularized and diagonalized the Voter model and led to its complete solution.
Contrast this ease of use with the fact that triangularization and diagonalization of a given transition matrix of size has computational complexity In other words, exact integration of the Two Urns models via diagonalization of transition matrices are nontrivial problems, that are difficult to solve but once known, the solutions are easy to verify. [16, 14] provides a simple method to find such explicit diagonalization and hence all eigenvectors for a class of transition matrices from the Two Urns models, even when their transition matrices are essentially non-symmetric. Note that the eigenproblem and diagonalization of symmetric matrices have a lower computational complexity.
We aim here to highlight this method’s ease of use, relative to the group representation method and the method of orthogonal polynomials. Moreover, the BL model differs significantly from the original Two Urns subclass of models for which our method was initially formulated. Thus, we also aim to show that, with the specific introduction of a nonlinear change of independent variables, this method can be applied to more complex models than the original class of models. Since the BL transition matrix is from a reversible Markov chain with a stationary distribution [4], it is non-symmetric singly-stochastic only in a trivial sense. In other words, there exists (a difficult to find) non-standard inner product for in which is symmetric and hence doubly-stochastic. Karlin and McGregor [10] using their powerful integral representation method of finding an explicit way to symmetrize by introducing a non-standard inner product (or orthogonal measure) into the problem, have related the right eigenvectors of the transition matrix of the model to the orthogonal polynomials called the dual Hahn polynomials. A third aim of this paper is therefore to re-derive from the diagonalization of this non-standard inner product in which the dual Hahn polynomials are an orthogonal polynomial system. Note that this non-standard inner product, once found, yields a symmetric version of which is an example of a Jacobi operator that arise in the the classical moments problem [1], and is related to orthogonal polynomials via the Riemann-Hilbert method [6].
The paper is organized as follows: section 3 concerns the calculation of the right eigenvectors and eigenvalues of in closed form by method introduced in [16, 15, 14]; in view of the fact that these right eigenvectors are not orthogonal in the usual Euclidean inner-product, section 4 concerns the transformations needed to calculate the orthonormal system of left eigenvectors of and also the derivation of the non-standard inner product in which the right eigenvectors are now orthogonal; section 5 concerns the elementary proofs, based only on the eigenvectors and eigenvalues of for the tight upper and lower bounds for times to stationarity in the BL model [7] ; section 6 concerns a numerically exact evaluation of the expression for the TV norm in these bounds using the eigenvectors and eigenvalues of directly, hence slightly sharper estimates for the mixing times of the BL model.
Beyond the balanced special case of the BL problem treated in detail in this paper, the same generating function method can be used to prove similar tight bounds for mixing times in the other cases.
2 Transition matrix of the BL model
Let the transition matrix be defined so that , so that the sum of each column is . In the general BL model for balls of two colors and two urns, are fixed parameters satisfying the constraints
[TABLE]
where equals total number of balls in the model. For (where by abuse of notation stands for both the row label of transposed matrix and the number of white balls in urn 1, the transition probabilities are explicitly given by
[TABLE]
.
3 Diagonalization - Right eigenvectors of the
general BL model
In [16, 14], we developed an explicit method for exactly integrating or solving a 5-parameters subclass of a class of Two Urns models which is parametrized by six real parameters. Our method is based on a relationship between certain banded stochastic matrices (such as tridiagonal and pentadiagonal non-symmetric transition matrices of markov chain models) and the LPDOs acting on the vector space of homogeneous polynomials, of finite order in two indeterminates. The symmetries of the LPDO, associated with a given non-symmetric singly stochastic matrix from this solvable subclass of the Two Urns models, are identified and used explicitly to transform from the original indeterminates ( independent variables say) to suitable new variables (such as ). In the new variables the transformed LPDO, acts on the (again homogeneous of same order as polynomial We have shown in [16, 14] that the transformed eigen-problem
[TABLE]
for a well-defined subclass of such Two Urns problems is equivalent (via the inverse of the original relationship between banded matrix and LPDO) to the eigen-problem for a triangular matrix, which can then be solved explicitly for both right and left eigenvectors. In other words, at the end of this brief summary, the symmetries of LPDO inherited from the original banded stochastic matrix generate an explicit similarity transformation, such that
[TABLE]
where is diagonal, and contains the eigenvectors of
This method can be formalized as an Algorithm as follows: Given the input of a singly stochastic transition matrix of size ,
(I) Choose a suitable homogeneous polynomial of finite degree , that has the components of a right eigenvector of as coefficients of the monomials ; part of this choice is the number of indeterminates in . For example, the Voter model of size N (number of balls) with a transition matrix which is a by real matrix, requires a homogeneous polynomial of degree in the indeterminates because there are two urns.
(II) Associate the recursion inplicit in given Markov matrix to a LPDO, which acts on the homogeneous polynomial ; the basic elements of this association scheme are the standard linear differential operators for increasing, decreasing and not changing the numbers of balls in each urn (which correspond in the example below to the probabilities prescibed by the transition matrix), and a set of multiplication type linear operators that correspond to shifts.
(III) A transformation to new independent variables,(for instance, ) is chosen to satisfy two conditions:
the transformed polynomial
[TABLE]
is a homogeneous polynomial of the same finite degree as
is a transformation based on the symmetries of (cf. [13]), that is, the combinations of the original variables appear naturally in the coefficients of the LPDO,
These conditions and are clearly not sufficient to ensure the transformed LPDO eigenproblem
[TABLE]
is associated with a similar triangular matrix which explicitly yields all its eigenvectors . That they are sufficient has to be proved either in each problem to which we apply the Algorithm, or for a class of models as in the case of the Two Urns models.
(IV) Using the transformation in step (III), derive the corresponding transformed LPDO, that acts on the transformed polynomial .
(V) Without explicitly calculating the transformed matrix which is associated with the transformed LPDO, in step (IV), check that the transformed eigen-problem for is indeed a recursion system for the transformed eigenvectors that can be solved explicitly, i.e., it is equivalent to a triangular linear system of equations. Solve for the eigenvalues and then the eigenvector components , and if required transform back to the original components . These are the main outputs of the Algorithm.
(VI) Use the eigenvectors in step (V) to diagonalize the original matrix if necessary. This is the end of the ALgorithm.
Now we apply the Algorithm to the BL model. Given the transition matrix of the BL model (cf. section 2), it will be obvious that three independent variables (instead of the two before) should be used to formulate the BL problem. In step (I) of the Algorithm, we adopt the anzatz that the associated with the above by matrix , now acts on a homogeneous polynomial in three indeterminates, We encode the entries , of the eigenvector of the transition matrix for the BL model as follows:
[TABLE]
where number of white balls in urn 1 (also denoted . The choice of three independent variables to encode the components of an eigenvector of in the homogeneous polynomial is now made obvious by this explicit expression for
In step (II) of the Algorithm, we derive from the original eigen-problem for transition matrix an LPDO, , that acts on . Towards that aim, we note, in particular, the entries for and in correspond respectively to the following linear differential operators with coefficients that are monomials in and
[TABLE]
where for example. In addition, it is part of the association scheme that multiplication in the LPDO (cf. [13]) by the coefficient (resp. represents down (resp. up) shifts in the index within the discrete recursion equations of the original eigen-problem for matrix The associated with the eigen-problem of the tridiagonal Markov matrix is given by:
[TABLE]
In step(III) of the Algorithm, we note that the symmetries of with respect to transformations of its independent variables, is expressed in the factor in its coefficients. This suggests the transformation to the new independent variables
[TABLE]
Since the transformed homogeneous polynomial is now given by
[TABLE]
in terms of the (new) components of the right eigenvector, this transformation clearly satisfies both necessary conditions (A) and (B) in step (III) of the Algorithm.
To prove that it is sufficient for our purpose of obtaining the eigenvalues and eigenvectors exactly and for diagonalization, we proceed by direct calculations.
In step (IV) using the following obvious identities for the transformation of partial derivatives
[TABLE]
the transformed LPDO, in , is
[TABLE]
In step (V), by reversing the derivation of the original through the association scheme [13], this in is shown to be equivalent to the following triangular system for the (right) eigen-problem of the transformed matrix :
[TABLE]
We have therefore verified the sufficiency of the transformation where for triangularizing (and later diagonalizing) This triangular system implies the recursion
[TABLE]
which can be solved directly.
For nontrivial eigensolutions for the denominator in must vanish, yielding the following exact expressions for the eigenvalues,
[TABLE]
In the case
[TABLE]
The eigenvectors (in the transformed variables of are given explicitly by:
[TABLE]
Using these coefficients in the definition for gives
[TABLE]
We summarize the consequences of the above steps of the Algorithm on the BL model in the following theorem:
Theorem 3.1**.**
*In the above Algorithm for the BL model, for any size of the model, the LPDO, , after the transformation (3.9) on the independent variables, is equivalent to a triangular linear system (3.21) which has (right) eigenvectors given by (3.31) and eigenvalues (3.25). The (right) eigenvectors of the original BL matrix are in turn given by (3.38). *
3.1 Hypergeometric functions and dual Hahn polynomials
The next to final step left in this part of the paper is step (VI) in the Algorithm, to invert the above similarity transformation to obtain explicitly the closed-form expressions for the original components of the right-eigenvectors of For this purpose, let . Then,
[TABLE]
Using the hypergeometric identity,
[TABLE]
and the fact that any multiple of an eigenvector remains an eigenvector, we take the polynomial for the right eigenvector components to be
[TABLE]
whose coefficients are the original components of the right eigenvector corresponding to prior to the transformation above. These expressions are equivalent to the dual Hahn polynomials [10].
For the hypergeometric representation of the eigenvectors to be well defined, we require . There is no loss in generality with this assumption, because we can relabel and so that the assumption holds. From the solution for , we expand in
[TABLE]
to find the explicit form for the components of the right eigenvectors:.
[TABLE]
Notice that the solution is the order backwards difference of the components of the hypergeometric coefficients of .
The above treatment of the eigen-problem by transforming via symmetries, the independent variables of the associated LPDO, is equivalent to a similarity transformation of the transition matrix . Let for some transformation matrix . Then, the eigen-problem for is given by . The above calculations is equivalent to the matrix such that the new matrix is lower triangular. The last step in this section is to diagonalize We do this by diagonalizing the matrix triangular matrix . Here, and are the eigenvectors of . The components of these eigenvectors are corresponding to eigenvalue . Diagonalization of allows us to explicitly diagonalize the original transition matrix as
[TABLE]
Note that the matrix of eigenvectors is given by .
4 Symmetrizing transform, orthogonal measure and dual Hahn polynomials
For transition matrix let be given by (where we drop the subscript BL herein, i.e., )
[TABLE]
Recall the detailed balance of and its stationary distribution, given by [4] follows from the reversibility and ergodicity of the BL model. Note that is the symmetric version of the transition matrix:
[TABLE]
Therefore, has an orthonormal set of left eigenvectors, . Let be a matrix whose columns are . The spectral decomposition of by left eigenvectors is given by
[TABLE]
By the definition of , the transformation from to can be expressed as
[TABLE]
where is a diagonal matrix whose diagonal entries are . So, arbitrary powers of is given by
[TABLE]
Defining a new transformation by
[TABLE]
Since has a specific normalization, we can equate with after applying the appropriate normalization for . That is, for diagonal matrix , we take
[TABLE]
We can choose any normalization for the right eigenvectors given in , and will properly renormalize them. Here, we solve for and by appealing to the orthogonality of :
[TABLE]
Therefore
[TABLE]
Computing the matrix multiplication on the left side yields the diagonal entries of denoted by given by
[TABLE]
in terms of the right eigenvectors of Now that we have , we have the representations for both the left-eigenvectors and right-eigenvectors of given by
[TABLE]
in terms of the right-eigenvectors of the original that was obtained by our method in section 2.
From Eq. (57), we also have an explicit formula for given by
[TABLE]
So, by Eq. (54), we have
[TABLE]
Computing the matrix multiplication gives the following spectral decomposition
[TABLE]
as the explicit representation of in the BL model.
Since , take in Eq. (63) to find the stationary distribution of the BL model
[TABLE]
Note that this is the orthogonality relation for the right-eigenvectors of with orthogonal measure given in Eq. (63). We have derived the orthogonal measure in which the dual Hahn are an orthogonal polynomial system [10].
We summarize these results on the derivation of a non-standard inner product or orthogonal measure in which the original transition matrix of the BL model becomes a symmetric real matrix and the (right) eigenvectors are the system of orthogonal dual Hahn polynomials:
Theorem 4.1**.**
*The orthogonal measure in (4.12) symmetrizes , is related to the (left) and (right) eigenvectors of by (4.13) and (4.14), and yields the spectral decomposition (4.17). *
5 Bounds of mixing times - elementary proofs
We will discuss first the case for our method gave the eigenvalues of the BL model to be A heuristic estimate of the number of switches needed to mix the colors is thus,
[TABLE]
and therefore gives an idea of how many switches or time steps are needed until the variation distance between and is of order The lower bound is obtained along the lines of Diaconis et al, that is, by an application of the Chebyshev’s inequality. However, all estimates of the relevant mean values and variance needed to apply the Chebyshev’s inequality are constructed explicitly from the properties of the eigenvalues and eigenvectors of the BL model. We will state the theorems below but since the proofs are similar to those in [7], we have provided the details in an appendix.
Theorem 5.1**.**
*For for there is a universal constant such that *
Theorem 5.2**.**
*If then *
6 Exact calculations of mixing times
Given that we can calculate exactly by Eq. (63), and the stationary distribution is given by
[TABLE]
the total variational distance can be exactly computed for all time steps by
[TABLE]
This solution is shown in Figure 1, with the upper bound given in Figure 2.
Appendix A Proofs of Theorems 5.1 and 5.2
The eigenvalues and eigenvectors of the BL problem can be used to construct an upper bound on the variation distance between probability distribution after . As we proved in section 3, the right-eigenvectors of the original matrix are not orthogonal in the standard inner product of but a different inner product weighted by can be used to derive orthogonality of a related system of right-eigenvectors of
A.1 Upper bound
Let to define
[TABLE]
where
[TABLE]
Then, and hence
[TABLE]
implies that
[TABLE]
Using
[TABLE]
and averaging over initial data we have
[TABLE]
for some independent of This proves theorem 5.1.
A.2 Lower bound
In terms of the right-eigenvectors (with col sum =1 )
[TABLE]
and
[TABLE]
Next for , we get . A similar calculation gives
[TABLE]
Next we deduce and from , , and the orthogonality of :
[TABLE]
Therefore, taking , we fine . Furthermore, . Now, we have
[TABLE]
So with the same normalization as above for we deduce is uniformly bounded by constant since Now, by Chebyschev’s inequality,
[TABLE]
and
[TABLE]
Thus, if such that for we deduce
[TABLE]
Choose to obtain
[TABLE]
which proves theorem 5.2.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Akhiezer , The Classical Moment Problem , Hafner, 1965.
- 2[2] A. Baronchelli, C. Castellano, and R. Pastor-Satorras , Voter models on weighted networks , Phys. Rev. E, 83 (2011), p. 066117.
- 3[3] A. Baronchelli, M. Felici, V. Loreto, E. Caglioti, and L. Steels , Sharp transition towards shared vocabularies in multi-agent systems , J. Stat. Mech.: Theory Exp., (2006), p. P 06014, http://stacks.iop.org/1742-5468/2006/i=06/a=P 06014 .
- 4[4] P. Bremaud , Markov chain models , Springer-Verlag, 2005.
- 5[5] F. Caccioli, L. Dall’Asta, T. Galla, and T. Rogers , Voter models with conserved dynamics , Phys. Rev. E, 87 (2013), p. 052114.
- 6[6] P. Deift , Orthogonal Polynomials and Random Matrices , Courant Lecture Notes 3, 2000.
- 7[7] P. Diaconis and M. Shahshahani , Time to reach stationarity in the bernoulli–laplace diffusion model , SIAM Journal on Mathematical Analysis, 18 (1987), pp. 208–218.
- 8[8] L. Flatto, A. M. Odlyzko, and D. Wales , Random shuffles and group representations , The Annals of Probability, (1985), pp. 154–178.
