General form of DMPK equation
I. M. Suslov (Kapitza Institute for Physical Problems, Moscow, Russia)

TL;DR
This paper derives a generalized form of the DMPK equation for quasi-one-dimensional systems, incorporating tensor diffusion and multiple approximation variants, extending the traditional model's scope.
Contribution
It introduces a comprehensive generalization of the DMPK equation with tensor diffusion and explores three diagonal approximation forms, including reproductions of existing models.
Findings
Derived a tensor-based diffusion form of the DMPK equation.
Presented three diagonal approximation variants, including the classic and generalized forms.
Identified additional terms in alternative variants that extend the traditional equation.
Abstract
The Dorokhov-Mello-Pereyra-Kumar (DMPK) equation, using in the analysis of quasi-one-dimensional systems and describing evolution of diagonal elements of the many-channel transfer matrix, is derived under minimal assumptions on the properties of channels. The general equation is of the diffusion type with a tensor character of the diffusion coefficient and finite values of non-diagonal components. We suggest three different forms of the diagonal approximation, one of which reproduces the usual DMPK equation and its generalization suggested by Muttalib and co-workers. Two other variants lead to equations of the same structure, but with different definitions of entering them parameters. They contain additional terms, which are absent in the first variant.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
General form of DMPK equation
I. M. Suslov
P.L.Kapitza Institute for Physical Problems,
119334 Moscow, Russia
E-mail: [email protected]
The Dorokhov–Mello–Pereyra–Kumar (DMPK) equation, using in the analysis of quasi-one-dimensional systems and describing evolution of diagonal elements of the many-channel transfer matrix, is derived under minimal assumptions on the properties of channels. The general equation is of the diffusion type with a tensor character of the diffusion coefficient and finite values of non-diagonal components. We suggest three different forms of the diagonal approximation, one of which reproduces the usual DMPK equation and its generalization suggested by Muttalib and co-workers. Two other variants lead to equations of the same structure, but with different definitions of entering them parameters. They contain additional terms, which are absent in the first variant.
1. Introduction
The Dorokhov–Mello–Pereyra–Kumar (DMPK) equation [1, 2, 3, 4] is an efficient instrument for investigation of quasi-1D disordered systems (see the paper [5] for a review). It describes evolution of diagonal elements of the many-channel transfer matrix due to increasing the system length. The DMPK equation is obtained from the maximum entropy principle (assuming the maximal randomness consistent with the symmetry restrictions) and conceptually close to the random matrix theory by Wigner and Dyson [6]. It is determined by one parameter (the system size in units of the correlation length) and manifests universality specific for the metallic state. The DMPK equation is equivalent to the super-symmetric sigma-model [7], derived from the microscopic Hamiltonians [8, 9, 10], but allows to work with distributions of physical quantities. Solution of the DMPK equation [3, 4] reproduces the universal fluctuations of conductance and quantum corrections to it obtained from diagrammatic calculations [11, 12].
In principle, the transfer matrix approach underlining the DMPK equation is not restricted by the quasi-1D geometry. Considering a system of coupled one-dimensional chains and arranging the chains in accordance with symmetry of a -dimensional lattice, one can construct the systems of higher dimensionality. However, assumptions underlining the DMPK equation lead to statistical equivalence of chains and eliminate all information on the topology of space in the transverse directions. As a result, the DMPK equation cannot be used for study of the Anderson transition and is restricted by the metallic phase in the corresponding -dimensional space. In the localized regime of a -dimensional system, the DMPK equation is not adequate even for the quasi-1D geometry: it predicts the minimal Lyapunov exponent to be of order , while the reasonable microscopic models lead to the result [13, 14]. The latter can be understood easily in the regime of strong localization, when conductance is determined by one resonant trajectory and the system in fact becomes strictly one-dimensional 111 The simple algorithm for construction of resonant trajectories is given in Footnote 4 of the paper [15]. In the strongly localized regime, the contributions of resonant trajectories to conductance are scattered exponentially, and the latter is dominated by the most transparent channel. .
It should be clear that in the general case assumptions used in the DMPK equation should be relaxed; in particular, it is necessary for describing universality arising near the Anderson transition. Derivation of the most general form of the DMPK equation is a problem, realized by scientific community [3, 4, 14, 16] and admitted to be of fundamental significance. One of the possible generalizations was suggested by Muttalib and co-workers [16]–[18].
We show below that the equation of the DMPK type may be derived under minimal assumptions on the properties of channels. Generally, this equation is of the diffusion type with a tensor character of the diffusion coefficient and finite values of non-diagonal components. We consider three different forms of the diagonal approximation, one of which reproduces the usual DMPK equation and its generalization suggested in [16]–[18]. Two other variants lead to equations of the same structure, but with different definition of their parameters. They contain the additional term, which is absent in the first variant and turned out to be very actual in the recent research of the conductance distribution [15].
2. Main concepts
Considering the system as a set of coupled one-dimensional chains, we can thought of it as a ”black box” with attached ideal contacts in the form of isolated conductors 222 It means that a concept of ”channels” is used in the real space representation, which removes all problems related with evanescent modes [19].. Then the system can be treated as an effective scatterer and described by the transfer matrix , relating the amplitudes of waves on the left ( in -th channel) and on the right () of the scatterer (Fig.1):
[TABLE]
where , , , are vectors with components , , , . In the vector notation, the form of Eq. 1 does not depend on the number of channels, while the transfer matrix is divided naturally into four blocks; it allows a parametrization [3, 20]
[TABLE]
where , , , are unitary matrices, and is a diagonal matrix with the positive elements , which are eigenvalues of the Hermitian matrix . In the presence of time reversal invariance the additional relations arise [2, 3]
[TABLE]
which as a rule are of no significance for the following.
Of the main interest are parameters , which in particular determine the conductance
[TABLE]
(in the Economou–Soukoulis definition [21, 22]). The DMPK equation describes evolution of their mutual distribution function with increasing the length of the system
[TABLE]
[TABLE]
where for the orthogonal ensemble (usual systems with a random potential), for the unitary ensemble (systems in the strong magnetic field), for the symplectic ensemble (system with the strong spin-orbit interaction); parameter has a sense of the inverse correlation length of the quasi-1D system. The quantity is well-known from the random matrix theory [6] and arises from the Jacobian of transformation
[TABLE]
when integration over elements of the matrix is replaced by integration over its eigenvalues and the elements of diagonalizing matrix (). In fact, is the distribution function of levels, if they are contained in the restricted interval with the periodic boundary conditions (the Dyson circular ensemble). In actual applications, the distribution includes the additional factor, providing localization of the spectrum in a finite interval and practically not affecting the distribution of close levels. Analogously, the distribution is a formal solution of equation (5) but does not satisfy the normalization condition; an additional factor becomes inevitable, whose evolution is described by the DMPK equation. Exact solution of Eq.5 for shows [23] that the additional factor is not reduced to a smooth envelope but essentially changes the distribution , making it different from even on the local level; correlations of are determined by the Jacobian only at the initial stage of evolution, when all are small. In the context of generalizations of the DMPK equation this circumstance acquires a deep sense (see Footnote 9 in Sec. 5).
In a strictly one-dimensional system we have and equation (5) reduces to the form
[TABLE]
and coincides with the Landauer resistance [24]; such equation was derived in many papers [25]–[29]. Recently it was shown by the present author [15] that the general evolution equation in 1D systems has a form
[TABLE]
and the additional term, specified by parameter , is physically significant: its incorporation in the Shapiro scheme [30] allows to explain all essential features in the conductance distribution, which was impossible on the basis of (7). This term disappears in the random phase approximation and is naturally not reproduced by equation (5), based on the analogous assumptions. However, this term is also not predicted by the generalized DMPK equation suggested by Muttalib and co-workers [16]–[18]
[TABLE]
[TABLE]
and containing as parameters the elements of a certain matrix . 333 The dependence of was unnoticed in [17] (see Appendix ), but practically it is not very actual [31]. This fact clearly demonstrates that attempts of generalization of the DMPK equation are not sufficiently advanced and do not reproduce all essential contributions. This point was the main motivation of the present paper.
3. Idea of derivation
Derivation of the evolution equation is based on the relation
[TABLE]
where is a matrix close to the unit one. The form of the DMPK equation depends on statistical properties of the parameters , determining deviations of from the unit matrix. These parameters can be divided into two groups: for the first of them
[TABLE]
while for the second
[TABLE]
The presence of parameters (12) is necessary for arising of an equation of the diffusion type: since there is no effect in the first order in , all calculations should be made in the second order, and this is a reason for appearing of the second derivatives which are characteristic for the diffusion equation. The general strategy consists in averaging only over parameters (12), and making no assumptions relative to parameters (11).
To obtain the most general form of the DMPK equation, we should distinguish the category of quantities, for which the property (12) is not a model assumption but is an inherent property, following from their nature. Such quantities are well known and related with a diagonal disorder. Consider the Schroedinger equation with a random potential, which is defined on the lattice sites by a set of independent random quantities (as in the Anderson model). Variables should have identical distributions to provide a spatial homogeneity in average. If the mean value is finite, then it is equal for all and can be excluded by a shift of the origin of energy , since a random potential enters in the combination . Thus, we can accept without a loss of generality
[TABLE]
as it is made in almost all theoretical papers.
This point can be used in the following manner. A typical quasi-1D system is a bar cutout of a -dimensional lattice and containing randomly located impurities (Fig. 2,a). We can divide it into a series of effective scatterers containing a lot of lattice sites (Fig. 2,b). Each scatterer should provide existence of two effects, (a) a partial reflection of the incident waves, and (b) mixing of channels. It is convenient to imagine these two processes as slightly separated in space (Fig. 2,c), so that there is a region where waves are reflected without mixing of channels, and there are two regions where channels are mixed for the transmitted and back-scattered waves but no reflection occurs. Such assumption is not very essential, since we suppose nothing on the degree of separation and it can be purely symbolic. In fact, the construction in Fig. 2,c corresponds to the canonical representation (2) of the transfer matrix: it is easy to see that the middle matrix in (2) provides a reflection of waves without mixing of channels, while the right and the left matrices provide mixing of channels without reflection of waves.
The middle part of the effective scatterer (Fig. 2,c) can be described by the transfer matrix
[TABLE]
corresponding to the diagonal disorder created by point scatterers on the independent one-dimensional chains, so is a diagonal matrix with real elements , possessing the property (12). Extracting from (14) the factors, not related with scattering, we can accept the following representation for matrix
[TABLE]
where , , , are matrices close to the unit one and elements are small. Accepting the canonical representation (2) for and composing the product (10), one can see that matrices , lead to a small renormalization of matrices and , which can be neglected 444 We do not use the canonical representation (2) in (15), since in this case matrices do not tend to the unit one in the limit , leading to a finite renormalization of and . The use of (14) as a middle matrix of (15) leads to the more tremendous calculations.. It is clear that we should compose the product
[TABLE]
[TABLE]
and reduce it to the canonical form (2).
The assumption of a diagonal disorder for the middle part of an effective scatterer (Fig. 2,c) is not very essential. Indeed, the notion of weak scatterers is inevitable in derivation of the differential equation; in the opposite case only the finite difference equation is possible. Having in mind a description of the Anderson transition, we should work near the band edge of the ideal crystal, since only there a weak disorder is compatible with localization in higher dimensions. Then the de Broglie wavelength and the mean free path are large in comparison with the atomic spacing and the wave function envelope changes slowly. It allows to introduce the coarse description, dividing the system into blocks, small in comparison with the wavelength but containing a lot of the lattice sites, and considering these blocks as the new lattice sites. In the result of such procedure practically any short-ranged random potential reduces to the diagonal Gaussian disorder. Universality arising near the Anderson transition as in other critical phenomena [32, 33] leads to equivalence of its description near the band edge and in the band center.
4. General evolution equation
Let describe the general scheme of deriving the evolution equation, while the calculation details can be found in Appendix . Parameters of the matrix can be found as eigenvalues of the Hermitian ”Hamiltonian” , where
[TABLE]
and can be presented as functions of () in the form of expansion over . Composing the distribution function of , we have
[TABLE]
[TABLE]
Making a change of variables , one can replace integration over by integration over
[TABLE]
while the inverse relations are found by iterations in . Integration over removes the -functions and leads to the result
[TABLE]
In calculation of the Jacobian one discovers that its diagonal elements are of order unity, while non-diagonal elements are of order , so in fact it reduces to the product of diagonal elements. Substituting for and their expansions in and expanding (20) to the second order, we produce averaging according to , and set 555 In coarsening of description discussed in the end of Sec. 3, the variances of individual scatterers are added and their sum is proportional to the volume; it gives a linear dependence on in the quasi-1D geometry. With such definition, the parameter appears to be of the order of the inverse mean free path. . As a result
[TABLE]
where the following functions of are introduced (the primes near the summation signs indicate the absence of terms with )
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
with a definition of matrices
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Equation (21) is the most general form of the DMPK equation: in its derivation we did not use any assumptions on the statistical properties of matrices and , and they even are not obliged to be random. The right hand side of (21) is a sum of full derivatives, which provides the conservation of the total probability.
5. Diagonal forms
Equation (21) is of the diffusion type, with a tensor character of the diffusion coefficient and finite non-diagonal components. In the general form it is rather complicated and hardly suitable for a constructive analysis; so consider its possible simplifications.
Equation (21) is simplified radically, if we assume the diagonal form for matrices and
[TABLE]
We accept also , since the statistical properties of matrices and are usually identical. Then equation (21) reduces to the form (see Appendix )
[TABLE]
[TABLE]
[TABLE]
[TABLE]
which reproduces Eq. 8 in the one-channel case. Conditions for realization of the diagonal approximation can be easily analyzed for the unitary ensemble, when matrices and are averaged independently. If a unitary matrix is restricted by real values of its elements, then it turns into the orthogonal matrix ; to restore the unitary matrix we should add to the elements of the appropriate phase factors. Producing the same manipulations with the matrix , we set
[TABLE]
and obtain after substitution to (23)
[TABLE]
[TABLE]
[TABLE]
If matrices and are completely random, while phases and have nonuniform distributions, then products , are averaged to zero for , providing the diagonal approximation (24) where and are independent, and the trivial result is valid for (see Eq. 28 below). Contrary, if matrices and are not sufficiently random, but phases and are completely stochastic, then we have another diagonal approximation with nontrivial values of and relation ; as a result, the terms with turn to zero and Eq. 25 reduces to the variant (9), suggested by Muttalib et al [16]–[18]. Finally, if both , , and , are completely random, then averaging occurs over the unitary group (see Appendix in [5]) and leads to the results
[TABLE]
[TABLE]
for the unitary and the orthogonal ensembles correspondingly, so equation (9) transforms to the usual DMPK equation (5). 666 For the orthogonal ensemble, the first diagonal approximation (25) is not realized.
Let us discuss the third variant of the diagonal approximation, which we consider as the most actual. It was argued in [15, 34], that for the correct definition of conductance of a finite system it is useful to introduce semi-transparent boundaries, separating the system from the ideal leads attached to it. In the limit of weak transparency one obtains universal equations, independent on the way how the contact resistance of the reservoir is excluded [35] (all formulas of the Landauer type [36]–[40] reduce in this limit to the variant by Economou–Soukoulis [21, 22]), which then can be extrapolated to transparency of order unity. Such definition is surely referred to the system under consideration (and not to the composed system ”sample+ideal leads”) and provides the infinite value of conductance for an ideal system [34].
Suppose that weakly-transparent boundaries are created by point scatterers inserted in one-dimensional chains attached to the system (Fig. 2,a); then its transfer matrix transforms to , i.e.
[TABLE]
where is a diagonal matrix. Reducing (30) to the canonical form (2), one has in the main approximation for large
[TABLE]
[TABLE]
[TABLE]
[TABLE]
where . Since the unitary matrices , , , have restricted elements, then and can be replaced by ; then (31) gives
[TABLE]
For large equations (21–23) reduce to the form analogous to (25), but with another definition of (see Appendix ), . Substitution of (32) into (23) gives , in the limit. For large, but finite the small deviations of from should be taken into account, setting
[TABLE]
where is the Hermitian matrix with small elements. Substituting (33) into (23) and expanding to the second order in , one has
[TABLE]
[TABLE]
[TABLE]
[TABLE]
It is easy to see, that independently of the statistics (in fact, it follows from the general expressions (23)). For large , the quantities are small in magnitude, but there are no other restrictions on their statistics. It is natural to think that fluctuate randomly and their fluctuations are independent of . 777 If matrix contains a dependence on , then this dependence manifests only in terms of order , which are neglected in (34). Then pair products , , with are averaged to zero, and the matrix becomes diagonal. As a result, equation (21) accepts the form
[TABLE]
[TABLE]
[TABLE]
[TABLE]
and has the same structure as (25), but with different definition of parameters. Since are small for large , parameters are surely finite and large in magnitude.
The first two diagonal approximations look somewhat artificial. If matrices and are completely random, then we return to the initial equation (5). If and are not sufficiently random, then a tendency to the non-diagonal situation arises: we do not see serious grounds why should be more random than or vice versa. Contrary, the third variant of a diagonal approximation looks quite natural: existence of weakly transparent boundaries restricts mutual fluctuations of and , but beyond these restrictions they are considered as completely random. Simultaneously, all situation with the definition of conductance becomes logically consistent.
It is well-known [2, 5], that equation (5) is easily solved in the limit of large , when parameters are large and obey hierarchy ; then reduces to the product of powers of and equation (5) splits into independent equations. Applying the same procedure to equation (35), we find the independent Gaussian distributions for quantities defined by their first two moments:
[TABLE]
[TABLE]
which for coincides with results of [14, 17]. In the approximation of equivalent channels one can set , , , and equation (35) is determined by three parameters , , ; in particular,
[TABLE]
and parameters , can be easily estimated from numerical data on Lyapunov exponents (see e.g. [41, 42]). One can see from formula (32) of the paper [42] that relation for the minimal exponent ( in our notation) is valid in the metallic regime but violated in other cases; hence the parameter is finite beyond the metallic phase. 888 The formula (4.5) of the paper [18] contains the more general expression for , reflecting a violation of the strong hierarchy of in the quasi-3D geometry; it reduces to results of [14, 17] in the limit for fixed , which is a proper limit for a definition of the Lyapunov exponents. Probably, in conditions of the paper [18] the matrix was diagonal with nonzero elements ; as a result, finiteness of was compensated by redefinition of and did not affect the quality of fitting on the basis of formula (4.5).
As clear from derivation, the structure of equation (35) is the same for the unitary and orthogonal ensembles; correspondingly, becomes a free parameter not related with the Wigner–Dyson values, and in the general case transforms to a matrix 999 At first glance, for non-integer we meets with violation of the repulsion law for two nearest levels at their anomalous approaching. In fact (see discussion after formula (6)), correlation of levels if determined by the Jacobian only in the region of small , where coincides with its Wigner–Dyson value.. It is clear that the ”pure” Wigner–Dyson ensembles loose their actuality beyond the metallic phase, and in particular are not adequate for description of the Anderson transition. The latter circumstance is not accounted for in the existent versions of the sigma-models [8, 9, 10], which are equivalent to the simplest equation (5) and require modification for incorporation of the discussed generalizations. The only exclusion is the situation for , where universality arising near the critical point approximately corresponds to universality specific for the metallic phase, which is adequately described by equation (5). It provides validity of results in the main -approximation but remains the open question on their validity in higher orders.
6. Conclusion
In the present paper we derive the DMPK equation under minimal assumptions on the properties of channels. It is of the diffusion type with a tensor character of the diffusion coefficient and nonzero off-diagonal components. We suggest three variants of the diagonal approximation, one of which reproduces the usual DMPK equation and its generalization suggested in [16]–[18]. Two other variants lead to equations of the same structure and contain additional terms specified by parameters .
The most general form of the DMPK equation, given by Eq. 21, probably is not very actual: it should be used as a starting point for new statistical hypotheses, which were adequate for description of the Anderson transition. The methods used in numerical experiments allow to calculate matrices and [19], and analyzing their statistical properties establish the form of matrices , , , . Numerical analysis undertaken in the context of equation (9) [18, 31] 101010 It should be noted that the present paper clarifies the conditions for validity of equation (9); in particular, self-averaging of , discussed in details by the authors of [31], in fact is of no significance., points out the realization of the diagonal approximation and deviation of parameters from their Wigner–Dyson values; a finiteness of parameters follows from Eq.32 of [42]. It is desirable to continue such analysis on the basis of the general expressions (23). On the other hand, mathematical methods developed for analysis of the usual DMPK equation [3, 4, 5], can be used for deriving more general relations; existence of large parameters may provide new possibilities.
Appendix A. Derivation of the evolution equation
Parameters of the matrix can be found as eigenvalues of the Hermitian ”Hamiltonian” (see (17)), which has the matrix elements 111111 All calculations are produced to the second order in . The imaginary unit enters in the several expressions as a factor and is easily distinguished from indices.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Eigenvalues of the matrix are calculated by the usual perturbation theory
[TABLE]
and have a form of expansion in
[TABLE]
[TABLE]
with the coefficients
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Composing the distribution (18) and making a change of variables , one comes to Eq. 20, where the inverse functions are found by iterations in
[TABLE]
[TABLE]
Integration over removes the -functions and leads to the result (20). The Jacobian matrix has diagonal elements of order unity and off-diagonal elements of order ,
[TABLE]
[TABLE]
[TABLE]
so its determinant reduces to the product of diagonal elements. It is calculated according to the scheme
[TABLE]
and takes a form
[TABLE]
where
[TABLE]
[TABLE]
[TABLE]
Now we can make the expansion
[TABLE]
[TABLE]
where
[TABLE]
[TABLE]
Substituting () into (20) and averaging according , , one has
[TABLE]
[TABLE]
[TABLE]
which can be transformed to Eqs. 21–23.
Appendix B. Simplification of equation (21).
In the diagonal approximation (24) equation (21) accepts a form
[TABLE]
[TABLE]
[TABLE]
The sum over can be transformed using the identity [2]
[TABLE]
which is valid for a symmetrical matrix . It allows to simplify the combination
[TABLE]
[TABLE]
and reduce to the form (25). If a symmetry requirement for is ignored in , then it is easy to arrive at a false conclusion that are independent of and determined by parameters .
In the case of weakly transparent boundaries, parameters are large and expansions over are possible with retaining the first two terms; in particular,
[TABLE]
[TABLE]
and one has in Eq. 22
[TABLE]
[TABLE]
where . Having in mind that relation holds usually, we neglect the second term in the right hand side, but retain the symmetric definition for . Using , we can reduce (21), (22) to a form (35).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] O. N. Dorokhov, JETP Letters 36 , 318 (1982).
- 2[2] P. A. Mello, P. Pereyra, N. Kumar, Ann. Phys. (N.Y.) 181 , 290 (1988).
- 3[3] P. A. Mello, A. D. Stone, Phys. Rev. B 44 , 3559 (1991).
- 4[4] A. M. S. Mac e ^ ^ e \rm\hat{e} do, J. T. Chalker, Phys. Rev. B 46 , 14985 (1992).
- 5[5] C. W. J. Beenakker, Rev. Mod. Phys. 69 , 731 (1997).
- 6[6] M. L. Mehta, Random Matrices, Fcfltvic, New York, 1991.
- 7[7] P. W. Brouwer, K. Frahm, Phys. Rev. B 53 , 1490 (1996).
- 8[8] K. B. Efetov, Adv. Phys. 32 , 53 (1983).
