Markov Duality for Stochastic Six Vertex Model
Yier Lin

TL;DR
This paper establishes a Markov duality for the stochastic six vertex model by extending Schütz's ASEP duality functional, introducing a novel induction-based proof method.
Contribution
It introduces a new induction-based method to prove Markov duality for the stochastic six vertex model, extending existing ASEP duality results.
Findings
Proves Markov duality for the stochastic six vertex model.
Extends Schütz's ASEP duality functional to this model.
Introduces a novel induction method for duality proofs.
Abstract
We prove that Sch\"{u}tz's ASEP Markov duality functional is also a Markov duality functional for the stochastic six vertex model. We introduce a new method that uses induction on the number of particles to prove the Markov duality.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Algebraic structures and combinatorial models · Advanced Combinatorial Mathematics
\AtAppendix\AtAppendix\AtAppendix
Markov Duality for Stochastic Six Vertex Model
Yier Lin
Y. Lin, Department of Mathematics, Columbia University, 2990 Broadway, New York, NY 10027
Abstract.
We prove that Schütz’s ASEP Markov duality functional is also a Markov duality functional for the stochastic six vertex model. We introduce a new method that uses induction on the number of particles to prove the Markov duality.
1. Introduction
1.1. Stochastic six vertex model
The stochastic six vertex model (S6V model) is a classical model in 2d statistical physics first introduced by Gwa and Spohn [GS92], as a special case of the six vertex (ice) model (see for example [Lie74] and [Bax16]). We associate each vertex in with six types of configurations with weights parametrized by , see Figure 1. The configurations chosen for two neighboring vertices need to be compatible in the sense that the lines keep flowing. We consider the lines from the south and the west as the inputs and the lines to the north and the east as the outputs. Each vertex is conservative in the sense that the number of input lines equals the number of output lines. The model is stochastic in the sense that when we fix the inputs, the weights of possible configurations sum up to .
The S6V model is a member of the KPZ universality class (see [Cor12, Qua11] for a nice survey). We briefly review a few results that have been obtained for the S6V model. [BCG16] proves that under step initial condition, the one point fluctuation of the S6V model height function is asymptotically Tracy-Widom GUE. One point fluctuations of the S6V model under more general initial condition including stationary is obtained in [AB16, Agg18]. In a slightly different direction, [CGST18] shows that under scaling with fixed, the fluctuation of the S6V model height function converges weakly to the solution of KPZ equation. More recently, [BG18, ST18] showed under a different scaling, the height fluctuation of the S6V model converges in finite dimensional distribution to the solution of stochastic telegraph equation.
In this paper, we consider the S6V model as an interacting particle system (see [GS92] or Section 2.2 of [BCG16]). Consider vertex configurations in the upper half-plane, we restrict ourselves to the boundary condition that there are no lines coming from the left boundary. Then the input lines coming from the bottom of the horizontal axis can be viewed as the trajectories of an exclusion-type particle system. We see the vertical axis as time variable and horizontal axis as space variable. The vertex configurations compose several paths, which can be viewed as trajectories of particles with the vertical lines denoting the particle location. As illustrated by Figure 2, we cut our plane by the line and the particle location at time is give by the intersections (red points in the figure) of these trajectories with . To rigorously define our interacting particle system, we first introduce the following state spaces.
Definition 1.1**.**
We define the space of left-finite particle occupation configuration to be
[TABLE]
where is understood as the number of particles (either zero or one) at location . We also define the space of left-finite particle location configuration to be
[TABLE]
where stands for the location of -th particle counting from the left. Note that there might be infinite or finite number of particles in our particle configuration. For the latter case, there exists some so that for . It is straightforward that there is a bijection defined by
[TABLE]
Having specified our state space, we proceed to define the particle interpretation of the S6V model as the following discrete-time Markov processes. The following definition is similar as the one that appears in Section 2.1 of [CGST18].
Definition 1.2**.**
We define the S6V location process, which is a discrete-time -valued Markov process with the update rule (transition probability) from to specified as follows:
We denote for any , this is just a convention to simplify the notation. We sequentially consider and update as following independent probabilities:
(a) When , we update to via
[TABLE]
(b) When , we update to via
[TABLE]
We also define the S6V occupation process by setting i.e.
[TABLE]
Clearly, is a discrete-time -valued Markov process. We remark that the occupation process and location process are just two ways to describe the particle interpretation of the S6V model.
For a left-finite particle configuration , we define the height function to be the total number of particles in this particle configuration that is on the left or at location , i.e.
[TABLE]
Our result is a Markov duality between the S6V occupation process and its space reversal, which we define below:
Definition 1.3**.**
Define the space of reversed -particle location configuration . The reversed -particle S6V location process is a -valued Markov process so that has the same update rule as the S6V location process.
1.2. Markov Duality
Definition 1.4**.**
Given two discrete (continuous) time Markov processes and and a function , we say that and are dual with respect to if for any and ( for continuous time), we have
[TABLE]
Here we use to denote that we take the expectation under initial condition . Likewise, represents the expectation with initial condition .
Markov duality has been found for different interacting particle systems including the contact process, voter model and symmetric simple exclusion process, see [Lig12, Lig13]. It also plays an important role in the analysis of models in the KPZ universality class. The first such example is the asymmetric simple exclusion process (ASEP), which is an interacting particle system on with at most one particle at each site. Each particle jumps to the left with rate and jumps to the right with rate . If the site is already occupied by another particle, the jump is excluded.
We consider ASEP as a process , where is an indicator for the event that at time , a particle is at site . We call the ASEP occupation process. When the ASEP has finite -particles, in terms of particle location, we also consider the -particle ASEP location process where denotes the location of -th particle counting from the right at time .
Schütz [Sch97] derived the following ASEP duality using a spin chain representation: For any fixed , the ASEP occupation process and the -particle ASEP location process with the jump rate and reversed are dual with respect to the duality functional
[TABLE]
where . This generalizes the the Markov duality satisfied by the symmetric simple exclusion process [Lig12] where and are set to be equal. We call (1.1) Schütz’s ASEP Markov duality functional.
[BCS14] uses a different approach to prove Schütz’s result by directly applying the Markov generator on the duality functional. Further, they use this method to show that the processes and are also dual with respect to the functional
[TABLE]
The ASEP is a continous time limit of the S6V model if we scale the parameter by , and scale time by and shift the space to the right by , see [BCG16, Agg17]. Given the ASEP is the limit of the S6V model and enjoys the Markov duality with respect to the functionals in (1.1) and (1.2), one might wonder if these functionals are the Markov duality functionals for the S6V model as well. Indeed, by setting , [CP16, Theorem 2.21] justifies that the S6V occupation process and the reversed -particle S6V location process are dual with respect to the functional in (1.2)111In fact, [CP16, Theorem 2.21] proves the duality for a higher spin generalization of S6V model called stochastic higher spin vertex model..
Our main result shows that the S6V model also enjoys a Markov duality with respect to the functional in (1.1).
Theorem 1.5**.**
Consider the S6V model with parameter and set . For any , the S6V occupation process and the reversed -particle S6V location process are dual with respect to the function given in (1.1).
Remark 1.6**.**
We remark that the Markov duality in Theorem 1.5 appeared first in [CP16, Theorem 2.23] and was later used in proving [CGST18, Proposition 5.3]. In fact, [CP16, Theorem 2.23] claims a Markov duality for the stochastic higher spin vertex model (see [CP16, Section 2] for definition), which is a higher spin generalization of the S6V model (the stochastic higher spin vertex model has vertical and horizontal spin , where . When , it degenerates to the S6V model). However, this Markov duality is false when . In fact, the author of the present paper found a counterexample which is recorded in the erratum [CPa]. For the S6V model, the Markov duality holds but the proof of [CP16, Theorem 2.23] still breaks down222The proof of [CP16, Theorem 2.23] claims that the S6V duality (1.1) can be deduced by taking the discrete gradient of the Markov duality functional in (1.2), which is not true when the number of particles in Theorem 1.5 is larger than [CPb].. In this paper, we offer the first correct proof for this Markov duality.
Remark 1.7**.**
It appears that the proof of Theorem 1.5 also adapts to the space inhomogeneous stochastic six vertex model, where we allow the parameters , in Figure 1 to vary at different locations and are expressed by and . Under the condition that there exists such that for all , Theorem 1.5 holds for this space inhomogeneous stochastic six vertex model as well. To avoid extra notation, we have opted not to state and prove this more general result here.
Remark 1.8**.**
It is natural to ask whether our method produces duality for stochastic higher spin vertex model with vertical and horizontal spin . Using fusion, one can prove the same duality in Theorem 1.5 holds if we take and (the proof is similar to that of [CP16, Corollary 3.2]). For , as presented in Section 2, our proof relies heavily on the structure of the duality functional (1.1) and the particle exclusion property (i.e. at most one particle allows to stay in each location) of the S6V model. It is unclear how to adapt our method proving duality for stochastic higher spin vertex model such as [Kua18, Theorem 4.10], since both of the duality functional and the model become more complicated.
Duality has been obtained for generalization of the ASEP and S6V model using algebraic methods, see [BS15a, BS15b, CGRS16, Kua16, Kua17, Kua18]. In particular, [CGRS16] proves two ASEP (which is a higher spin generalization of ASEP) dualities based on the higher spin representations of . In the spirit of [CGRS16], duality has also been proved for multi-species version of ASEP [BS15b, Kua17]. [Kua18] obtains a duality for the multi-species version of the stochastic higher spin vertex model via an algebraic construction. Instead of using algebraic tools to prove duality, our proof of Theorem 1.5 follows a straightforward induction approach.
We remark that the duality functional from [Kua18, Theorem 4.10] has a degeneration to S6V model. We state this degeneration here as a lemma.
Proposition 1.9**.**
Consider the S6V model with parameter and set . For any , the S6V occupation process and the reversed -particle S6V location process are dual with respect to the functional
[TABLE]
We remark that there is a misstatement in [Kua18, Theorem 4.10]. The particles in the process and were stated to jump to the left and to the right respectively (see pp.164 of [Kua18]). However, after discussing with the author, we realize that the correct statement is that the particles in jump to the right and those in jump to the left [Kua].
Proof.
Taking the spin parameter for all and species number and substituting by , the multi-species higher spin vertex model considered in [Kua18, Theorem 4.10] degenerates to the stochastic six vertex model (see [Kua18, Section 2.6.2] for detail). Referring to the duality functional considered in [Kua18, Theorem 4.10] (note that is the configuration for the process and is the configuration for the process ). By substituting the configuration by and the configuration by the -particle location configuration , we obtain that the reversed S6V occupation process (with particles jumping to the left) is dual to the -particle S6V location process (with particles jumping to the right) with respect to the functional
[TABLE]
In the last equality we used the fact that . Since and are nothing but the space reversal of and in the lemma. After swapping the role of left and right (then is exactly the height function ), we readily obtain the duality in (1.3). ∎
When we take in Theorem 1.5, our duality can be simply derived by subtracting the functional in (1.2) by the functional in (1.3) (see Lemma 2.1). However, it appears that when , there is no easy way to obtain our duality by combining the duality functionals in (1.2) and (1.3).
Finally, we explain several applications of our duality. Theorem 1.5 combined with the other S6V duality (1.2) are the main tools for proving the self-averaging property of the specific quadratic function of the S6V height function in [CGST18, Proposition 5.3], which is the crux in proving the convergence of stochastic six vertex model to KPZ equation. In a different direction, by using duality, we can compute the exact moment formula of certain observables of our model. [BCS14] uses the Schütz-type duality of ASEP to derive the moment generating function of the ASEP height function under Bernoulli step initial data. Applying a similar approach, we expect by using our duality and the S6V model Bethe ansatz eigenfunction given by [CP16, Proposition 2.12], we can reprove the moment formula appearing in [BCG16, Theorem 4.12] and [AB16, Theorem 4.4]. Since this application of the duality is not related to our paper, we do not pursue to give the proof here.
1.3. Acknowledgment
The author wants to thank Ivan Corwin for his helpful discussions and comments pertaining to the earlier draft of this paper and also for his advice on the aspect of paper writing. We wish to thank Promit Ghosal for his useful comments and Jeffrey Kuan for his helpful discussion about the result in his paper. We are also grateful to the anonymous referees for their valuable suggestions. The author was partially supported by the Fernholz Foundation’s “Summer Minerva Fellow” program and also received summer support from Ivan Corwin’s NSF grant DMS:1811143.
2. Proof of Theorem 1.5
In this section, we prove Theorem 1.5. We first introduce several notations for our proof. Define the space of -particle location configuration
[TABLE]
We also denote by , and the number of particles in the particle configuration , and (obviously when ) respectively.
Referring to the Definition 1.2 of Markov duality, we need to show that for any and under any initial states , , we have
[TABLE]
By Markov property, it suffices to prove that the preceding equation holds for , namely, for any and any , , we have
[TABLE]
Observing that is finite (since ) whereas can either be finite or infinite. We claim that it suffices to prove (2.1) for all such that is finite, here is the reason: Suppose we have proved (2.1) for every with . For and such that , we consider a particle configuration that corresponds with in the following way:
[TABLE]
Clearly, and hence \mathbb{E}^{\vec{g^{\prime}}}\big{[}H(\vec{g}(1),\vec{y})\big{]}=\mathbb{E}^{\vec{y}}\big{[}H(\vec{g^{\prime}},\vec{y}(1))\big{]}. Additionally, observing that the particles in the configuration which are on the right of have no contribution to both of the expectations \mathbb{E}^{\vec{g}}\big{[}H(\vec{g}(1),\vec{y})\big{]} and \mathbb{E}^{\vec{y}}\big{[}H(\vec{g},\vec{y}(1))\big{]} (see Figure 3), thus
[TABLE]
We conclude that \mathbb{E}^{\vec{g}}\big{[}H(\vec{g}(1),\vec{y})\big{]}=\mathbb{E}^{\vec{y}}\big{[}H(\vec{g},\vec{y}(1))\big{]} also holds for all with .
It remains to prove (2.1) when is finite. In other words, we need to prove (2.1) for all and all , satisfying . We apply induction according to . The first thing is to show that (2.1) holds when , as the induction basis.
Lemma 2.1**.**
When , (2.1) holds.
Proof of Lemma 2.1.
Since , we have either and , or . Note that
[TABLE]
If and , since has only one non-zero component and , we have for any and thus \mathbb{E}^{\vec{g}}\big{[}H(\vec{g}(1),\vec{y})\big{]}=0. Similarly, we have \mathbb{E}^{\vec{y}}\big{[}H(\vec{g},\vec{y}(1))\big{]}=0 hence the desired equality holds.
If , we note that , where and are given by (1.2) and (1.3). Since the subtraction of two duality functionals is still a duality functional (which follows from Definition 1.2), we obtain the desired (2.1). ∎
Before explaining how the induction works, we slightly reformulate (2.1). In order to keep track of the location of the particles, we utilize the S6V location process in Definition 1.2. Via the bijection (see Definition 1.1), we identify a configuration with and define the function as
[TABLE]
By the relation between the S6V occupation process and the S6V location process , (2.1) can be paraphrased into the following:
For any and any initial states and , we have
[TABLE]
When , (2.2) is established via Lemma 2.1. When , we define our induction hypothesis as
[TABLE]
It suffices to prove (2.2) for any and under (). We briefly explain our strategy: We decompose the LHS expectation of (2.2) into a combination of the expectations which are in the form of \mathbb{E}^{\vec{x^{\prime}}}\big{[}\widetilde{H}(\vec{x^{\prime}}(1),\vec{y^{\prime}})\big{]} with . A similar decomposition occurs in the RHS expectation. By applying (), we get the desired (2.2).
In the sequel, when there is only one particle in the S6V location process, we denote by the one particle transition probability from location to . Similarly denotes the one particle transition probability from to for the reversed S6V location process. Clearly, and
[TABLE]
In the sequel, we will frequently use the following elementary fact.
Lemma 2.2**.**
Consider S6V location processes
[TABLE]
and , if , then
[TABLE]
If , then
[TABLE]
where
Proof.
We only prove for (2.3), the proof of (2.4) is similar. Knowing , the particles at update as an independent S6V model (see Figure 4).
Note that the probability of is , furthermore, knowing , we have
[TABLE]
Therefore,
[TABLE]
which is the desired (2.3). ∎
Proof of Theorem 1.5.
We denote the processes in (2.2) by \vec{x}(t)=\big{(}x_{1}(t)<\cdots<x_{\ell}(t)\big{)}, and the initial states by , . We split our proof into different cases depending on the relation fo and .
Case (1):
Denote by the positive integer satisfying . We consider the S6V location processes
[TABLE]
and the reversed S6V location processes, see Figure 5 (here we do not put arrow on since it only has one particle).
[TABLE]
Observing and are both less than , hence via ()
[TABLE]
To prove (2.2), it suffices to show
[TABLE]
We first show (2.5). Observing that by the update rule of defined in Definition 1.2, if , thus
[TABLE]
Knowing , we can treat the particles at as an independent S6V model. Therefore, it is straightforward that
[TABLE]
The factor comes from knowing ,
[TABLE]
Using the fact that \mathbb{E}^{\vec{x^{\prime}}}\big{[}\widetilde{H}(\vec{x^{\prime}}(1),y^{\prime})\mathbbm{1}_{\{x_{s}(1)=y_{k}\}}\big{]}=\mathbb{E}^{\vec{x^{\prime}}}\big{[}\widetilde{H}(\vec{x^{\prime}}(1),y^{\prime})\big{]} in (2.7), we conclude (2.5).
Likewise, to show (2.6), we have
[TABLE]
Using the fact that
[TABLE]
we conclude (2.6).
Case (2):
We divide our discussion into three sub-cases.
Case (2a):
In this case, let us consider the S6V location process and the reversed S6V location process
[TABLE]
Since , the induction hypothesis () gives \mathbb{E}^{\vec{x^{\prime}}}\big{[}\widetilde{H}(\vec{x^{\prime}}(1),\vec{y^{\prime}})\big{]}=\mathbb{E}^{\vec{y^{\prime}}}\big{[}\widetilde{H}(\vec{x^{\prime}},\vec{y^{\prime}}(1))\big{]}. To prove (2.2), it suffices to show
[TABLE]
We first justify (2.8). Since and starts from , it follows from the update rule that unless . Thus,
[TABLE]
Using Lemma 2.2, we conclude (2.8). Under the same reasoning,
[TABLE]
which concludes (2.9).
Case (2b):
The proof for this case is more involved than the previous ones. We consider the S6V location processes and reversed S6V location process (see Figure 6)
[TABLE]
To simplify our notation, we denote
[TABLE]
Since are all less than , we have by induction hypothesis ()
[TABLE]
To prove (2.2), it suffices to show that
[TABLE]
Let us show (2.11) first. Since , by Lemma 2.2,
[TABLE]
Expanding the LHS expectation of (2.2) as following (according to the update rule, can not exceed )
[TABLE]
For the first term on the RHS of (2.14), given , the particles at update as an independent S6V model. Using the fact that
[TABLE]
and knowing
[TABLE]
one has
[TABLE]
Let us compute the second term on the RHS of (2.14),
[TABLE]
For and with , we have
[TABLE]
Plugging the expression on the RHS of (2.17) into (2.16) gives
[TABLE]
Consequently,
[TABLE]
It is straightforward that
[TABLE]
Substituting this back to (2.18) gives
[TABLE]
Note that the LHS of (2.15) and (2.19) are the first and second terms on the RHS of (2.14), one has
[TABLE]
Therefore, we conclude (2.11). Note that in the last line above we used the provided by (2.13) and the relation ,
We turn our attention to demonstrate (2.12). Since , according to the update rule of with initial state , the only possible case for is either or . Therefore, we have
[TABLE]
For the first term on the RHS of (2.20), we readily have
[TABLE]
The first equality above is due to the fact that the condition implies .
For the second term on the RHS of (2.20), we expand the expectation (using the condition )
[TABLE]
where
[TABLE]
It is easy to check that given and with condition and implies
[TABLE]
Therefore,
[TABLE]
Using \mathbb{E}^{\vec{y^{\prime}}}\big{[}\widetilde{H}(\vec{x^{\prime\prime}},\vec{y^{\prime}}(1))\big{]}=R_{3},
[TABLE]
Similarly, given and with and , we have
[TABLE]
Thus,
[TABLE]
Consequently, one has
[TABLE]
Under event , we have and hence
[TABLE]
We obtain from (2.24) that
[TABLE]
Recall (2.22) that \mathbb{E}^{\vec{y}}\big{[}\widetilde{H}(\vec{x},\vec{y}(1))\mathbbm{1}_{\left\{y_{k}(1)=x_{1}\right\}}\big{]}=\mathbf{A}+\mathbf{B}, using (2.23) and (2.25), we get
[TABLE]
Note that the LHS of (2.21) and (2.26) are the first and second term on the RHS of (2.20), hence
[TABLE]
We have proved the desired (2.12), thus concluding (2.2) for the case .
It only remains to prove (2.2) for the following case:
Case (2c): .
The computation for this case is similar to Case (2b). Let us consider the S6V location processes
[TABLE]
and denote
[TABLE]
[TABLE]
To conclude (2.2), it suffices to show that
[TABLE]
To prove (2.28), we first write
[TABLE]
Similar as (2.15), the first term on the RHS of (2.30) can be expressed as
[TABLE]
while the second term on the RHS of (2.30) equals
[TABLE]
Given \vec{z}=\big{(}z_{1}<\cdots<z_{\ell}\big{)} and \vec{z^{\prime}}=\big{(}z_{2}<\cdots<z_{\ell}\big{)} with , we have
[TABLE]
Substituting back to (2.32) yields
[TABLE]
Consequently,
[TABLE]
By Lemma 2.2, we have
[TABLE]
Using (2.33),
[TABLE]
Plugging (2.31) and (2.34) into the RHS of (2.30) yields
[TABLE]
We conclude (2.28). Here, in the last line above we used again the relation .
We turn to demonstrate (2.29). As implies , we have
[TABLE]
Since implies ,
[TABLE]
Consequently,
[TABLE]
For the second term on the RHS of (2.35), we have
[TABLE]
Since , given and satisfying , we have
[TABLE]
Substituting back to the RHS of (2.36) yields
[TABLE]
Consequently,
[TABLE]
Under event , we have and accordingly
[TABLE]
Therefore, we have by (2.37)
[TABLE]
Substituting (2.38) back to (2.35) entails
[TABLE]
which concludes the desired (2.29).
As all the possible cases for and were discussed, we justify (2.2) and conclude Theorem 1.5. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AB 16] Amol Aggarwal and Alexei Borodin. Phase transitions in the ASEP and stochastic six-vertex model. ar Xiv:1607.08684, to appear in Annals of Probability , 2016.
- 2[Agg 17] Amol Aggarwal. Convergence of the stochastic six-vertex model to the ASEP. Mathematical Physics, Analysis and Geometry , 20(2):3, 2017.
- 3[Agg 18] Amol Aggarwal. Current fluctuations of the stationary ASEP and six-vertex model. Duke Mathematical Journal , 167(2):269–384, 2018.
- 4[Bax 16] Rodney J Baxter. Exactly solved models in statistical mechanics . Elsevier, 2016.
- 5[BCG 16] Alexei Borodin, Ivan Corwin, and Vadim Gorin. Stochastic six-vertex model. Duke Mathematical Journal , 165(3):563–624, 2016.
- 6[BCS 14] Alexei Borodin, Ivan Corwin, and Tomohiro Sasamoto. From duality to determinants for q-TASEP and ASEP. The Annals of Probability , 42(6):2314–2382, 2014.
- 7[BG 18] Alexei Borodin and Vadim Gorin. A stochastic telegraph equation from the six-vertex model. ar Xiv preprint ar Xiv:1803.09137 , 2018.
- 8[BS 15a] Vladimir Belitsky and Gunter M Schütz. Quantum algebra symmetry of the asep with second-class particles. Journal of statistical physics , 161(4):821–842, 2015.
