Concavity analysis of the tangent method
Bryan Debin, Etienne Granet, Philippe Ruelle

TL;DR
This paper provides a universal geometric framework explaining the tangent method's effectiveness in determining arctic curves, highlighting the role of entropy concavity in lattice path models.
Contribution
It introduces a general geometric explanation for the tangent method's tangency property based on entropy concavity, extending the proof to q-deformed paths.
Findings
Entropy concavity underpins the tangent method's tangency property.
The framework applies to models with directed lattice paths.
Extension of proof to q-deformed lattice paths.
Abstract
The tangent method has recently been devised by Colomo and Sportiello (arXiv:1605.01388 [math-ph]) as an efficient way to determine the shape of arctic curves. Largely conjectural, it has been tested successfully in a variety of models. However no proof and no general geometric insight have been given so far, either to show its validity or to allow for an understanding of why the method actually works. In this paper, we propose a universal framework which accounts for the tangency part of the tangent method, whenever a formulation in terms of directed lattice paths is available. Our analysis shows that the key factor responsible for the tangency property is the concavity of the entropy (also called the Lagrangean function) of long random lattice paths. We extend the proof of the tangency to -deformed paths.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Concavity analysis of the tangent method
Abstract
The tangent method has recently been devised by Colomo and Sportiello [12] as an efficient way to determine the shape of arctic curves. Largely conjectural, it has been tested successfully in a variety of models. However no proof and no general geometric insight have been given so far, either to show its validity or to allow for an understanding of why the method actually works. In this paper, we propose a universal framework which accounts for the tangency part of the tangent method, whenever a formulation in terms of directed lattice paths is available. Our analysis shows that the key factor responsible for the tangency property is the concavity of the entropy (also called the Lagrangean function) of long random lattice paths. We extend the proof of the tangency to -deformed paths.
Bryan Debin1, Etienne Granet2,3 and Philippe Ruelle1
1*Institut de Recherche en Mathématique et Physique
Université catholique de Louvain, Louvain-la-Neuve, B-1348, Belgium*
2*Institut de Physique Théorique, Paris Saclay, CEA, CNRS,
F-91191 Gif-sur-Yvette, France
3*Laboratoire de Physique Théorique, Département de Physique de l’ENS,
École Normale Supérieure, Sorbonne Université, CNRS, PSL Research University, F-75005 Paris, France*
1 Introduction
The phenomenon of phase separation and the emergence of limit shapes has been pointed out for the first time in the study of random domino tilings of Aztec diamonds [1, 2]. In this system, the random tilings exhibit two distinct phases, one external frozen (or solid) phase where the dominos are deterministically arranged as a brickwork, and one inner entropic, temperate (or liquid) phase with a non-zero density of horizontal dominos. The two phases are spatially separated by an interface which becomes sharper and sharper in the scaling limit, and converges to a circle, called the arctic circle. Since then, arctic phenomena generally refer to phase separations and arctic curves to lines separating distinct phases.
Arctic phenomena have been observed in many other models, including lozenge tilings of hexagons [3], the 6-vertex model with domain wall boundary conditions [4, 5], alternating sign matrices [6], lozenge tilings of polygonal domains [7, 8] and the 2-periodic Aztec diamond [9]. For dimer models on bipartite planar graphs, rather general results have been obtained in [10, 11]. The results obtained in these various models are technically involved and usually based on the calculation of appropriate bulk observables; in addition the methods used are specific to each model.
Recently a general strategy has been proposed by Colomo and Sportiello [12]. The tangent method, as it is known, was the result of a surprising geometric observation: these authors noticed that the arctic curve computed in [4] for the 6-vertex model is the caustic of a family of straight lines determined by a one-point boundary observable. It was then elevated to a universal geometric principle and successfully checked in a number of situations where it indeed reproduced known results [12, 13, 14, 15, 16, 17]. It has been subsequently used to make predictions in cases the analytic shape of the arctic curve was not known, see [12, 14, 17, 18, 19, 20].
The tangent method is as yet a heuristic but efficient method to obtain the analytic shape of arctic curves and is based on a few unproved though reasonable assumptions. The method itself as well as the underlying assumptions will be recalled in detail in Section 5. Let us simply say for now that it describes a way to produce a one-parameter family of curves (often straight lines), all tangent to the desired arctic curve, which can then be recovered as the envelope of that family. The concrete implementation of the method is model dependent but its principle is believed to be fairly universal. A minimal requirement is that the model under consideration can be formulated in terms of non-crossing (i.e. non-intersecting or osculating) random lattice paths. Apart from that, the domain of applicability is not precisely known.
A general, model independent proof of the assumptions which make the tangent method work is probably challengingly hard as it would be close to giving the shape of the arctic curve itself. Reference [13] presents a first rigorous (and rather technical) proof of these assumptions in a special model, namely the 6-vertex model at the ice point with particular boundary conditions. However the assumptions, namely that the arctic curve is deterministic in the scaling limit and is not affected by the displacement (or the addition) of a random path, can be separated from the conjecture stating the tangency property of the family of curves. The point of view which we propose here is that the tangency part of the tangent method may be addressed in a universal way, within the framework defined below.
Our approach may be summarized as follows. If one assumes that the arctic curve is deterministic and fixed, there remains only one random lattice path, travelling in a deterministic domain , bordered by pieces of the arctic curve and by the boundary in which the model is defined (the random path can be thought of as an extra path we introduce by hand). In known examples, this random path is made of elementary steps on a graph (usually ): in the Aztec diamond for instance, it is a Schröder path with steps , and , whereas for the 6-vertex model, it is a rotated Dyck path with possible steps and . As seen in the last two examples, the paths are directed, a condition we will assume throughout.
The problem is then reduced to study, in the scaling limit, the probability distribution of a random path or random walk that stays inside the domain (it cannot cross the arctic curve due to the non-crossing property of the paths underlying the arctic curve), the distribution being induced by the weights associated to the elementary steps, in finite number. While the endpoint is fixed and attached to the arctic curve, the starting point can be varied and taken away from the arctic curve. The distribution over random paths can be complicated on the lattice but dramatically simplifies in the scaling limit. By using a variational approach, we write the measure on continuous trajectories in terms of a classical action, and find that, in the scaling limit, this measure condensates on the unique trajectory which maximizes the action. We show that the unicity of the maximizing trajectory as well as its complete and explicit description follow from a central observation: the Lagrangean function defining the classical action is a strictly concave function, a property which we prove to hold for any directed random walk (on for simplicity).
Applied to what the tangent method prescribes to do, these results imply that the scaling limit of the random path in is, with probability 1, a straight line that hits the arctic curve tangentially and then coindices with it up to the (fixed) endpoint. Varying the starting point then yields a family of straight lines tangent to the arctic curve. We extend these results to the case where each random path receives an additional -dependent weight taking into account the area below it, the so-called -deformed case, considered in [21, 22, 17, 19]. The main conclusion remains, namely after an appropriate rescaling of , the random path in collapses in the scaling limit onto a deterministic curve (it is no longer a straight line) which meets the arctic curve tangentially.
The plan of the paper is as follows. Section 2 reviews the asymptotic combinatorics of directed random paths, introduces the entropy of paths with prescribed starting and ending points, and proves the main property underlying our analysis, namely that the entropy, soon to become a Lagrangean function, is a strictly concave function. In preparation for what has to come later, Section 3 computes passage probability distributions of random paths in the scaling limit. The variational approach, by which the measure of all the lattice paths which in the scaling limit collapse to a single continuous trajectory is written in terms of a classical action, is detailed in Section 4. Our main result, namely the tangency part of the tangent method, is presented in Section 5, while Section 6 contains a critical discussion of the method used to prove the tangency. The extension of the tangency property to the -deformed case is carried out in Section 7.
2 Partition functions for directed walks
The general framework which we consider is that of lattice statistical models which can be formulated entirely in terms of non–crossing random paths, and showing the phenomenon of arctic curve. In addition, and this assumption is crucial, the lattice paths can be viewed as directed random walks in the plane, defined in terms of a set of elementary discrete steps and a set of associated weights . We take as the case is trivial; for the same reason we also suppose that not all the are collinear; likewise we may assume without loss of generality that one of the is equal to 1. For convenience we assume that the steps belong to , although this should not be a strong restriction.
By a directed walk, we mean a walk for which a return to a previously visited site is not possible (no erasure of any kind is performed). Thus the available steps are such that the condition with non–negative integers has the only solution for all . Equivalently the convex hull of the points in the plane does not contain the origin111The previous argument shows that the origin is not in the rational convex hull. It is not difficult to show that it can neither be in the real convex hull.. This implies the existence of a straight line passing through the origin, which separates the plane in two halves, one of which strictly contains , viewed as a set of points. Then a unit vector exists such that all step vectors have scalar products with which are strictly positive, for all . For simplicity, we may assume that is such that the possible paths are contained in a cone which is itself contained in the right half-plane (thus for all ). We formulate here a number of preliminary results.
2.1 The Lagrangean function of a walk
For the directed walk defined in terms of a set of possible steps and weights , we denote by the weighted sum over all paths between the origin and the site , the weight of a path being the product of the weights of its elementary steps. The assumptions made above imply . The generating function of the numbers is given by
[TABLE]
We will be mostly interested in the asymptotic behaviour of , when are both large with the ratio finite, and more specifically in its exponential growth rate.
By a standard argument, we note that for positive integers, the inequality implies that the sequence is superadditive (satisfies ). By Fekete’s lemma, the following limit exists
[TABLE]
which we call the Lagrangean function for reasons that will be clear later on. The lemma does not guarantee that the limit is finite, but the following simple argument shows that it is finite, for all , and grows at most linearly in and .
Let , a strictly positive number. To reach the point requires at most elementary steps, taken from the set . We readily obtain the upper bound
[TABLE]
Unless the points are reachable for no , the function is also bounded below by any non–zero term in the sequence .
Repeating the above arguments in terms and final point , we obtain the obvious but important scaling relation,
[TABLE]
In the rest of this article, the function will be used with a single argument, meant to be a slope. The support of the function of one variable depends on the walk considered; it is given by the interval where are respectively the minimal and maximal slopes among the elementary available steps. Our assumptions imply that at least one of is finite, so the support is closed or semi-closed.
In simple enough cases, the function can be explicitly computed from the asymptotic form of the coefficients of the bivariate generating function , see for instance the review [23] (see below for more details). As an illustration, we find the following Lagrangean functions for two simple walks. All weights are equal to 1 in these, except the horizontal step in the second example, which has weight . For the second walk, we note that if ; in the other cases with the function given below (provided ).
[TABLE]
[TABLE]
2.2 Concavity
For an intermediate passage point between and , the simple inequality (which can be made strict except in degenerate cases)
[TABLE]
immediately yields (which cannot be made strict in general, as we will see later). Using the scaling relation (2.4), it becomes
[TABLE]
Setting , and , the previous inequality is the statement that the function is concave on its domain,
[TABLE]
However the function satisfies a stronger property which turns out to be central in the subsequent arguments: the Lagrangean function is strictly concave. The strict concavity means that
[TABLE]
The yet slightly stronger property, namely that on its open support, which itself ensures the strict concavity, proves to be useful later on and can be obtained from the way the function is to be computed. The reference [23] reviews a number of methods to compute the asymptotic value of the coefficients of multivariate generating functions. In our case, the relevant function is the two-variable generating function given in (2.1), for which the results quoted in [23] apply222In the survey [23], the generating function is assumed to be non singular at the origin and to expand in positive powers of and (). This assumption is not verified here if some of the elementary steps have a second component which is negative. When this is the case and since the step vectors are contained in a cone of aperture strictly less than , they can be mapped by an affine transformation to steps contained in the first quadrant for which the results of [23] apply. However, one can show that this procedure and a naive application of the results of [23] yield exactly the same results. For this to hold, the directedness of the walks considered here is crucial..
Let be two functions of satisfying the system (the indices of denote partial derivatives)
[TABLE]
Theorem 1.3 and Corollary 3.21 of [23] state that (i) there is a unique solution for which are real positive functions, and (ii) the asymptotic value of , if non–zero, is given by
[TABLE]
Here is an explicit function of and , generally a rather complicated algebraic function of , and
[TABLE]
In the two examples mentioned in the previous subsection, these functions are found to be equal to
[TABLE]
The corresponding functions are shown in Figure 1.
Before proving that is strictly negative, we show that the three functions are smooth. The first equation of (2.11) yields a first implicit equation , where nowhere vanishes in the domain . Moreover since is in both variables in the same domain, the implicit function theorem implies that is a function of . Similarly the second equation in (2.11), after substituting , yields an implicit equation , which is in both and . The derivative is found to be equal to
[TABLE]
where we used the identity
[TABLE]
obtained by differentiating the first equation of (2.11) with respect to , and using the second equation (here and in what follows, denotes the derivative of with respect to its argument, made explicit). Recalling the form of the function
[TABLE]
a simple computation shows that
[TABLE]
is strictly negative. Thus nowhere vanishes in the domain , . A second use of the implicit function theorem implies that is a smooth function of on its open domain . The statement that the functions and are smooth readily follows. We note that the double use of the implicit function theorem readily proves the uniqueness of the real functions .
In order to check that , we differentiate the first equation of (2.11) with respect to (after the substitution and ) and use the second equation to get the condition . It allows to write the first two derivatives of as
[TABLE]
Differentiating now the second equation of (2.11) with respect to and using (2.17) once more yields
[TABLE]
From the specific form of , we see that is strictly positive in the domain , whereas the denominator, equal to computed above, is also strictly positive. Thus the numerator and denominator of (2.21) are both strictly positive for , which implies and thus as claimed.
3 Passage probability distributions
In this section, we prove that in the scaling limit, by which all large lattice distances are rescaled to (equivalently, the lattice spacing is shrunk from 1 to ), the probability measure on paths connecting two points concentrates on a unique trajectory, namely a straight line. We also investigate the size of the fluctuations around the straight line.
3.1 Passage probabilities
Given a distant site , the probability that a directed walk passes through the point is given by
[TABLE]
up to the multiplicative non–exponential terms given in (2.12). The strict concavity of the Lagrangean function implies that the combination in the square brackets is strictly negative, except when the two points and coincide, namely when , in which case it trivially vanishes. We therefore obtain that the above probability is exponentially depreciated with when the intermediate point is not on the straight line, and that it is asymptotic to 1 otherwise. An analogous though somewhat weaker result has been proved in [14].
3.2 Fluctuations
We have seen that in the rescaled domain, the path between and is almost surely a straight line. However at finite but large , the distribution shows fluctuations. It is natural and important for what follows to understand the size of these fluctuations.
In the asymptotic regime, the passage probability is given by
[TABLE]
where is the combination of three terms in the square brackets in (3.1). As noted above, is strictly negative except at where it vanishes. Because of the factor in the exponential, only the values of very close to will make a significant contribution to the probability. Therefore we can expand, to leading order, and the factor in front of the exponential around . From above we know that but also that since is strictly negative away from , which is therefore a maximum. Moreover a simple computation yields , a strictly negative number according to the results of the previous section. We therefore obtain the following Gaussian approximation,
[TABLE]
for which we allow to take all real values (rather than those related to the reachability to and from ), and where
[TABLE]
The above probability is not normalized upon summation over , since one readily obtains
[TABLE]
On general grounds, we do not expect it to be normalized. Indeed if a vertical step of the form is allowed (has non–zero weight), the walk may visit several values of for the same value of , causing an overcounting (integral larger than 1). Likewise, if some of the possible steps have horizontal components , a fraction of walks will not visit any site on the vertical line at , causing this time an undercounting (integral smaller than 1). Finally, there may be arithmetical constraints on the sites that are actually reachable, in which case the summation over must be further restricted. These various situations are reflected by the presence of the factor in (3.3).
Despite the (expected) absence of a proper normalization, the distribution (3.3) shows a dispersion of order around the mean value , in the rescaled domain. On the lattice, keeping the large variables , we expect a standard deviation of order . This is well confirmed by numerical simulations, as shown in Figure 2 for the lattice paths of example 1 given in (2.5).
4 Variational approach
In this section, we would like to reformulate the problems and results developed in the previous sections as a continuous variational problem. Rather than counting the possible paths within a certain class dictated by the choice of possible elementary steps, and then doing statistical estimates about passage probabilities, we associate, in the continuum limit, a classical action to each path. The action allows to assign a statistical weight to a whole trajectory, represented as a continuous function , the weight being itself inherited from those of the lattice elementary steps. The resulting variational problem appears to be the appropriate tool to determine the extremal paths used in the tangent method to characterize arctic curves.
Let us consider all possible paths from (0,0) to , made of elementary steps taken from the set . For large, their weighted sum has the asymptotic value given in (2.12). Let us now focus on all the paths which pass through intermediate points , with for . Here the are fixed numbers independent of satisfying and . Provided is large, the weighted sum over such paths is
[TABLE]
Assuming that is the value at of some real function333This is largely legitimate because the paths we consider are directed to the right. In case vertical steps are possible, a vertical macroscopic stretch is unlikely. , namely , we obtain, for large enough , that , and that the sum over in the exponential is well approximated by an integral, leading to
[TABLE]
The function needs not be everywhere differentiable. Looking at the way it was introduced, the mere existence of a right derivative is sufficient. More generally, we may consider the set of functions which have a left and a right derivative, hence piecewise of class .
The other dependence of in the trajectory is in the factor in front of the exponential, which can similarly be written as
[TABLE]
Since was assumed to be small (we will argue that choosing proportional to is good enough), this term is subdominant, as expected.
We obtain that the fraction of paths going from the origin to the point , and which collapse in the scaling limit to a continuous trajectory described by the function , for , and , has a relative statistical weight given in terms of an action associated with ,
[TABLE]
Interestingly the weights of the paths depend on the underlying lattice walk only through the function (which was called Lagrangean for obvious reasons). A trivial but important remark is that not every function is permitted: it must be compatible with the way the discrete walk has been defined, implying that must be in the interval for all .
Because the Lagrangean function depends on the derivative of only, the Euler-Lagrange equation, computed on the subdomains where is , takes the simple form
[TABLE]
As is strictly negative on its (open) domain, the trajectories which make the action extremal satisfy , that is, is piecewise rectilinear. The concavity of ensures that the straight line is the unique global maximum of , as proved below. In the limit , the straight line is therefore observed almost surely, consistently with the results obtained in Section 3.
5 The tangent method
The applicability conditions of the tangent method are not precisely known, but it is generally expected to be valid in models which can be formulated in terms of random non-crossing lattice paths (or at least showing some form of repulsion). In these models, the arctic phenomenon arises when a large number of paths accumulate to form, in the scaling limit, a sharp and deterministic interface between a frozen region, which contains no path at all, and an entropic region, densely filled with paths. Focusing on a piece of the arctic curve connecting two points on the boundary of the domain, say and , we would see, close to the scaling limit, a compact cluster of paths starting from a neighbourhood of and ending at sites in a neighbourhood of . In particular the outermost path would start from site and go to site , in the scaling neighbourhood of and respectively.
The tangent method requires to slightly modify the model in such a way that the starting (or ending) point of the outermost path is moved from to another point , located on the boundary between and and at a macroscopic distance from , see Figure 3. The specific way this change is implemented depends on the model under consideration. In one of the best known cases, namely the Aztec diamond, one can achieve this by inserting two monomers, as shown in Figure 4. As a result of this change, the outermost path, now starting from but still tied to its ending point , will first traverse the frozen region, then approach and merge in the compact cluster of paths, and eventually reach its endpoint along with the rest of the cluster.
The tangent method relies on one assumption and one conjecture, both supposedly valid in the scaling limit.
- (i)
It makes the working hypothesis that the change of starting point of the outermost path, from to , has so little influence on the other paths that the arctic curve itself is not affected. If this were not the case, the method would predict a different arctic curve, as compared to other methods.
- (ii)
It postulates that the outermost path is almost surely a straight line traversing the frozen region from until it meets the arctic curve tangentially. After the meeting point, the path coincides with the arctic curve till the endpoint . In fact, the same conjecture more generally holds if the starting point is taken out of the domain, namely to a point chosen in such a way that the paths can enter the domain in the frozen region (through a boundary point between and ). Let us observe that the tangency of the straight lines requires local curvature properties of the arctic curve (the tangent lines must lie in the frozen region).
The efficiency of the method relies on the postulate (ii). By varying the starting point , one obtains a one-parameter family of straight lines tangent to the arctic curve, which is then simply recovered as its envelope. We note that in some cases, like for instance in the model of -weighted lattice paths studied in [17, 19], the straight lines are replaced by other known curves; however the method remains and the way the arctic curve is retrieved as the envelope of a family of tangent curves is the same.
Clearly, if the arctic curve is known, it is trivial to compute the family of tangential lines, each of which arrives at a given point of the boundary. The tangent method aims at solving the inverse problem: given a point , find the slope of the line starting from and hitting the arctic curve tangentially. This is usually done by resorting to a lattice point outside the domain, as mentioned in (ii), and by considering the paths from to which enter into the domain through a (random) boundary point between and . In the scaling limit and given , the paths are straight lines with probability 1, entering the domain at some boundary point and hit the arctic curve tangentially. Since the probability distribution over the point of entry concentrates on a unique point in the scaling limit, call it , it can be computed on the lattice as the entry point of maximal probability. In this way, the point can be obtained as a function of and allows to compute the slope of the line that will eventually meet the arctic curve tangentially. Varying yields the required family of tangent lines.
As pointed out in [12], the hypothesis (i) is hard to formalize in general, let alone to prove. The conjecture (ii) is easier to formulate in general; it is our main purpose to provide strong arguments in favour of it.
Assuming the hypothesis (i), we view the compact cluster of paths collapsing to the arctic curve in the scaling limit, as being given. It influences the shape and the distribution of the outermost path because it acts as an impenetrable frontier, due to the non-crossing property built in the model. The problem for the outermost path can then be posed in the following terms: what is the distribution on the lattice paths from to (same notations as above), made of elementary steps from the set , and constrained not to cross a certain curve ? In particular, in the scaling limit, does this distribution condensate on a deterministic trajectory with the properties given in (ii) ?
Compared to the results obtained in the previous sections, the novel issue is of course the presence of the impenetrable curve because in actual situations, the endpoint cannot be reached from by following a straight line, as it would cross . As the arctic curve is only defined in the continuum, we formulate the above question in the continuum. To do this we take advantage of the variational approach developed in Section 4, in which we assigned the action to a trajectory between two points in the plane. For the present problem, we use the same action (4.4) as when there was no constraint at all, and restrict it to those that do not cross the given curve; a justification for keeping the same action is provided in Section 6.
The following setting should be general enough. As we did before, we consider a trajectory in the plane as the graph of a continuous function . The general problem is, given a connected domain , whose boundary is sufficiently smooth (say, it has a tangent everywhere), to find the trajectories from to , wholly contained in , which maximize the action
[TABLE]
We recall that is the Lagrangean function, strictly concave, associated to the directed walk under consideration, and computable in terms of the sets and . Also the function must have its (left- and right-) derivatives in the domain of , namely the interval , itself determined by the set . This imposes certain constraints on for the point to be reachable. The last two conditions are implicit in the rest of this section.
Our main result is the following
Theorem For the variational problem formulated above (connected and simply-connected domain , boundary of class ), the action given in (5.1) with strictly concave, has a global maximum in the set of continuous piecewise functions, attained for a unique function of class . The maximizing is universal in the sense that it does not depend on the specific form of . The function is best described as the shortest path in between the points and : the graph of has rectilinear pieces alternating with portions of the boundary of ; moreover the linear parts of meet or exit the boundary tangentially.
The proof relies in an essential way on the strict concavity of the Lagrangean function and on the simple form of the action , which depends on through its derivative only. It basically is a consequence the following lemma.
Lemma Fix and two points in the plane and consider the set of continuous and piecewise functions connecting these two points, so that and (the functions are not restricted to be contained in a certain domain). Then the action S[h]=\int_{x_{i}}^{x_{f}}\mathrm{d}x\>L\big{(}h^{\prime}(x)\big{)} has a unique maximum for given by the straight line.
Let be the slope of the straight line . Assume first that is of class . In this case the derivative for takes all values in a finite interval . Since the average value of over is equal to , the interval must contain . The function being strictly concave, its graph lies below that of any of its tangents, in particular it lies below its tangent at , namely for all . If the interval is not reduced to the singleton , that is, if is not the straight line , then
[TABLE]
In case is only piecewise , the image of under is the union of intervals (when is not linear) and singletons (when is linear). The inequality used above nonetheless remains valid for any value in the domain of , and leads to the same conclusion .
To prove the Theorem, we take and to be the opposite vertices of a parallelogram (with slopes ), in which the allowed region is delimited by the boundary , see Figure 5 (as we can see, the property is only required for certain portions of ). We repeatedly use the Lemma: to go from point to point , the optimal path is the straight line, whenever this is possible (if the line does not cross the boundary). In particular, if is reachable from on a straight line, this will be the optimal .
If not, the endpoint , seen from the origin, is hidden by a portion of . In this case, is a straight line starting from the origin that will necessarily make a first contact with , say at : it cannot go round without touching it because such a path could be optimized by following a shorter chord. Thus must be composed of straight lines and portions of . must be the farthest point of the boundary that is reachable by a straight line from the origin hence must be a tangential point. Indeed, the Lemma implies that the union of a straight line to another contact point, and the portion of from it to the tangential contact point, has a smaller action.
From , the optimal path will follow for a while, exit the boundary at a certain point and follow a straight line towards some point (it could potentially follow the boundary till it reaches the endpoint, like for the Aztec diamond). is either a tangential point or else lies further down the boundary. Again the Lemma shows that a tangential point is optimal. Iterating these arguments proves the claim. It should be noted that the successive exit and passage points cannot be precisely determined without knowing the shape of the domain that the paths will encounter further ahead. These points can only be determined from the global shape of . The resulting global process leads to the optimal path as depicted in Figure 5. The simply-connected assumption, usually satisfied in arctic curve situations, prevents the possibility to have two or more equally maximizing paths, ensuring the unicity of .
In the examples of arctic curves known to the authors, the above result applies directly where the boundary is made of the arctic curve itself and the boundary of the domain in which the model is naturally defined.
6 Discussion
We have taken a definite stand on how to treat the paths in presence of an impenetrable boundary, by restricting the variational problem obtained without boundary to trajectories contained in the allowed domain. If the endpoint is not reachable by a straight line, the optimal path will inevitably approach and coincide with portions of the constraining boundary. Since our derivation of the variational principle ignores the presence of a boundary, our approach calls for a justification. We provide it in this section.
An alternative route would be to take into account, in the discrete lattice formulation itself, the constraint that the paths be contained in the allowed region, by modifying the weights of the elementary steps so as to prevent the walk from entering the forbidden region. In that formulation, one would compute the number of acceptable paths and the asymptotic form of the passage probabilities in order to figure out on what sort of trajectories the distribution concentrates in the scaling limit. Though probably more natural and more satisfactory, this approach is likely to be impractical for general domains as those considered above in the continuum, as it would require to handle position-dependent weights . A (admittedly very) simple example is nonetheless worked out below which shows that the two approaches give identical results in the scaling limit.
The main reason which validates our approach is based on the following argument. If it is true that the optimal path coincides with portions of the boundary in the continuum limit, it is not so on the lattice. The lattice formulation of the problem considers random lattice paths from to , and asks for the distribution over these paths, for large , equivalently the distribution of passage on any given vertical line. In regions where the constraint plays no role, we know from Section 3.2 that the distribution on paths has its support essentially contained in a window of size around the straight line (the average path). When the constraint is effective enough to force the lattice paths to deviate from their ballistic, straight line behaviour, one does not expect that the average path (or the likeliest, it does not make much difference) remains in a close neighbourhood of the constraint, because the entropy away from the constraint is higher: the paths which stay away from the constraint are more numerous. The entropic bias increases with the distance to the constraint and keeps the average path away from it. The average path will however not be pushed away from the constraint beyond a characteristic length scale of order since this is the scale of the fluctuations when there is no constraint. It is therefore natural to expect that the average path will be localized at a distance of order from the constraint. In this case, the derivation we made in Section 4 of the action associated with a macroscopic path remains valid if we partition the horizontal interval into small cells whose size is of order . Over this horizontal distance, the vertical displacement is of order , also proportional to . It then follows that the path trajectory is partitioned into windows of linear size , big enough to contain the main fluctuations, and small enough to stay clear of the constraint. Inside each window, the path statistics is the same as if there were no constraint. So on the lattice, the constraint has a repulsive effect that moves the windows away from itself, but the statistics inside the windows remains unchanged, suggesting that the Lagrangean function is the same whether or not there is a constraint (this will be confirmed in the explicit example below). In the continuum limit, namely after rescaling by a factor , the repulsion scales away and the restriction that the paths do not enter the forbidden region is then all what remains of the constraint.
These expectations are borne out by the results of numerical simulations, shown in Fig. 6. The simulations were carried out for the walks made of the two elementary steps and , each with weight 1 (the example 1 in Section 2), conditioned to start at and to end at the diagonal site . The impenetrable boundary is represented by the parabola-like curve. The left panel shows the average paths, in the rescaled domain, for four values of , whereas the right panel plots the distance between the average paths and the boundary , rescaled by a factor . In the region where the constraint is effective, one clearly sees that the average paths stay at all times at a distance of order from it.
Exact calculations are often possible when the constraint is piecewise linear. As a simple example, we consider the same walk as in the previous paragraph, with steps and each of weight 1. We condition the walk to start from the origin, to end at a distant site , with , and we constrain it not to visit sites under the diagonal line for , in such a way that the endpoint is not reachable along a single straight line. For , the walk is no longer constrained although, due to the nature of the available steps, the walk will stay above (or on) the horizontal line .
The lower boundary of the accessible domain is made of two linear segments, a diagonal one and a horizontal one. We want to examine the alternative approach alluded to above, namely we solve the combinatorics of the constrained random paths, compute the continuous path on which the measure concentrates in the scaling limit and check that wherever the optimal path coincides with the diagonal constraint, the random lattice paths stay at a distance from it.
Let us denote by the number of lattice paths from to , and by the number of those which do not go under the diagonal line. They are explicitly given by
[TABLE]
Indeed they both satisfy the recurrence and the appropriate boundary conditions. These expressions make it clear that for and fixed , is asymptotically equal to
[TABLE]
where the functions and , given in Section 2, pertain to the walk without the constraint. As anticipated, the Lagrangean function is the same in the constrained and unconstrained cases.
Let us first look at the passage probability distribution on the vertical line . Because vertical steps are allowed, we consider the probability that is the first visited site of the vertical line above the endpoint of the diagonal constraint, given that the path eventually reaches . This probability is proportional to the product of and . For large , we find that the distribution is given by
[TABLE]
where is the normalization factor. It follows that the average passage point remains at a distance above , and implies that in the scaling limit, the continuous path visits the point with probability 1. Since after that point there is no effective constraint, is a straight line from to the final point .
Let us now focus on the region above the diagonal constraint. As we have just seen that the passage distance to is of order , we can consider the paths from to not going under the constraint. For , we look for the probability that the first passage on the vertical line occurs at site . We obtain that this probability is equal to
[TABLE]
For large and such that is small, we obtain the following asymptotic value,
[TABLE]
It implies that the average value is of order above the diagonal constraint.
In this particular example, we can conclude that indeed the random lattice paths stay at a distance away from the constraint, with the exception of a small neighbourhood of the extremal point where the two linear segments of the boundary meet, which is also the only point where the boundary is not . We can however note that the entropy bias advocated above becomes negligible for such a small region: the gain of entropy is only substantial when the paths keep away from the constraint over a macroscopic length scale. We expect these characteristics to be generic.
The above combinatorial analysis shows that in the scaling limit, the constrained random lattice paths condense with probability 1 on the single continuous path going from the origin to the point by following the diagonal constraint, and then goes straight from to . This is also what the variational problem in the continuum yields. Despite the fact that this boundary is not , it is easy to see that the concavity argument implies that the optimal path in the (rescaled) domain is piecewise linear, and made of the two linear pieces mentioned above.
7 Area weighted paths
The tangent method is expected to be applicable in more general situations. A natural generalization of the previous discussion is a position-dependent weighting of the non-crossing paths for which the trajectory of a single path, without constraints, can differ from the straight line. In this case however, the assumption (i) of Section 5 is expected to hold true. In this section we generalize the proof of the conjecture (ii) for a particular position-dependent weighting discussed in the context of the tangent method in [17, 19].
7.1 Definition
As before, we consider directed lattice paths with steps and weights . In addition, we assign each path a weight related to its area and controlled by a parameter . For a path we define to be the (signed) area between and the horizontal axis . The total weight of is then the product of the weights of its elementary steps times . This is a mild case of position dependent weights as it corresponds to assign a weight to an elementary step taken at .
We denote by the weighted sum over all paths between and . The generating function approach used to compute the numbers can be extended to the -weighted case. The main idea is to promote the variables and used in (2.1) to non-commuting variables satisfying . Defining by setting , the previous relation then implies . For a given set of elementary steps and weights , we define
[TABLE]
Here is the part of the weight that is independent of the position . The generating function can then be written
[TABLE]
where denotes the operation by which all the ’s are moved to the left of the ’s by using the above commutation relations, after what and are substituted by and , respectively. This reordering of the and variables brings in a power of which, combined with the partial weights contained in , exactly produces the right total weight of a path , equal to \big{(}\prod_{i}w_{i})\,q^{A(p)}.
For an explicit example, let us reconsider the example 1 introduced in Section 2, namely with . In this simple case, and we find
[TABLE]
from which we deduce
[TABLE]
One readily checks that these numbers satisfy the recurrence relation and the appropriate boundary conditions.
7.2 Variational approach
For (resp. ), paths with an area close to the maximal (resp. minimal) value are more probable. However, in the scaling limit, trajectories with an extremal area are associated with less lattice paths and therefore have less entropy. Our first objective is to determine, in the continuum, the likeliest trajectory between two points, that is the trajectory which has the optimal balance between these two tendencies. We use the same notation as before for this curve, namely (omitting the explicit dependence in ), and call it a free trajectory (or a geodesic) between and say .
In the discrete setting, when the endpoint becomes large, the area of a generic path increases like , so that must be rescaled in the scaling limit. As we will see, a non-trivial limit is obtained if we define for a fixed ; in the following we also set .
The variational approach of Section 4 can be extended in a straightforward way. Consider again the collection of paths passing through the intermediate points for , where is, as before, chosen to be proportional to . If were equal to , their total weight would be equal to given in (4.1) and associated with the weights of the elementary steps. However the paths contributing to do not all have the same area since also depends on the intermediate passage points between the ’s. It turns out that the cumulated area of the rectangles of width and heights provides a Riemann sum that is suitable to compute , the corrections to it bringing a contribution depreciated by a relative factor . Thus the dominant contribution of all these paths can be written in the form that only depends on the ’s,
[TABLE]
In the scaling limit, the first factor converges to for the action (4.4) used earlier. The power of in the second term converges to and justifies the scaling announced above, as the only way to obtain a non-trivial limit for large : if the area term dominates, it leads to degenerate trajectories, and if it is dominated, we are back to the case. We obtain that the weight of a continuous trajectory has the following form,
[TABLE]
Note that one does not need to know explicitly, since is computed solely from and , as explained in Section 2. For Schröder-like paths, the action has been computed in [19] by using the explicit value of (a -deformed trinomial coefficient) and exactly matches the form above upon using the function given in (2.6) (and an appropriate change of coordinates).
The problem now is to compute the free trajectories, namely to maximize the functional (7.6) in the set of functions that are continuous and piecewise , with
[TABLE]
The strict concavity of ensures that the global maximum is unique. Indeed, let us suppose that and are two distinct global maxima. Then (or any convex combination of and ) satisfies (7.7) and is such that , which is a contradiction. Using once more the strict concavity of the Lagrangean function , the Weierstrass-Erdmann corner condition [24] implies that is in fact , and therefore satisfies the Euler-Lagrange equation over the entire domain,
[TABLE]
This equation can be explicitly solved. A first integration yields . Since , is a strictly monotonic function and is therefore invertible. In fact, from Section 2, with itself being invertible (), so we obtain
[TABLE]
where the second equality follows from the change of variable . The relation t\big{(}\log y(t)\big{)}^{\prime}=-\big{(}\log x(t)\big{)}^{\prime} obtained Section 2 allows to carry out the remaining integration,
[TABLE]
in terms of the functions and introduced in Section 2. The two integration constants are uniquely fixed by the boundary data, for instance the endpoints of the trajectory (7.7) (in which case ). On account of the relation \log x\big{(}y^{-1}(e^{-z})\big{)}=zL^{\prime-1}(z)-L\big{(}L^{\prime-1}(z)\big{)}, the solution can be expressed solely in terms of (and the inverse of its first derivative). Whatever the form we give it, the unique solution is in the variable , and also in and .
For concreteness, we give the explicit form of the general solution for the two specific examples considered in Section 2. Using the functions or given there, we find (the constants have been redefined to in a -dependent way)
[TABLE]
In each case, the geodesics appear as bent lines of constant concavity, as seen in Figure 5. Indeed the Euler-Lagrange equation implies that they are strictly convex for , strictly concave for .
Contrary to the case, the free trajectories are not universal and depend on the elementary steps of the underlying walk as well as their weights. However, the tangency at the contact point between a free trajectory and a constraint is universal, as will be shown below. It is worth mentioning that the previous argument for the uniqueness of is easily generalized for position dependent weights of the form when is a smooth (not necessarily strictly) concave function. However for a general , it is less clear how to construct a solution to the Euler-Lagrange equation, and one should rely on more abstract results to study the regularity of .
7.3 Tangency
We now turn to the case where the paths are constrained to stay inside a given domain . As before, we assume that the boundary is in the regions where there is a contact between the optimal path and the boundary itself. The boundary repulsion mechanism discussed in Section 6 is similar in the -weighted case, so we will use the corresponding action (7.6) in the constrained situation. Let us denote by the solution of this maximization problem, and first prove that is tangent to the boundary at the point of first contact , if there is one (see figure 5).
Let us denote by the free trajectory between the points and . The function is in and , as follows from (7.10). The uniqueness of the extremal solution (7.10) between two points and ensures that , the optimal trajectory in presence of constraints, can only consist of free trajectories and portions of the boundary. Indeed, if a part of is not a portion of the boundary, then it must make the action stationary and therefore be of the form (7.10).
If the endpoint is reachable from , then is the free trajectory between these two points. If it is not, starts from the origin and follows a free trajectory until it reaches the boundary at . Without loss of generality, we assume that the free trajectory hits the boundary from above. For convenience, we also view the boundary around as the graph of a function . It is clear that with
[TABLE]
Indeed, if the first contact point is at , it is not reachable by a free trajectory that starts from the origin, and if , one can optimize the path. It remains to show the tangency, namely that with .
If , there exists such that , which is impossible since is a valid trajectory (that is, it stays above the boundary). 2. 2.
Assume instead that . The property of the free trajectories ensures the existence of such that for all . Upon integration on , this implies for . In addition must also be above for because the two curves cannot cross (they would have two points in common, the crossing point and the origin, and would therefore coincide). Thus we obtain for with , contradicting the definition of in (7.13).
Hence we must have . The argument for the tangency of the point of first escape (if there is one) is similar. As before, the tangency at and implies the tangency of the subsequent contact points, which completes the proof.
We note that if the domain is simply-connected, the proof of the uniqueness of in presence of constraints generalizes. Indeed, let and be two distinct global maxima () of the constrained problem. Then lies between and for . Since the domain is simply-connected and since and belong to it, is in the domain as well. It is therefore a valid trajectory and satisfies , which contradicts the assumption.
Acknowledgments
We thank Gilles Parez for his careful reading of the manuscript. B.D. acknowledges the financial support of the Fonds Spéciaux de Recherche (FSR) of the Université catholique de Louvain. P.R. is a Senior Research Associate of FRS-FNRS (Belgian Fund for Scientific Research). This work was supported by the Fonds de la Recherche Scientifique - FNRS and the Fonds Wetenschappelijk Onderzoek -Vlaanderen (FWO) under EOS project no 30889451.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Cohn, N. Elkies and J. Propp, Local statistics for random domino tilings of the Aztec diamond , Duke Math. J. 85 (1996) 117-166 ( ar Xiv:math/0008243 [math.CO] ).
- 2[2] W. Jockusch, J. Propp and P. Shor, Random domino tilings and the arctic circle theorem , ar Xiv:math/9801068 [math.CO].
- 3[3] H. Cohn, M. Larsen and J.Propp, The shape of a typical boxed plane partition , New York J. Math. 4 (1998) 137-165 ( ar Xiv:math/9801059 [math.CO] ).
- 4[4] F. Colomo and A.G. Pronko, The arctic curve of the domain-wall six-vertex model, J. Stat. Phys. 138 (2010) 662-700 ( ar Xiv:0907.1264 [math-ph] ).
- 5[5] F. Colomo, A.G. Pronko, P. Zinn-Justin, The arctic curve of the domain wall six-vertex model in its antiferroelectric regime , J. Stat. Mech. (2010) L 03002 ( ar Xiv:1001.2189 [math-ph] ).
- 6[6] F. Colomo and A.G. Pronko, The Limit Shape of Large Alternating Sign Matrices , SIAM J. Discrete Math. 24 (2010) 1558-1571 ( ar Xiv:0803.2697 [math-ph] ).
- 7[7] L. Petrov, Asymptotics of random lozenge tilings via Gelfand-Tsetlin schemes , Probab. Theory Relat. Fields 160 (2014) 429-87 ( ar Xiv:1202.3901 [math.PR] ).
- 8[8] E. Duse and A. Metcalfe, Asymptotic geometry of discrete interlaced patterns: Part I , Int. J. Math. 26 (2015) 1550093 ( ar Xiv:1412.6653 [math.PR] ).
