When random walkers help solving intriguing integrals
S.N. Majumdar, E. Trizac

TL;DR
This paper demonstrates how viewing complex integrals through the lens of random walks reveals their properties, enables generalizations, and derives related identities without explicit calculations.
Contribution
It introduces a novel random walk framework to analyze and generalize intriguing integrals, providing new insights and identities in mathematical analysis.
Findings
Random walk perspective clarifies integral properties
Generalizations of integrals are systematically derived
Related identities are obtained without explicit computation
Abstract
We revisit a family of integrals that delude intuition, and that recently appeared in mathematical literature in connection with computer algebra package verification. We show that the remarkable properties displayed by these integrals become transparent when formulated in the language of random walks. In turn, the random walk view naturally leads to a plethora of nontrivial generalizations, that are worked out. Related complex identities are also derived, without the need of explicit calculation. The crux of our treatment lies in a causality argument where a message that travels at finite speed signals the existence of a boundary.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
When random walkers help solving intriguing integrals
Satya N. Majumdar and Emmanuel Trizac
LPTMS, CNRS, Univ. Paris-Sud, Université Paris-Saclay, 91405 Orsay, France
Abstract
We revisit a family of integrals that delude intuition, and that recently appeared in mathematical literature in connection with computer algebra package verification. We show that the remarkable properties displayed by these integrals become transparent when formulated in the language of random walks. In turn, the random walk view naturally leads to a plethora of nontrivial generalizations, that are worked out. Related complex identities are also derived, without the need of explicit calculation. The crux of our treatment lies in a causality argument where a message that travels at finite speed signals the existence of a boundary.
Introduction. While intuitions and experimentations are both crucial in mathematical works, inductive thinking may be spectacularly misguided in some cases. A celebrated illustration of the dangers of pattern extrapolation is provided by the question of circle division by chords Pizza : Consider points on the circumference of a circle and join every pair of points by a chord such that at any point inside the circle at most two chords can intersect. How many regions gets the circle divided into? By simple drawing, one sees that (by convention), , , , . At this point one may naively guess that for general , . Wrong ! It turns out that . Indeed, the correct answer is , which happens to coincide with the sequence up to , but starts differing from it for onwards !
Our interest goes here to a lesser known such problem, and the surprising behavior of integrals of the type
[TABLE]
where denotes the cardinal sine function rque1 ; rque2 . We do not dwell on the prevalence of sinc function in mathematics (geometry, spectral analysis…) and physics (signal processing, optics…), see e.g. GS . It was shown that , whereas , for all BoBo01 . In the latter situation, the difference is minute, less than for , which was first realized numerically, and attributed to a bug in the software BoBo01 . A related phenomenon was observed for the -family: for , but for all Schm14 . A theorem shown in BoBo01 rationalizes this matter of fact: it states that provided ,
[TABLE]
Without loss of generality, one can choose the coefficients to be positive real quantities, and can then be taken as the largest of them. Given that while , this explains the behavior of the -family, for which . The -family falls under the same argument rque10 . When the equality in (3) breaks, the explicit integrals could be computed. The corresponding values, related to the volume of hypercubes, cut by parallel hyperplanes, is immaterial for our purposes BoBo01 . Our goal is rather to provide a transparent understanding of the statement (3). To this end, we will show that the language of random walks, and physical intuition not only provide a natural framework to understand this change of behavior, but also leads to relevant and interesting generalizations, thereby offering an explicit and effort-free calculation of complex multidimensional integrals comment52 . At the heart of our approach lies a causality argument, formulated in terms of a message that signals the existence of a boundary.
Random walkers in a finite or infinite “world”. We start by considering a random walk making steps on a line, starting at , where is uniformly distributed in and the ’s are independent random variables. The probability density function (pdf) of each is thus a rectangle function, with simple characteristic function (Fourier-Transform) . The characteristic function of , sum of independent increments, thus reads
[TABLE]
which allows us to write its pdf as the inverse Fourier transform
[TABLE]
The integral under scrutiny is thus isomorphic to , i.e., the probability density of the random walk to be back at the origin, while starting from the origin ().
To proceed further, it is useful to reinterpret as follows. Consider a large (infinite actually) number of independent random walkers, all starting at . Then is just the fraction of walkers at after steps, i.e., the density of this gas of independent particles at the origin after steps. After step 1, their density is uniform in so that (incidentally meaning that ). A second step is then made, with amplitude . Because the jump is finite and , it is clear that all the walkers that were, after step 1, in the range will not leave their step-1 domain following the second step. Only walkers near the two edges, e.g., those in the range or may leave the step-1 domain after the second jump. Hence, the walkers in do not ‘see’ the edges of the step-1 domain–for them, it is as if the system was infinite with uniform density . In such an “infinite world”, the gain and loss contribution balancing those walkers leaving the origin and those reaching it after the second step do cancel: . This is illustrated in Fig. 1 where one can appreciate that the flatness of the density near the origin is preserved, although in a range that diminishes with the number of steps performed. The argument does not depend on the left/right symmetry of the random steps rque65 . In other words, we invoke causality and the boundedness of the steps to state that the only possibility for to be affected by a new step is when walkers having started from the edges at do reach the origin. Those ‘messengers’ carry the information that the “world” is not infinite, which in turn impinges on . If , the distance traveled by the messengers is not sufficient to reach the origin, and is -independent. We therefore recover statement (3) rque66 . Besides, while our random walk argument directly applies to integrals, it also is relevant for the -type, as explained in the supplemental material suppl . It is nevertheless necessary here to supplement the analysis with a new property, the left-right symmetry of the random steps. The key feature becomes the preservation of the edge density under performing random steps, while it pertained to the preservation of when treating integrals suppl .
The random walk reformulation provides us with an immediate generalization. Consider, for instance, the case where the first step is of Pearson’s type Pearson (i.e. of a fixed amplitude , ending at ), while the subsequent steps are again uniform as before with for , then for . In this case, the characteristic function after steps is given by: . Then our causality argument tells us that for . Evidently, the origin remains void of walkers, until the messengers arrive at . Provided that , this implies rque50 :
[TABLE]
This identity can be recovered by invoking a different random walk sharing with the previous one steps , but not the first step suppl .
One dimensional generalizations. A natural extension of the above results consists in considering that all steps except the first are arbitrary, but of finite range. The corresponding pdfs are therefore of finite support. The argument now involves the associated characteristic functions, that we denote . These are defined as the Fourier transforms of pdfs that have a finite support, taken for convenience to be unity. Provided , one can write:
[TABLE]
Under the same condition and taking once more advantage of causality, we obtain rque50
[TABLE]
where again all amplitudes are considered positive. Some particular cases have been addressed in earlier studies BoBo01 ; AlGu14 , but we stress that many more are subsumed under Eqs. (8) and (9). The task amounts to establishing a catalogue of eligible . It is not our purpose here, and we simply mention some emblematic such functions: the Bessel functions , and more generally for , , and a number of hyper-geometric functions. To generate candidates, advantage can be taken from the study of hyper-uniform systems, that feature potentials of bounded Fourier Transform, see e.g. Torquato .
Beyond dimension one. A second natural extention of previous considerations consist in considering dimensional random walks, with . A straightforward calculation shows that the counterparts of the one dimensional and functions are given as follows. For a one-step walk with jump , chosen respectively (i) uniformly within a -dimensional sphere of radius and (ii) uniformly on the surface of the same sphere (Pearson’s type jump), the associated characteristic functions of the jumps are
[TABLE]
where and are respectively the volume and surface of a -dimensional unit sphere. For , using and , one recovers respectively and . A new catalogue of functions can then be established, such that their -dimensional Fourier Transform is of bounded support (several interesting candidates can also be found in Torquato ; a rather generic one being the hyper-geometric function , where is some arbitrary parameter). Knowing the eligible building blocks , one can write upon setting
[TABLE]
provided .
A nontrivial identity follows from Eq. (11) by considering a special case. Choose the -th step uniformly from a -dimensional sphere of radius (for ). Consequently, using in Eq. (11), we get upon setting the following identity
[TABLE]
provided rque101 . Not surprisingly, with a Pearson first step that depopulates the origin and for :
[TABLE]
Mixing dimensions, we note that the steps can be of any type provided the associated Fourier Transform is bounded: in (11) and (13), the building blocks can be some , borrowed from a lower dimensional catalogue with . In doing so, we generate a wealth of complex integrals. Some of the simplest are known Grads ; Wats , for instance
[TABLE]
for , which follows from (13) with , and that appears under section 6.711.2 in Grads , when rque20 . Yet, infinitely many other identities that are subsumed in (11) or (13) are complex and seemingly unknown.
It is worth stressing that in some cases the random walk reformulation may not immediately lead to an explicit result, however it may nevertheless offer a direct means of calculation. As an example, we find the following nontrivial identity
[TABLE]
with . The above result holds for (and say , ); otherwise, the integral vanishes. The proof is provided in the supplemental material suppl . We outline here the main steps. Consider a -d random walk, starting at the origin and making successive steps: a first jump along -direction, a second jump along -direction, then a Pearson jump on the circle of radius with a final fourth step distributed as the third, but with a radius and say , see Fig. 2. Then the lhs of Eq. (15), using the results from line (ii) of Eq. (10) ( for the first two steps and for the last two steps), is precisely where denotes the density at the origin after steps. This density can, in turn be computed by elementary means, see the Suppl. Mat. suppl , leading to the rhs of Eq. (15). If , the random walk which is at a distance from the origin after step 2 cannot be back to be origin after step four, exploring an annulus with inner radius and outer radius (see Fig. 2). Equally tractable is the case where the fourth step is not Pearson but uniform within the disc of radius (for the 4-th step we use (i) of Eq. (10) with ), leading to
[TABLE]
The lhs can be made more complex without sacrificing the possibility of an explicit calculation of .
When sums and integrals coincide. We now turn to a distinct problem, that bears a similarity with the previous ones, after suitable reformulation. There was some interest recently in identities of the form BaBB08 ; DoGL12
[TABLE]
which hold provided rque30 . The latter condition can be compared to that applying to (3), , that can be rewritten as . The difference between the two criteria, where the same quantity if bounded either by or by , indicates that the identity (17) cannot be reduced to any of the previous arguments. Yet, the random walk reformulation also is insightful to show, and understand, relation (17). The idea is to compare two population of random walkers, one on the infinite line (case F, for “flat”), and the other on the unit circle (case C). Both population, starting from the origin, undergo the same random jumps. Provided that the front-runners (the random walkers having travelled the greater distance from the origin) did not travel round the circle in case C, moving on a flat line or on a finite circle is immaterial. The corresponding condition reads . When this inequality is fulfilled, the probability density of the walkers at the origin is thus the same in cases C and F. Expressing the pdf as either a Fourier series for case C or a Fourier transform for case F, we then get
[TABLE]
which includes (17) and many other cognate relations Stormer . As above, the refer to arbitrary functions, the Fourier transform of which are bounded with unit support. Loosely speaking, Eq. (18) can thus be viewed as the “flat world equation”.
Conclusion and discussion. We have proposed a random walk interpretation of a curious phenomenon, exhibited by integrals of type (1)-(2). The underlying physical image is that of an ensemble of random walkers starting from the origin, and performing a first step so as to populate uniformly the interval . The walkers then undergo a series of smaller steps with respective amplitudes . If the maximal span of these steps cannot lead walkers from the edge (i.e. at ) back to the origin , then the walkers near have a fixed density (given by the lhs in Eq. (3)), specified by the first step and thus equal to . In pictorial terms, the walkers near the origin cannot know they live in a finite world, unless the messengers starting from the confines at reach them. This may never happen if , in which case an equality like (3) will hold at all orders rque60 . It is interesting here to note that the model of random walks with shrinking steps directly applies to physico-chemical problems such as line broadening for single molecule spectroscopy in disordered media Barkai ; PaulK . The random walk reformulation naturally leads to non-trivial extensions, since it is irrelevant that the steps etc. be uniformly distributed, provided the first one (labeled ) has the desired property (uniform to lead to (3) or Pearson to lead to (7)) and that the subsequent steps ( …) are bounded. Generalizations in higher dimensions appear of particular interest, and provide calculation-free results that would otherwise require considerable effort and ingenuity.
While the mathematical problem at stake deceives intuition, we have shown that physical arguments may take over. Physics’ insight takes the form of a causality rule, where a message travels from a boundary. Applied to random walkers undergoing jumps of bounded amplitude, this yields a clear account for the change of behavior of a class of multidimensional integrals (of the types (11) and (13)). The random walk picture also allows for simple calculation of complex multidimensional integrals (such as in (15)) and (16)). Besides, related probabilistic ideas can be generalized to compute other classes of integrals that are otherwise hard to obtain. Assume for instance that a 1d walker starts with a Pearson jump of amplitude , so that . The boundedness of subsequent steps and the causality rule mean that remain at , as long as , i.e. as long as the message starting from does not hit . This implies
[TABLE]
after a straightforward re-expression of presented in suppl , section III. It thus appears that the curious phenomenon at work for Borwein integrals is much more general and applies to a much broader class of complex integrals and discrete sums.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) J.H. Conway, R.K. Guy, “How Many Regions?” In The Book of Numbers , New York: Springer-Verlag, p. 76-79 (1996).
- 2(2) we consider throughout that sinc ( 0 ) = 1 sinc 0 1 \hbox{sinc}(0)=1 . Its remarkable properties led E.T. Whittaker to specify sinc as “ a function of royal blood in the family of entire functions, whose distinguished properties separate it from its bourgeois brethren ”. The denomination “cardinal” seems to originate from this author (E.T. Whittaker, Proc. Roy. Soc. Edinburgh 35 , 181 (1915)).
- 3(3) For instance, these integrals naturally appear in the study of random harmonic series, see e.g. Schm 03 .
- 4(4) W. B. Gearhart and H. S. Schultz, The College Mathematics Journal 2 , 90 (1990).
- 5(5) D. Borwein and J. Borwein, The Ramanujan Journal 5 , 73 (2001); D. Borwein, J.M. Borwein and A. Straub, American Mathematical Monthly 119 , 535 (2012).
- 6(6) H. Schmid, Elem. Math. 69 , 11 (2014).
- 7(7) Noticing that cos ( k ) sinc ( k ) = sinc ( 2 k ) 𝑘 sinc 𝑘 sinc 2 𝑘 \cos(k)\,\hbox{sinc}(k)=\hbox{sinc}(2k) , one should here consider a 1 = 2 subscript 𝑎 1 2 a_{1}=2 instead of a 1 = 1 subscript 𝑎 1 1 a_{1}=1 . Then, N = 56 𝑁 56 N=56 is the threshold such that ∑ n = 2 56 1 / ( 2 n − 1 ) < 2 superscript subscript 𝑛 2 56 1 2 𝑛 1 2 \sum_{n=2}^{56}1/(2n-1)<2 while ∑ n = 2 57 1 / ( 2 n − 1 ) > 2 superscript subscript 𝑛 2 57 1 2 𝑛 1 2 \sum_{n=2}^{57}1/(2n-1)>2 .
- 8(8) Our approach shares with previous ones Bo Bo 01 ; Schm 14 ; Al Gu 14 the explicit usage of Fourier transforms. Yet, it differs in spirit in its probabilistic perspective.
