On the free path length distribution for linear motion in an n-dimensional box
Samuel Holmin, P\"ar Kurlberg, Daniel M{\aa}nsson

TL;DR
This paper analyzes the distribution of free path lengths for particles in an n-dimensional box, providing explicit formulas and methods to recover box dimensions from the distribution's features.
Contribution
It derives explicit, piecewise real analytic formulas for free path length distributions in 2D and 3D boxes, linking distribution features to box dimensions.
Findings
Explicit formulas for 2D and 3D free path length distributions.
Distribution features reveal the box's side lengths.
As R and N tend to infinity, distributions converge to geometric intersection models.
Abstract
We consider the distribution of free path lengths, or the distance between consecutive bounces of random particles, in an n-dimensional rectangular box. If each particle travels a distance R, then, as R tends to infinity the free path lengths coincides with the distribution of the length of the intersection of a random line with the box (for a natural ensemble of random lines) and we give an explicit formula (piecewise real analytic) for the probability density function in dimension two and three. In dimension two we also consider a closely related model where each particle is allowed to bounce N times, as N tends to infinity, and give an explicit (again piecewise real analytic) formula for its probability density function. Further, in both models we can recover the side lengths of the box from the location of the discontinuities of the probability density functions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the free path length distribution for linear motion in
an -dimensional box
Samuel Holmin
Pär Kurlberg
Daniel Månsson
Abstract
We consider the distribution of free path lengths, or the distance between consecutive bounces of random particles, in an -dimensional rectangular box. If each particle travels a distance , then, as the free path lengths coincides with the distribution of the length of the intersection of a random line with the box (for a natural ensemble of random lines) and we give an explicit formula (piecewise real analytic) for the probability density function in dimension two and three.
In dimension two we also consider a closely related model where each particle is allowed to bounce times, as , and give an explicit (again piecewise real analytic) formula for its probability density function.
Further, in both models we can recover the side lengths of the box from the location of the discontinuities of the probability density functions.
1 Introduction
We consider billiard dynamics on a rectangular domain, i.e., point shaped “balls” moving with linear motion with specular reflections at the boundary, and similarly for rectangular box shaped domains in three dimensions. We wish to determine the distribution of free path lengths of ensembles of trajectories defined by selecting a starting point and direction at random.
The question seems quite natural and interesting on its own, but we mention that it originated from the study of electromagnetic fields in “reverberation chambers” under the assumption of highly directional antennas [9]. Briefly, the connection is as follows (we refer to the forthcoming paper [5] for more details): given an ideal highly directional antenna and a highly transient signal, then the wave pulse dynamics is essentially the same as a point shaped billiard ball traveling inside a chamber, with specular reflection at the boundary. Signal loss is dominated by (linear) “spreading” of the electromagnetic field and by absorption occurring at each interaction (“bounce”) with the walls. The first simple model we use in this paper neglects absorption effects, and models signal loss from spreading by simply terminating the motion of the ball after it has travelled a certain large distance. The second model only takes into account signal loss from absorption, and completely neglects spreading; here the motion is terminated after the ball has bounced a certain number of times.
We remark that the distribution of free path lengths is very well studied in the context of the Lorentz gas — here a point particle interacts with hard spherical obstacles, either placed randomly, or regularly on Euclidean lattices; recently quasicrystal configurations have also been studied (cf. [4, 2, 15, 7, 3, 11, 13, 16, 10].)
Let be large and let a rectangular -dimensional box be given, where . We send off a large number of particles, each with a random initial position chosen with respect to a given probability measure on , and each with a uniformly random initial direction v^{(i)}\in\mathbb{S}^{n-1}=\{x\in{\mathbb{R}}^{n}:\mathopen{}\mathclose{{}\left\|x}\right\|=1\}, , for a total distance each. Each particle travels along straight lines, changing direction precisely when it hits the boundary of the box, where it reflects specularly. We record the distance travelled between each pair of consecutive bounces for each particle. (Note in particular that we obtain more bounce lengths from some particles than from others.) Let be the uniformly distributed random variable on this finite set of bounce lengths of all the particles. More precisely, a random sample of is obtained as follows: first take a random i.i.d. sample of points (with respect to the measure ) , and a random sample of directions (with respect to the uniform measure). Each pair then defines a trajectory of length , and each such trajectory gives rise to a finite multiset of lengths between consecutive bounces. Finally, with denoting the (multiset) union of bounce length multisets , we select an element of with the uniform distribution. (That is, with denoting the integer valued set indicator function for , and we select the element with probability .)
We are interested in the distribution of for large and , and this turns out to be closely related to a model arising from integral geometry. Namely, let denote the unique (up to a constant) translation- and rotation-invariant measure on the set of directed lines in , and consider the restriction of this measure to the set of directed lines intersecting , normalized such that it becomes a probability measure. Denote by the random variable where is chosen at random using this measure.
Theorem 1**.**
For any dimension , and for any distribution on the starting points, the random variable converges in distribution to the random variable , as we take followed by taking , or vice versa.
The mean free path length has a quite simple geometric interpretation. We have
[TABLE]
where is the -dimensional surface area of the box , is the volume of the box , is the gamma function, and where is the -dimensional surface area of the sphere . The formula in (2) has been proven in a more general setting earlier (see e.g. formula (2.4) in [6]); for further details, see Section 1.1. For the convenience of the reader we give a short proof of formula (2) in our setting in Section 2.2.
Throughout the paper, we will write and for the probability density function and the cumulative distribution function of , respectively, for random variables . We next give explicit formulas for the probability density function of in dimensions two and three.
Theorem 3**.**
For a box of dimension with side-lengths , the probability density function of is given by
[TABLE]
for .
Remark 5**.**
We note that the probability density function in Theorem 3 is analytic on all open subintervals of not containing or Moreover, it is constant on the interval and has singularities of type and just to the right of and , respectively. See Figure 1 for more details. For an explanation of these singularities, see Remark 134.
Theorem 6**.**
For a box of dimension with side-lengths , the probability density function of is given by
[TABLE]
where is the piecewise-defined function given by
[TABLE]
for , and by
[TABLE]
for , and by
[TABLE]
for
Remark 14**.**
We note that the probability density function in Theorem 6 is analytic on all open subintervals of not containing any of the points
[TABLE]
Moreover, it is linear on the interval and has positive jump discontinuities at the points . At the points , it is continuous and differentiable.
Note that the probability distribution gives a larger “weight” to some particles than others, since some particles get more bounces than others for the same distance . One could also consider a similar problem where we send off each particle for a certain number of bounces, and then consider the limit as followed by taking the limit , where is the number of particles. This would give each particle the same “weight”. Denote the finite version of this distribution by and its limit distribution as and then by . With regard to the previous discussion about signal loss, we call the limit distribution of the spreading model and we call the limit distribution of the absorption model. Determining the probability density function of the absorption model appears to be the more difficult problem, and we give a formula only in dimension two:
Theorem 16**.**
For a box of dimension with side-lengths , the random variable converges in distribution to the random variable , as we take followed by taking , where the probability density function is given by
[TABLE]
for , and by
[TABLE]
for , and by
[TABLE]
for .
See Figure 3 for a comparison between the probability density functions for the two different models in dimension .
Remark 22**.**
It is not a priori obvious that the two limit distributions should differ, and it is natural to ask how much, if at all, they differ. We start by remarking that the expression for does not simplify into the expression for ; indeed, for we have but on the interval . For very skew boxes, with and , it is straightforward to show that
[TABLE]
as .
1.1 Discussion
Given a closed convex subset with nonempty interior it is possible to define a natural probability measure on the set of lines in that have nonempty intersection with . The expected length of the intersection of a random line is then, up to a constant that only depends on , given by ; this is known as Santalo’s formula in the integral geometry and geometric probability literature (cf. [14, Ch. 3]).
A billiard flow on a manifold with boundary gives rise to a billiard map (roughly speaking, the phase space is then the collection of inward facing unit vectors at each point ). Given we define the associated free path as the distance the billiard particle, starting at in the direction , covers before colliding with again. As the billiard map carries a natural probability measure we can view the free path as a random variable, and the mean free path is then just its expected value. Remarkably, the mean free path (again up to a constant that only depends on the dimension) is then given by — even for non-convex billiards. This was deduced in the seventies at the Moscow seminar on dynamical systems directed by Sinai and Alekseev but was never published and hence rederived by a number of researchers. For further details and an interesting historical survey, see Chernov’s paper [6, Sec. 2].
In spirit our methods are closely related to the ones used by Barra-Gaspard [1] in their study of the level spacing distribution for quantum graphs, and this turns out to be given by the distribution of return times to a hypersurface of section of a linear flow on a torus. In particular, for graphs with a finite number of disconnected bonds of incommensurable lengths, the hypersurface of section is the “walls” of the torus, and the level spacings of the quantum graph is exactly the same same as the free path length distribution in our setting when all particles have the same starting velocity. (In particular, compare the numerator in (37) for fixed with [1, Equation (49)].)
In [12], Marklof and Strömbergsson used the results by Barra-Gaspard to determine the gap distribution of the sequence of fractional parts of . The gap distribution depends on whether is trancendental, rational or algebraic; quite remarkably the density function for these gaps share a number of qualitative features with the density function for free paths in our setting. Namely, the density functions both have compact support and are smooth apart from a finite number of jump discontinuities. Further, in some cases the density function is constant for small; compare Figure 1 (here ) with [12, Figure 4] (here ). However, there are some important differences: for , left and right limits exist at the jump discontinuities, whereas for , the right limit of is at the jumps (cf. Figure 1.) Further, despite appearences, is not linear near (cf. [12, Figure 1] corresponding to ) whereas for , is indeed linear near (cf. Figure 2).
1.2 Acknowledgements
We would like to thank Z. Rudnick for some very helpful discussions, especially for suggesting the connection with integral geometry. We also thank J. Marklof for bringing references [1, 12] to our attention.
S.H. was partially supported by a grant from the Swedish Research Council (621-2011-5498). P.K. was partially supported by grants from the Göran Gustafsson Foundation for Research in Natural Sciences and Medicine, and the Swedish Research Council (621-2011-5498).
2 Proof of Theorem 1
In this section, we prove Theorem 1. For notational simplicity, we give the proof in dimension three; the general proof for dimensions is analogous.
Given a particle with initial position and initial direction , let be the number of bounce lengths we get from that particle as it has travelled a total distance , and let be the number of such bounce lengths of length at most . The uniform probability distribution on the set of bounce lengths of particles with initial positions and initial directions has the cumulative distribution function
[TABLE]
(Note that the denominator is uniformly bounded from below, which follows from equation (27) below.) By the strong law of large numbers, the function (24) converges almost surely to
[TABLE]
as , where is the probability measure with which we choose the starting points, and is the surface area measure on the sphere . By symmetry, we may restrict the inner integrals to . We now look at the limit of (25) as , and we note that since the integrands are uniformly bounded, we may move the limit inside the integrals by the Lebesgue dominated convergence theorem. Fix one of the integrands, and denote it by . We will show that its limit exists for all and all directions . Moreover, if and denote random variables corresponding to an initial position and an initial direction, respectively, as above, then
[TABLE]
is a random variable with finite variance (and similarly for the terms in the denominator of (24); in particular recall it is uniformly bounded from below), and thus the strong law of large numbers gives that the limit of (24) as , and then almost surely equals (25). This shows that exists almost surely and is equal to .
Consider a particle with initial position and initial direction . By “unfolding” its motion with specular reflections on the walls of the box to the motion along a straight line in — see Figure 4 for a 2D illustration — we see that the particle’s set of bounce lengths is identical to the set of path lengths between consecutive intersections of the straight line segment with any of the planes , . Thus we see that
[TABLE]
for large , and therefore
[TABLE]
as .
Now project the line to the torus where and let us identify the torus with the box ; see Figure 4. Each bounce length corresponds to a line segment which starts in one of the three planes , or and runs in the direction to one of the three planes or . There are line segments which start from the plane , and thus the probability that a line segment starts from the plane is
[TABLE]
as . By the ergodicity of the linear flow on tori (for almost all directions), the starting points of these line segments become uniformly distributed on the rectangle for almost all as ; from here we will assume that is such a direction, and we will ignore the measure zero set of directions for which we do not have ergodicity. Consider one of these line segments and denote its length by and its starting point by . For an arbitrary parameter , we have if and only if or or ; the starting points which satisfy this are precisely those outside the rectangle assuming that and otherwise it is the whole rectangle . The area of that region is
[TABLE]
if and otherwise it is . Since the starting points are uniformly distributed in the rectangle as , it follows that the probability that is
[TABLE]
where is the indicator function which is whenever the condition is true, and [math] otherwise. We get analogous expressions for the case when a line segment starts in the plane or instead. Thus the proportion of all line segments with length at most as is
[TABLE]
which can be written
[TABLE]
Recognizing that both integrands (28) and (35) are independent of the position , we see that the limit of (25) as may be written as
[TABLE]
for all . The corresponding formula in dimensions is given by
[TABLE]
for all , where the side-lengths of the box are and is the surface area measure on . (The denominator can be given explicitly by using Lemma 147 below.)
We have thus proved that the random variable converges in distribution to a random variable with probability density function given by (37) as we take followed by taking , or alternatively, first taking followed by taking . It remains to prove that this distribution agrees with the distribution of the random variable defined in the introduction.
2.1 Integral geometry
We start by recalling some standard facts from integral geometry (cf. [14, 8].) The set of directed straight lines in can be parametrized by pairs where is a unit vector pointing in the same direction as and is the unique point in which intersects the plane through the origin which is orthogonal to . The unique translation- and rotation-invariant measure (up to a constant) on the set of directed straight lines in is where is the surface measure on the plane through the origin orthogonal to , and is the surface area measure on .
Consider the set of directed straight lines in which intersect the box . Now, since is the area of the projection of the box onto the plane for , it follows that the total measure of with respect to is
[TABLE]
where we used symmetry, and the integral may be evaluated by switching to spherical coordinates. It follows that is a probability measure on the set of directed lines intersecting the box . Let be a random directed line with respect to this measure, and define the random variable , as in the introduction. Let us determine the probability that for an arbitrary parameter . By symmetry it suffices to consider only directed lines with . The set of all intersection points between the rectangle and the lines with and direction has area , as in (30), and its projection onto the plane has area
[TABLE]
By symmetry it follows that the area of the set of directed lines with and direction projected down to is
[TABLE]
and it follows that
[TABLE]
which we see is identical to (36), and we have thus proved that converges in distribution to as we take and then . This concludes the proof of Theorem 1.
2.2 Computing the mean value
We will determine the mean value (2) of ; to do this we exploit the integral geometry interpretation of the random variable . By symmetry it suffices to restrict to directed lines with . For fixed , denote by the set of such that the directed line parametrized by intersects . We note that is a volume element of the box for any fixed , and thus integrating over all yields the volume of the box. Hence the mean value is
[TABLE]
In dimensions we get a normalizing factor , so with the aid of the Lemma 147 in the Appendix, it follows that the mean value in dimensions is
[TABLE]
where is the -dimensional surface area of the box , and is the volume of the box .
3 Proof of Theorem 3
Using formula (37) in dimension , we get
[TABLE]
We use polar coordinates so that . Then the above becomes
[TABLE]
The numerator of the second term may be written
[TABLE]
which can be simplified to
[TABLE]
Inserting this into (48) and differentiating yields Theorem 3.
4 Proof of Theorem 6
We will evaluate the cumulative distribution function (36) and then differentiate. The denominator of the second term of (36) is
[TABLE]
as may be evaluated by switching to spherical coordinates. Define
[TABLE]
By symmetry, we have
[TABLE]
and thus we can write the numerator in the second term of (36) as
[TABLE]
Exploiting the symmetries, it suffices to evaluate and (note the order of the arguments to ). We will evaluate these integrals by switching to spherical coordinates, but first we need to parametrize the part of the sphere inside the box .
Lemma 61**.**
Fix . We have
[TABLE]
for any integrable function , where , where
[TABLE]
and where we have used the shorthand \mathopen{}\mathclose{{}\left\{u}\right\}_{1}\mathrel{\mathop{:}}=\min(u,1).
Proof.
We will parametrize the set of points on the sphere such that
[TABLE]
Switch to spherical coordinates . The non-negativity conditions of (71) are equivalent to the condition . For such angles, the condition is equivalent to
[TABLE]
and the conditions are equivalent to
[TABLE]
The interval (74) is non-empty for precisely those such that since
[TABLE]
Thus we may restrict to the interval given by the inequalities
[TABLE]
Note that we have for all since
[TABLE]
We conclude that we can write
[TABLE]
For , note that is defined precisely when \sin^{-1}\mathopen{}\mathclose{{}\left\{\frac{a}{t}}\right\}_{1}\leq\theta and that is defined precisely when \sin^{-1}\mathopen{}\mathclose{{}\left\{\frac{b}{t}}\right\}_{1}\leq\theta. We have if and only if , and we have if and only if . Moreover we note that we always have .
Let us rewrite the integration limits in the right-hand side of (80) in terms of and . A priori, we need to distinguish between the two cases and . If then we get
[TABLE]
If on the other hand then
[TABLE]
which we see is identical to (83). Combining (80) and (83) we get the conclusion of the lemma. ∎
Applying Lemma 61 we get
[TABLE]
An antiderivative of the integrand with respect to is , and thus the above is
[TABLE]
Next consider
[TABLE]
An antiderivative of the integrand with respect to is , and thus the above is
[TABLE]
We obtain and by switching the roles of in (98). We remark that trying to obtain and directly, by integrating and , respectively, by first integrating with respect to , taking the limits and , and then finding an antiderivative with respect to , seem to result in much more complicated expressions.
Finally consider
[TABLE]
An antiderivative of the integrand with respect to is , and thus the above is
[TABLE]
where the last integral inside the parentheses may be written as
[TABLE]
whenever , by using the fact that \frac{1}{2}\mathopen{}\mathclose{{}\left(x\sqrt{c-x^{2}}+c\tan^{-1}\mathopen{}\mathclose{{}\left(\frac{x}{\sqrt{c-x^{2}}}}\right)}\right) is an antiderivative of with respect to when is a constant. We obtain and by switching the roles of in (104).
It remains to insert the limits into the antiderivatives (91), (98) and (104) above. Noting that are expressed in terms of piecewise-defined functions, the following manipulations will be useful. For any function , we have
[TABLE]
where . Similarly,
[TABLE]
where , and
[TABLE]
and similarly, can be written as
[TABLE]
With this we can evaluate . But since we know that we will get a function symmetric with respect to the values , it suffices to keep only those terms with and , say, and then the other terms may be evaluated by just switching the order of . Upon inserting the limits and differentiating, one obtains (after tedious calculations) that
[TABLE]
where
[TABLE]
Rewriting as a piecewise function, we get Theorem (6).
5 Proof of Theorem 16
Consider the distribution of the random variable . Since we record the same number of bounces for each choice of angle we may replace the -particle system with a one particle system as follows: randomly select, with uniform distribution, the angle and generate bounce lengths and randomly select one of these bounce lengths (with uniform distribution); by the strong law of large numbers, converges in distribution to as .
We now determine the limit distribution of . As before, we first unfold the motion, and replace motion in a box with specular reflections on the walls with motion in ; see Figure 4. The path lengths between bounces is then the same as the lengths between the intersections with horizontal or vertical grid lines. To understand the spatial distribution, we project the dynamics to the torus where is the lattice
[TABLE]
and we may identify the torus with the rectangle .
Let us first consider the motion of a single particle with an arbitrary initial position, and direction of motion given by an angle . Taking symmetries into account, we may assume that . (Note that gives a probability measure on these angles.) If the particle travels a large distance , the number of intersections with horizontal, respectively vertical, grid lines is , respectively . Thus, in the limit , the probability of a line segment beginning at a horizontal (respectively vertical) grid line is given by , respectively (here we suppress the dependence on ) where
[TABLE]
The unfolded flow on the torus is ergodic for almost all , and thus the starting points of the line segments becomes uniformly distributed as for almost all .
Let
[TABLE]
Since , we obtain that
[TABLE]
Let denote the angle of the diagonal in the box, and assume that . We then observe the following regarding the line segment lengths.
First, if the segment begins at a horizontal line, it must end at a vertical line, and the possible lengths of these segment lie between [math] and . We find that these lengths are uniformly distributed in since the starting points of the segments are uniformly distributed.
On the other hand, if the line segment begins at a vertical line, it can either end at a vertical or horizontal line. Since the starting points are uniformly distributed, the former happens with probability
[TABLE]
and the length of the segment is again uniformly distributed in , whereas the latter happens with probability
[TABLE]
in which case the segment is always of length .
Now, implies that , and noting that
[TABLE]
we find that the probability of observing a line segment of length is the sum of a “singular part” (the segment begins and ends on vertical lines; note that all such segments have the same lengths) and a “smooth part” (the segment does not begin and end on vertical lines). Moreover, the smooth part contribution equals
[TABLE]
which, on inserting (119), equals
[TABLE]
On the other hand, the “singular part contribution”, provided , to the probability of a segment having length equals
[TABLE]
In case , a similar argument (we simple reverse the roles of and ) shows that the smooth contribution equals
[TABLE]
and that the singular contribution (if ) equals
[TABLE]
Thus, if we let denote the “singular contribution” to the probability density function we find the following: if , then
[TABLE]
if , then
[TABLE]
and if , then
[TABLE]
Remark 134**.**
Note that has a singularity of type just to the right of (and similarly just to the right of ). In a sense this singularity arises from the singularity in the change of variables since . The reason for the singularities in the spreading model for is similar, as the spreading model can be obtained from the absorption model by a smooth change of the angular measure.
Similarly, the “smooth part” of the contribution is (for ) given by
[TABLE]
Hence the probability density function of the distribution of the segment length is given by
[TABLE]
We will now evaluate . An antiderivative of with respect to for is
[TABLE]
where for \mathopen{}\mathclose{{}\left|z}\right|<1. (A quick calculation shows that whenever .) We can rewrite (137) as
[TABLE]
By l’Hôpital’s rule we have
[TABLE]
so the limit of (137) as is
[TABLE]
The limit of (137) as is
[TABLE]
Thus, assuming , we can write as
[TABLE]
if , or as
[TABLE]
if or as
[TABLE]
if . Adding to this, we get Theorem 16.
Appendix A Calculation of an integral
Lemma 147**.**
Write for the -dimensional surface area of the sphere . Then we have
[TABLE]
where is the part of the sphere with positive coordinates.
Proof.
We may parametrize with
[TABLE]
for . We have the spherical area element
[TABLE]
Thus we get
[TABLE]
Introducing an additional integration variable , we recognize the integrand as the spherical area element in dimensions, and thus the above is
[TABLE]
since . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Barra and P. Gaspard. On the level spacing distribution in quantum graphs. J. Statist. Phys. , 101(1-2):283–319, 2000.
- 2[2] C. Boldrighini, L. A. Bunimovich, and Y. G. Sinaĭ. On the Boltzmann equation for the Lorentz gas. J. Statist. Phys. , 32(3):477–501, 1983.
- 3[3] J. Bourgain, F. Golse, and B. Wennberg. On the distribution of free path lengths for the periodic Lorentz gas. Comm. Math. Phys. , 190(3):491–508, 1998.
- 4[4] L. A. Bunimovich and Y. G. Sinaĭ. Statistical properties of Lorentz gas with periodic configuration of scatterers. Comm. Math. Phys. , 78(4):479–497, 1980/81.
- 5[5] M. Bäckström, S. Holmin, P. Kurlberg, D. Månsson. Randomized Ray Tracing for Modeling UWB Transients in a Reverberation Chamber. In preparation.
- 6[6] N. Chernov. Entropy, Lyapunov exponents, and mean free path for billiards. J. Statist. Phys. , 88(1-2):1–29, 1997.
- 7[7] F. Golse and B. Wennberg. On the distribution of free path lengths for the periodic Lorentz gas. II. M 2AN Math. Model. Numer. Anal. , 34(6):1151–1163, 2000.
- 8[8] D. A. Klain and G.-C. Rota. Introduction to geometric probability . Lezioni Lincee. [Lincei Lectures]. Cambridge University Press, Cambridge, 1997.
