On the uniqueness of trapezoidal four-body central configurations
Manuele Santoprete

TL;DR
This paper proves that for the Newtonian four-body problem, there is at most one trapezoidal central configuration for each cyclic order of the masses, using a topological approach.
Contribution
It establishes the uniqueness of trapezoidal four-body central configurations for each mass ordering, a new result in celestial mechanics.
Findings
Proves at most one trapezoidal configuration per mass order
Uses topological methods to establish uniqueness
Contributes to understanding of four-body central configurations
Abstract
We study central configurations of the Newtonian four-body problem that form a trapezoid. Using a topological argument we prove that there is at most one trapezoidal central configuration for each cyclic ordering of the masses.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the uniqueness of trapezoidal four body central configurations
Manuele Santoprete Department of Mathematics, Wilfrid Laurier University E-mail: [email protected]
Abstract
We study central configurations of the Newtonian four-body problem that form a trapezoid. Using a topological argument we prove that there is at most one trapezoidal central configuration for each cyclic ordering of the masses.
1 Introduction
A central configuration (c.c.) of the Newtonian -body problem is a special arrangement of point masses with the property that the gravitational acceleration vector produced on each mass by all the others points toward the center of mass and is proportional to the distance to the center of mass.
The central configurations of the three body problem have been known for a long time. In the three-body problem, up to symmetry, there are exactly five relative equilibria, they are the Eulerian (or collinear) configurations discovered by Euler in 1767, and the Lagrangean configurations discovered by Lagrange in 1772. In the Eulerian configuration all the masses belong to the same line, while in the Lagrangean configurations the masses form an equilateral triangle.
Collinear configurations are also well understood. Moulton [24] provided an exact count of the number of collinear configurations of bodies: modulo symmetries, there are central configurations. Also well understood are the -dimensional configurations of masses. In this case there is a unique central configuration: the regular simplex. For instance, for four masses the only three-dimensional central configuration is the regular tetrahedron.
If all the masses are equal we have a complete classification of central configuration for and . For the classification is due to Albouy [2, 3]. In this case the only noncollinear planar central configurations are the square, the equilateral triangle with a mass in the baricenter and an isosceles triangle with a mass on the line of symmetry. For and the classification is given using a computer assisted proof [22]. See also [19] were a complete classification of the isolated central configurations of the 5-body problem was given (note, however, that the approach used in this paper has a numerical component).
As soon as we go to the planar four-body problem, however, there is sufficient complexity to prevent a complete classification of noncollinear central configurations. For general masses we know that there is a finite number of central configurations of four bodies [15], but we don’t even have an exact count of the number of c.c.’s. In a recent paper [10], however, Corbera, Cors and Roberts provided a description of the set of convex central configurations and gave a clear picture of how the special subcases (i.e. trapezoidal, co-circular and kite-shaped, and equidiagonal central configurations) are situated within the broader set. Even less is known for the five-body problem where the finiteness of the number of central configurations was proven for arbitrary positive masses, except for a given codimension 2 subvariety of the mass space [7].
There are several reasons why c.c.’s play an important role in celestial mechanics. Central configurations lead to the only explicit solutions of the -body problem. For instance, if all masses are released from a central configuration with zero initial velocity they accelerate in such a way that the configuration collapses homotethically. The result is a solution in which all the masses collide together after a finite time.
Furthermore, a planar central configuration gives rise to a family of periodic solutions. Given the appropriate initial conditions each particle will follow an elliptical orbit as in the Kepler problem. In this motion the configuration remains similar to the initial configurations, varying only in size. For instance, Eulerian configurations generate a periodic solution where each of the masses follows an elliptical orbit and the masses always lie on a common line, see Figure 1. Similarly, if at the initial moment the masses form an equilateral triangle and if suitable velocities are chosen, then the masses will move periodically on ellipses, as in Figure 2.
Central configurations also play an important role in the study of the topology of the integral manifolds of the -body problem. An integral manifold is a subset of phase space obtained by fixing the values of the integrals of motion of the body problem (e.g. energy, angular momentum). Smale [30, 31] showed that central configurations are associated with changes in the topology of the integral manifolds. Since the integral manifolds have the property that if then the orbit through is contained in , Smale suggested that the topological type of can provide a crude, but important, invariant of the orbits [30, 31] of the planar -body problem. Therefore, an understanding of the central configurations gives information on the topology of the integral manifolds which in turn gives rough information on the orbits of the system. The situation for the spatial -body problem is more complicated and was addressed by Albouy [1].
In this work we will concentrate on the convex planar central configurations of four bodies. A planar configuration is convex if no body lies inside or on the convex hull of the other three bodies; otherwise, it is called concave. MacMillan and Bartky showed that for any four masses and any ordering of the bodies, there exists at least one convex central configuration [21], Xia [33] provided a simpler proof. It is an open question as to whether this solution must be unique. Yoccoz [34] conjectured that this solution is unique
Conjecture 1** (Simó-Yoccoz).**
There is a unique convex planar central configuration of the 4-body problem for each ordering of the masses in the boundary of its convex hull.
This conjecture likely arose from discussions between Yoccoz and Simó, and hence it seems appropriate to call it the Simó-Yoccoz conjecture. This problem is also included on a published list of open questions in celestial mechanics [4], and has often been attributed to Albouy and Fu [5]. The conjecture is known to have a positive answer in the cases all the masses are equal [2, 3], in the case some of masses are equal [20, 26, 6, 13], and in the case three of the masses are small [8]. Some related results were also obtained for point vortices in the case some of the vorticities are equal [16, 27], where it is possible to give a complete classification. Recently, the conjecture was verified also in the case of the co-circular four body problem [29]. In this paper we will show that this conjecture is also true for the trapezoidal four-body problem, which considers the case where the masses form a trapezoid. The central configurations of the trapezoidal four-body problem were studied in detail in [28, 9]. The uniqueness of trapezoidal central configurations was recently proved for the particular case of two pairs of equal masses in the case of power-law potentials [14]. The main goal of this paper is to prove the following theorem:
Theorem 1**.**
There is at most one trapezoidal central configuration of four bodies for each cyclic ordering of the masses.
The method of proof is similar to the one employed for the co-circular four body problem. The main idea of the proof is to use mutual distances as coordinates and replace the Cayley-Menger determinant condition used by Dziobek [12] with a simpler condition which comes from the geometry of trapezoids [17, 28]. It is then possible to show that the critical points of the potential restricted to a certain subvariety are all minimum points. Knowing the Euler characteristic of the variety one can then use Morse theory to prove the theorem.
The paper is organized as follows. In Section 2 we introduce the n-body problem and define central configurations. In Section 3 we write four-body central configurations in terms of mutual distances between the bodies. In Section 4 we define trapezoidal configurations and find their equations following the approach of [28]. In particular we view such configurations as critical points of the potential restricted to a certain space that we call . In Section 5 we prove Theorem 1 using Morse theory. This is done in several steps. In Proposition 3 we show that all the critical points are nondegenerate local minima. In Lemma 5 we obtain the Euler characteristic of the space . In Lemma 6 we use Morse theory and the Euler characteristic of to prove that the potential restricted to has a unique critical point. We then use this last result to prove Theorem 1.
2 Central Configurations of the -body problem
The Newtonian -body problem concerns the motion of point particles of masses and positions , where . Let , let be the Euclidean distance between the masses and , and let be the vector of mutual distances. The equations of motion are given by
[TABLE]
where is the Newtonian potential
[TABLE]
which we denote by when viewed as a function of the mutual distances . Without any loss of generality we can assume that the center of mass of the particles is at the origin: . Denote by the moment of inertia as a function of
[TABLE]
and by the moment of inertia as a function of the distances.
A central configuration of the -body problem is a configuration which satisfies the algebraic equation
[TABLE]
where is a Lagrange multiplier. Hence, a central configuration is simply a critical point of subject to the constraint .
The central configuration equation (1) is invariant under rotations, reflections and dilations. It is standard to say that two configurations and are *equivalent * if can be transformed to by a rotation and a dilation. As a consequence, by convention, central configurations are usually counted up to rotations and dilations. This convention is also used in the statement of Conjecture 1 and of Theorem 1.
We define the dimension of a configuration , denoted , to be the dimension of the subspace spanned by the vectors . Then, we say that is a Dziobek configurations if [23].
In the four-body problem is a Dziobek central configuration if it is a central configuration with , that is, in this case, the set of Dziobek configurations coincide with the set of planar, non-collinear, central configurations.
3 Central Configurations in terms of distances
For four bodies it is convenient to recast the equations defining Dziobek central configuration, so that the variables are the distances between the particles rather than their coordinates. Since the mutual distances determine the configuration up to rotation and reflection symmetry, this choice not only reduces the number of variables but also removes the rotational and reflectional degeneracy. The dilational degeneracy can then be eliminated by fixing the size of the configuration with the restriction .
Let be a vector of non-negative mutual distances, and let the Cayley–Menger determinant of four points be
[TABLE]
where is the volume of the configuration. It is important to note that not all vectors realize actual configurations of four bodies in . Therefore, we typically want to restrict our attention to configurations that can be realized in . For this purpose we consider the sets
[TABLE]
and
[TABLE]
We say that a vector of mutual distances is geometrically realizable if and that is a normalized Dziobek configuration if .
Thus we have the following characterization of planar four body central configurations given by Dziobek:
Proposition 1**.**
*Let be a Dziobek configuration, let be its corresponding normalized Dziobek configuration, and let be the restriction of the Newtonian potential to . Then, is a Dziobek central configuration if and only if is a critical point of . *
Since equations (1) are invariant under rotations, dilations and reflections in the plane, we can consider two relative equilibria as equivalent if they are related by these symmetry operations. This defines an equivalence relation , different from the more standard one introduced in section 2. Let be the set of equivalence classes with respect to , then the set of equivalence classes is in a one-to-one correspondence with the set of critical points of the function .
To find the equation for the critical points of we need to write the gradient of restricted to . The following formula due to Dziobek [12]
[TABLE]
is particularly useful for this purpose. Here, denotes the signed area of the triangle whose vertices contain all bodies except for the -th body. This formula is valid when restricting to planar configurations. A generalization of this formula that also works for non planar configurations uses oriented areas and can be found in [18].
4 Trapezoidal Configurations
In this section we study trapezoidal central configurations. Since we use mutual distances as coordinates, we cannot distinguish between bodies ordered counterclockwise and bodies ordered clockwise. Hence, we introduce the following terminology: we say that the bodies are *ordered sequentially * if they are numbered consecutively while traversing the boundary of the quadrilateral in any direction.
Without loss of generality, we may assume that any trapezoid is ordered sequentially so that and are the lengths of the diagonals. This is justified because we can always relabel the bodies so that they are ordered sequentially. Denote
[TABLE]
Let be the set of geometrically realizable satisfying , that is
[TABLE]
Moreover, we define and as follows:
[TABLE]
and
[TABLE]
Let us denote by and by the sets and in the case . We can also define the set
[TABLE]
which will play an important role, in this paper.
There is an interesting relationship between the conditions and , which is outlined in the following lemma
Lemma 1**.**
If , then . In other words on the set of geometrically realizable vectors for which the configuration of four bodies is coplanar.
Proof.
A computation shows that
[TABLE]
where
[TABLE]
and
[TABLE]
Note that equation (3) is the analogue of equation (12) in [25] for cyclic quadrilaterals. If we have
[TABLE]
Since implies that , it follows that we must have , which concludes the proof. ∎
Since trapezoidal central configurations are Dziobek configuration we can give the following definition
Definition 1**.**
The configuration vector is a sequentially ordered trapezoidal central configuration if and only if its corresponding distance vector belongs to and it is a critical point of with respect to .
In terms of Lagrange multipliers this means that is a sequentially ordered trapezoidal four body central configuration if and only if it is a critical point of the function
[TABLE]
satisfying , and , where , and are Lagrange multipliers. The following lemma shows that and are parallel on the set of geometrically realizable configurations with . See [28] for a different proof. A similar result was obtained by Cors and Roberts for the co-circular four body problem [11].
Lemma 2**.**
For any
[TABLE]
where , with the height of the trapezoid. In other words, on the set of geometrically realizable vectors for which vanish, the gradients of and are parallel.
Proof.
Since , we have that
[TABLE]
Since , then . It follows that as well. Hence, .
We now want to show that, in this case, has a meaningful geometric interpretation and can be written in terms of the height of the trapezoid. For a convex quadrilateral ordered sequentially we can choose the signed areas so that and . In a trapezoid these signed areas are
[TABLE]
where is the height of the trapezoid, namely the distance between the opposite parallel sides. From (2) we get
[TABLE]
and hence, at a trapezoidal central configuration, we have
[TABLE]
On the other hand, the gradient of at a trapezoidal configuration is
[TABLE]
from which it follows that . ∎
Remark**.**
In the previous lemma we showed that . Note that this equality is not trivial. In fact, solving for we find the following formula for the height of a trapezoid as a function of the mutual distances:
[TABLE]
This formula is different from the well known one given in [17, 32, 28], and has the advantage of working even when the bases of the trapezoid have the same length.
We now have the following characterization of trapezoidal configurations [28]:
Proposition 2**.**
Let . Then is a critical point of , the restriction of to , if and only if is a critical point of the function . Therefore the vector is a sequentially ordered trapezoidal four-body c.c. if and only if the corresponding distance vector is a critical point of the Lagrangian function
[TABLE]
satisfying , and , where and are Lagrange multipliers.
Proof.
Recall that is the orthogonal projection of onto the tangent space , which is given by
[TABLE]
Similarly, is the orthogonal projection of onto the tangent space , which is given by
[TABLE]
Since , by Lemma 2, . It follows that, if , then , and hence for any . Then if and only if , that is, is a critical point of if and only if is a critical point of the function . ∎
By Proposition 2, we can find the critical points of that lie in by finding the critical points of which lie in . The equations of the critical points of , are given by , the gradient of the Lagrangian . Explicitly, we have
[TABLE]
Note that these equations hold for , and not just for . When , the solutions of these equations give trapezoidal central configurations.
The equations have been grouped in pairs so that when they are multiplied together the product of the right-hand sides is . Consequently, from equations (5),(6) and (7) we obtain three equations for :
[TABLE]
5 Uniqueness of Trapezoidal configurations
In this section we prove Theorem 1. The strategy of the proof is as follows.
We first show that if is a critical point of , then it is necessarily a nondegenerate local minimum. This is proved in Proposition 3. Lemma 3 is a technical lemma required to prove Proposition 3.
We then study the topology of and . In Lemma 4 we show that . In Lemma 5 we show that the Euler characteristic of is .
Finally we use Morse theory to prove that the function has a unique critical point on . This is done in Lemma 6. The proof of the theorem follows immediately.
We start with the following technical lemma which is needed in the proof of Proposition 3.
Lemma 3**.**
If is a critical point of then .
Proof.
Suppose, for the sake of contradiction, that . By the first of the two equation in (8) we find that
[TABLE]
and hence , since in . By the first of the two equation in (9) we find that
[TABLE]
and hence , which contradicts the fact that . It follows that .
∎
Note that the second derivative of of with respect to the variable is the matrix
[TABLE]
This second derivative, with appropriate choices of and is the second derivative of , at the critical points. We can now prove the following proposition
Proposition 3**.**
If is a critical point of then is a nondegenerate minimum point for .
Proof.
The second derivative of is the matrix
[TABLE]
where , denotes a diagonal matrix, and denotes the anti-diagonal matrix whose entries on the anti-diagonal are . As we observed earlier, coincides with evaluated at the critical point .
Let be the principal minor of order of . We first prove that if satisfies equations (5-7), then for .
Let
[TABLE]
Since , eliminating using equations (5-7) yields
[TABLE]
Furthermore, eliminating from using (8) gives
[TABLE]
Since by Lemma 3, it is easy to see that all the principal minors are positive:
[TABLE]
It follows that is positive definite, and is a nondegenerate local minimum of . ∎
Remark**.**
Note that using the condition instead of in this problem does not make a big difference when computing the gradient, but it leads to much simpler computations when computing the second derivative. This can be seen from the following computation. Recall that if , then is an matrix whose entries are the partial derivatives of at , while is a matrix whose entries are the partial derivatives of at . We compute the Hessian of by computing the derivative of equation (4) and we obtain
[TABLE]
where the dot represents matrix multiplication. Since at a trapezoidal c.c. we have that and , it follows that
[TABLE]
which is much more complicated than .
We now turn to study the topology of and .
Lemma 4**.**
.
Proof.
Since in this case , the equation for the moment of inertia, reduces to
[TABLE]
which defines a sphere. Adding to this equation gives
[TABLE]
subtracting from it gives
[TABLE]
which shows that the manifold is diffeomorphic to , provided that .
∎
We can now better understand the topology of .
Lemma 5**.**
*The Euler characteristic of is . *
Proof.
Suppose , and consider the change of variables
[TABLE]
equations (12) and (13) can be rewritten in the form
[TABLE]
Clearly the set is homeomorphic to , the subset of defined by the following inequalities
[TABLE]
The inequalities for and can be expressed more compactly as , which clearly implies . The inequalities select a spherical triangle corresponding to one octant of the sphere . Such spherical triangle is homeomorphic to a closed disk, and can be represented with coordinates in the set . Corresponding to each point there is a region on the sphere defined by the inequalities , and . Clearly we have
[TABLE]
The region is homeomorphic to a region on the plane defined by the inequalities
[TABLE]
If , then and the region is an arc of the unit circle. If , then is a quarter unit disk. In all other cases is a quarter of an annular ring, see 3.
It follows that is always contractible, and so is .
The restriction of the projection , induces a fibration with base space and fibers given by . Hence, the projection is a fibration with contractible fibers. Since is also contractible, it follows that is contractible, and hence when .
Consider the rays having the origin as a initial point. Each of these rays intersect , the region of the sphere defined by equation (11) satisfying the inequalities , in exactly one point. Each ray also intersects , the region of the ellipsoid of inertia such that , in one point. Thus the points of are in one-to-one correspondence with the points of . Let be the homeomorphism defined by the rays having the origin as initial point. Since defines a cone, and , then . Since the restriction of an homeomorphism to a subset is still a homeomorphism, it follows that . Hence, , which concludes the proof.
∎
Since we have determined the topology of we can now use Morse theory to prove the following Lemma
Lemma 6**.**
The function has a unique critical point on .
Proof.
The proof is analogous to the proof of Lemma 6 in [29], and to Smale’s proof of Moulton’s theorem for the collinear -body problem [31] (which however, is presented without details). We repeat it here for convenience of the reader. By Proposition 3 any critical point is a nondegenerate local minimum of the function , and hence is a Morse function that tends to as nears , the boundary of . Therefore, the function admits a global minimum value in the interior of . Suppose there are several global minimum points where the function obtains its least possible value. By Proposition 3 any of such point must be a non-degenerate local minimum point. By Lemma 5, the Euler characteristic of is . By Morse theory we have
[TABLE]
where the sum is over the critical points, is the Morse index of the critical points and is the number of critical points of index . We know that there is at least one local minimum, and that all the critical points of are local minimum points and hence have index [math]. However, this function cannot have more than one minimum point since otherwise, equation (14) would imply the existence of at least one non-minimum critical point, contradicting Proposition 3. ∎
We are finally in a position to prove Theorem 1, our main result
Proof of Theorem 1.
Recall that, by Proposition 3, trapezoidal central configurations correspond to distance vectors that are critical points of the function . Lemma 6 shows that has a unique critical point on . Since , there is at most one critical point of on . Hence, we have shown that there is a most one trapezoidal central configurations for each ordering of the masses, and the theorem follows. ∎
Acknowledgments
I would like to thank Alessandro Portaluri and Shengda Hu, for interesting discussions related to this work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Alain Albouy. Integral manifolds of the n-body problem. Invent. math , 114:463–488, 1993.
- 2[2] Alain Albouy. Symétrie des configurations centrales de quatre corps. Comptes rendus de l’Académie des sciences. Série 1, Mathématique , 320(2):217–220, 1995.
- 3[3] Alain Albouy. The symmetric central configurations of four equal masses. Contemporary Mathematics , 198:131–136, 1996.
- 4[4] Alain Albouy, Hildeberto E Cabral, and Alan A Santos. Some problems on the classical n-body problem. Celestial Mechanics and Dynamical Astronomy , 113(4):369–375, 2012.
- 5[5] Alain Albouy and Yanning Fu. Euler configurations and quasi-polynomial systems. Regular and Chaotic Dynamics , 12(1):39–55, 2007.
- 6[6] Alain Albouy, Yanning Fu, and Shanzhong Sun. Symmetry of planar four-body convex central configurations. In Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences , volume 464, pages 1355–1365. The Royal Society, 2008.
- 7[7] Alain Albouy and Vadim Kaloshin. Finiteness of central configurations of five bodies in the plane. Annals of Mathematics , pages 535–588, 2012.
- 8[8] Montserrat Corbera, Josep Cors, Jaume Llibre, and Richard Moeckel. Bifurcation of relative equilibria of the (1+ 3)-body problem. SIAM Journal on Mathematical Analysis , 47(2):1377–1404, 2015.
