Isotropic-Nematic Phase Transitions in Gravitational Systems
Zacharias Roupas, Bence Kocsis, Scott Tremaine

TL;DR
This paper models dense stellar systems with a central black hole as liquid crystal-like phases, revealing phase transitions between ordered disk-like and disordered isotropic states based on angular momentum.
Contribution
It introduces a novel analogy between gravitational systems and liquid crystal phases, demonstrating phase transitions driven by angular momentum in such systems.
Findings
Existence of ordered and disordered phases in stellar systems
Identification of a first-order phase transition at a critical angular momentum
Discovery of metastable states with mutually inclined disks
Abstract
We examine dense self-gravitating stellar systems dominated by a central potential, such as nuclear star clusters hosting a central supermassive black hole. Different dynamical properties of these systems evolve on vastly different timescales. In particular, the orbital-plane orientations are typically driven into internal thermodynamic equilibrium by vector resonant relaxation before the orbital eccentricities or semimajor axes relax. We show that the statistical mechanics of such systems exhibit a striking resemblance to liquid crystals, with analogous ordered-nematic and disordered-isotropic phases. The ordered phase consists of bodies orbiting in a disk in both directions, with the disk thickness depending on temperature, while the disordered phase corresponds to a nearly isotropic distribution of the orbit normals. We show that below a critical value of the total angular momentum,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Isotropic–Nematic Phase Transitions in Gravitational Systems
Zacharias Roupas
Institute of Physics, Eötvös University, Pázmány P. s. 1/A, Budapest, 1117, Hungary
Bence Kocsis
Institute of Physics, Eötvös University, Pázmány P. s. 1/A, Budapest, 1117, Hungary
Scott Tremaine
Institute for Advanced Study, Princeton, NJ 08540, USA
Abstract
We examine dense self-gravitating stellar systems dominated by a central potential, such as nuclear star clusters hosting a central supermassive black hole. Different dynamical properties of these systems evolve on vastly different timescales. In particular, the orbital-plane orientations are typically driven into internal thermodynamic equilibrium by vector resonant relaxation before the orbital eccentricities or semimajor axes relax. We show that the statistical mechanics of such systems exhibit a striking resemblance to liquid crystals, with analogous ordered-nematic and disordered-isotropic phases. The ordered phase consists of bodies orbiting in a disk in both directions, with the disk thickness depending on temperature, while the disordered phase corresponds to a nearly isotropic distribution of the orbit normals. We show that below a critical value of the total angular momentum, the system undergoes a first-order phase transition between the ordered and disordered phases. At the critical point the phase transition becomes second-order while for higher angular momenta there is a smooth crossover. We also find metastable equilibria containing two identical disks with mutual inclinations between and .
1. Introduction
The thermodynamics and statistical mechanics of a self-gravitating gaseous or stellar system is an intriguing subject with a long history (Tolman, 1934; Lynden-Bell, 1967; Lynden-Bell & Wood, 1968; Shu, 1978; Tremaine et al., 1986; Padmanabhan, 1990) and many recent developments (Gurzadian & Savvidy, 1986; Nakamura, 2000; Votyakov et al., 2002b; de Vega & Sánchez, 2002a, b; Arad & Lynden-Bell, 2005; Chavanis, 2006; Axenides et al., 2012; Touma & Tremaine, 2014; Bar-Or & Alexander, 2014; Roupas, 2015; Tremaine, 2015). Astrophysical applications include star clusters, galaxies, dark matter halos, and galaxy clusters. Statistical mechanics offers the hope of describing the macroscopic equilibrium structure of astrophysical systems without tracking the microscopic, particle-by-particle, evolution. The traditional approach, in which thermodynamic equilibrium is found by maximizing the entropy, does not work straightforwardly in self-gravitating systems for several reasons. First, there is no global entropy maximum for an isolated self-gravitating gaseous or stellar system (Antonov, 1962), although long-lived metastable thermal equilibrium states (i.e., local entropy maxima) can exist if the system is confined to a finite volume (Lynden-Bell & Wood, 1968; Chavanis, 2005). In addition, depending on the number of objects and the size of the system, the stochastic process by which the system evolves toward statistical equilibrium is often too slow for the system to approach the equilibrium state during its lifetime and therefore relaxation remains incomplete (Binney & Tremaine, 2008).
In the traditional thermodynamic approach, employed in most of the above references, the fundamental microscopic entities are stars, molecules, or elementary particles, considered to be point masses, which we label “bodies”. However, in contrast to systems with only short-range forces, the phase-space trajectories of bodies in self-gravitating systems are in many cases endowed with a special structure where the location of each body is restricted over long intervals to a bounded region of phase space, usually of lower dimensionality (Sridhar & Touma, 1999). This occurs if the dynamical system admits isolating integrals of motion (Kandrup, 1998; Binney & Tremaine, 2008) and allows the introduction of the familiar concept of the orbit. Rauch & Tremaine (1996) showed that certain degrees of freedom that describe the orbits may relax much faster than others, in a process they called resonant relaxation (see also Chavanis 2012; Fouvry et al. 2015). For example, bodies bound to a dominant central point mass execute eccentric orbits described by an angular-momentum vector and an eccentricity vector, which relax much faster than the semimajor axis. The phase-space distribution may then attain partial thermodynamic equilibrium, in which the rapidly evolving degrees of freedom are driven to internal thermodynamic equilibrium while others are frozen-in at their initial values.
Another example is a system dominated by a spherically symmetric potential, in which the directions of the angular-momentum vectors relax much faster than either the magnitudes of the angular-momentum vectors or the energies of the orbits (equivalently, much faster than either the eccentricities or semimajor axes). This process is called vector resonant relaxation (VRR) (Rauch & Tremaine, 1996; Hopman & Alexander, 2006; Gürkan & Hopman, 2007; Eilon et al., 2009; Kocsis & Tremaine, 2011, 2015), and is the subject of this work.
Gravitational systems that relax through VRR are conceptually similar to liquid crystals, as we will describe in more detail later in this paper. In normal liquids the microscopic degrees of freedom are the positions and momenta of the molecules, while their size may be ignored. However, in certain liquids, the molecules’ size and shape gives rise to new types of macroscopic structures. The liquid may acquire properties similar to a crystal: a macroscopic alignment of the molecules’ orientation with discrete rotational symmetries. For example in the nematic phase of a liquid crystal, axisymmetric molecules align in parallel or antiparallel configurations (Singh, 2002). Similarly, in gravitational systems subject to VRR, the basic microscopic entities, the orbits, behave as solid bodies. Their orientational degrees of freedom relax into an internal thermodynamic equilibrium. The correspondence with liquid crystals is due to the similarity between the Coulomb-type electro- and magnetostatic interaction acting between axisymmetric molecules and the Newtonian gravitational interaction acting between orbits. We shall show that there is a gravitational phase transition between disk+halo and isotropic phases111Note that this is different from the gravitational phase transitions commonly discussed in the literature, which usually refer to a transition between collapsed (core-halo structure) and diffuse phases in spherical systems (Aronson & Hansen, 1972; Stahl & Kiessling, 1994; Chavanis, 2006)., which strongly resembles the nematic-isotropic phase transition of liquid crystals.
Furthermore, we identify one more connection with condensed-matter physics, the existence of negative-temperature equilibria for which entropy decreases with increasing energy (Kocsis & Tremaine, 2011). Generally, negative-temperature equilibria may arise if the energy has an upper bound. In nuclear spin systems, the relaxation time of spin-spin interactions is typically much smaller than that of spin-lattice interactions (Ramsey, 1956), leading to a thermodynamic equilibrium for the former while the latter remain frozen at their initial values (Landau & Lifshitz, 1980). These systems admit negative-temperature configurations, in which most spins are anti-aligned with respect to the magnetic field. In VRR the role of spin is played by the orbital angular momentum and that of the magnetic field by the thermodynamic variable conjugate to the total angular momentum, the “rotation” (Eq. 20).
For a specific example of VRR, let us consider nuclear star clusters (NSCs), which consist of stars orbiting a supermassive black hole (SMBH). NSCs are found at the centers of galaxies and include the densest stellar systems in the Universe (Norris et al., 2014). The number density of various stellar types in NSCs increases steeply inwards with radius as to within the radius of influence of the SMBH, which is for the NSC at the center of the Milky Way (Schödel et al., 2009, 2014; Bartko et al., 2009; Yelda et al., 2014; Chatzopoulos et al., 2015). In the Milky Way’s NSC, the low-mass old stars are spherically distributed. There is also a prominent population of young stars at radii between about 0.03 and .222Based on their colors and spectra these are O-type main-sequence stars, which have an estimated age of 4-6 Myr. The distribution of orbital planes of the young stars exhibits several structures with debated physical origin, including a disk in the inner region –, a clumpy, disordered, roughly spherical structure in the intermediate region – and a warped, twisted disk in the outer region – (Bartko et al., 2009; Genzel et al., 2010). A fraction of stars in the intermediate region reside in a disk that rotates clockwise as seen from the Sun; a smaller fraction appear to reside in a disk that rotates counterclockwise, with orbit normals separated by from the normal to the clockwise disk; and the rest constitute a roughly spherical distribution (Levin & Beloborodov, 2003; Bartko et al., 2009). Yelda et al. (2014) estimate that of young stars are in the clockwise disk and are spherically distributed. The nearly identical ages of the young stars in both disks suggest a common origin, but why is the distribution of orbital planes so complex? Is it possible that this complicated structure can exist in statistical equilibrium, or is this a transient feature?
A further motivation to understand the relaxation processes in dense stellar populations comes from the emerging field of gravitational-wave astronomy. The recent LIGO discovery of gravitational waves from the merger of two black holes has opened a new window on the Universe (Abbott et al., 2016). Most of the massive stars formed in the past history of the Galaxy would by now have turned into neutron stars and stellar-mass black holes. The number density of stellar-mass black holes is expected to be up to a billion times higher in NSCs than in the Galactic field so NSCs may dominate the rate of black hole-black hole mergers detectable by LIGO (O’Leary et al., 2009). An understanding of the expected distribution of black holes in galactic nuclei may influence gravitational-wave search strategies and eventually help to interpret detections.
The main stellar-dynamical processes in NSCs may be ordered as follows. The spherical potential due to the central SMBH of mass ( for the Milky Way) supports eccentric Keplerian orbits with a characteristic orbital time , between and yr for distances (more precisely, semimajor axes) between and 0.5 pc. The gravitational field of the spherical distribution of old, low-mass stars causes in-plane apsidal precession for each eccentric orbit with a characteristic precession time or – yr in this region, where is the enclosed number of stars and is their typical mass. The stellar orbits conserve their angular-momentum vector and orbital energy over timescales of this order. However on longer timescales the time-varying higher multipole moments of the gravitational field of the stars drive chaotic mixing in phase space, through three distinct processes that operate on different timescales (see Figure 1 in Kocsis & Tremaine 2011):
- •
(– yr), the diffusion time of angular-momentum vector directions or orbital planes, called the vector resonant relaxation time,
- •
(– yr), the diffusion time of the magnitude of the angular momenta or eccentricity, called the scalar resonant relaxation time,
- •
(– yr), the diffusion time of energy or semimajor axis, called the two-body relaxation time.
The age of the young stars observed in the Galactic center is 4–6 Myr, long enough that we expect that VRR has strongly affected the distribution of orbital planes, but short enough that the eccentricities and semimajor axes remain frozen at their initial values.
More generally, in most of the volume of NSCs inside the sphere of influence of the central SMBH and , and if these inequalities are satisfied we have the timescale hierarchy (Kocsis & Tremaine, 2011):
[TABLE]
This hierarchy implies that different dynamical properties of the system relax on different timescales and they may reach internal statistical equilibrium independently from one another (see Sridhar & Touma 2016a, b for a mathematically rigorous derivation of some of these results).
In this paper, we examine the equilibrium distribution of orbital planes after the VRR process is completed () but before scalar resonant relaxation begins (). VRR is driven by the gravitational interaction averaged over two much faster motions, the orbital motion around the SMBH and the in-plane precession induced by the spherical distribution of old stars (Kocsis & Tremaine, 2015). Since precessing eccentric stellar orbits trace out axisymmetric punctured disks, the gravitational interaction is to be calculated between two such disks, in which the local surface density is proportional to the residence time of the star at the given location. Since and are much longer than the timescale of interest, the eccentricity and semimajor axis are nearly conserved during this process and so the surface density, as a function of radius, of the punctured disk is fixed, though its orientation is not333During VRR, the relative change in angular momentum is , and the change in energy is . The former sets the change of where is eccentricity and the latter sets the change in semimajor axis since .. This leads to an averaged or effective Hamiltonian that determines the time evolution of the orientations of the orbit normals or angular-momentum vector directions (see Appendix A). The VRR Hamiltonian is given by terms corresponding to pairwise interactions between the punctured disks. Each pairwise interaction is further decomposed into a sum over multipoles. In this decomposition, the coupling constants depend on the conserved quantities of the two disks, mass, semimajor axis, and eccentricity, which are set by their initial distribution. In the terminology of statistical physics mass, semimajor axis, and eccentricity are quenched random variables, so the coupling constants in the Hamiltonian are random matrices as in spin glasses. If the external perturbations of the galaxy on the NSC may be neglected (as we shall assume in this paper), the energy corresponding to the VRR Hamiltonian and the total angular-momentum vector are also conserved.
Our main goal in this paper is to describe the mean-field theory of VRR for the dominant quadrupolar interaction, arguing that the qualitative features of the theory would be similar if higher order multipoles were included. The statistical equilibrium is described by a distribution function in orbit-normal space that extremizes the Boltzmann entropy. As we already noted the quadrupolar mean-field model of liquid crystals, the Maier-Saupe model (Maier & Saupe, 1958; Plischke & Bergersen, 2006), is analogous to the mean-field model of VRR for a one-component cluster in which the bodies have the same masses, semimajor axes, and eccentricities.
The model we investigate in this paper is also reminiscent of the Hamiltonian mean-field model (Campa et al., 2014), which describes the dynamics of identical rigid rotors interacting via a cosine potential. Both are exactly soluble and exhibit phase transitions and other interesting behavior. In both cases the simplicity arises because the interactions of all pairs of bodies are identical, and the dynamical behavior is entirely determined by a few moments of the distribution function.
The remainder of the paper is organized as follows. In Section 2 we formulate the mean-field theory of vector resonant relaxation. In Section 3, we restrict our attention to the special case of a cluster composed of bodies of a single mass, eccentricity and semimajor axis; this restriction simplifies the physics so that we can explore the thermodynamics and statistical mechanics analytically. We find all of the equilibrium solutions in the microcanonical and canonical ensembles; we present the axisymmetric equilibria in Section 5.1, and the non-axisymmetric ones in Section 5.2. In Section 6 we discuss a different ensemble, which we call the -ensemble, in which the system is embedded in a bath with which it can exchange not only energy but also angular momentum. Section 7 contains a brief discussion of how these results can be applied to separable multi-component systems in which the interactions between components are weak. We discuss our conclusions in Section 8.
2. Mean-Field Theory of Vector Resonant Relaxation
We wish to describe the angular distribution of orbits of gravitating bodies bound to a spherically symmetric potential, usually dominated by a central point mass. To this end we adopt the VRR Hamiltonian obtained by Kocsis & Tremaine (2015) and reproduced in Appendix A. This Hamiltonian is derived by introducing a canonical transformation from Cartesian positions and momenta to action-angle variables, called Delaunay variables, and averaging over the rapidly varying angles, the mean anomaly and argument of periapsis (see also Sridhar & Touma 2016a, b, c). To leading order in the ratio of the stellar mass to the central mass, the VRR Hamiltonian is the average potential energy between the annuli or punctured axisymmetric disks that the precessing elliptical trajectories cover over times long compared to the apsidal precession time ( in Section 1). The surface density of the annulus at any point is inversely proportional to the time the body resides at that position during its orbit. Since the semimajor axis and eccentricity are approximately conserved during VRR, these annuli conserve their intrinsic properties (i.e., surface-density distribution, inner and outer radii), but their orientations can vary due to VRR. Since we have averaged over 2 phase-space coordinates and an additional 2 phase-space coordinates are frozen-in (conserved), the relaxation is restricted to a 2-dimensional space, which can be taken to be determined by the direction of the unit vector parallel to the angular momentum of the orbit (i.e., the normal to the annulus), which we denote by . Alternative but less convenient coordinates are the inclination and the longitude of the node of the orbit. In summary, the basic structures of VRR are concentric orbit annuli with distinct masses, inner, and outer radii, which behave as rigid bodies pinned to the dominant central point mass, and precess due to their mutual gravitational torques.
An important property of the annulus, which is conserved during VRR, is its scalar angular momentum
[TABLE]
here we have denoted the mass of each body by , its semimajor axis by , its eccentricity by , and the mass of the central object (e.g., the SMBH in a NSC) by .
We assume that the system is comprised of distinct groups of bodies. The group contains bodies of identical mass , semimajor axis , and eccentricity . We assume the ordering . This model can be thought of as representing an approximate coarse-grained distribution, assuming that the bodies are grouped into bins with similar mass, semimajor axis, and eccentricity. We shall refer to these groups as “components”. Since every orbit of the same component has the same scalar angular momentum , we can visualize our construction in angular-momentum space as follows: the tips of the angular-momentum vectors of different bodies in the same component lie on a thin spherical shell, and since the scalar angular momentum of each body is conserved during VRR, the angular momenta can only move along this spherical shell, so the bodies in a given shell can interact with bodies in other shells but can only relax within their own shell.
We denote the distribution function or number per unit solid angle within shell by . We neglect two-body or higher-order correlation functions (the mean-field approximation).
Let us summarize the simplifying assumptions used in this paper.
The long-term evolution is driven by the mutual gravitational torques between concentric, axially symmetric, time-independent structures, the time-averaged orbits, which change in orientation in response to these torques. This assumption is generally valid for masses which execute circular orbits in an arbitrary spherical potential or general orbits in a smooth spherical potential that is not exactly quadratic or Keplerian. 2. 2.
The multipole expansion of the interaction potential between orbits is truncated at the quadrupole order. The quadrupole is generally the strongest multipole interaction for near-Keplerian orbits, whether radially overlapping or not, but the cumulative effects of higher order multipoles may be significant for radially overlapping orbits (Kocsis & Tremaine, 2015). 3. 3.
There are components, each comprised of bodies with the same scalar angular momentum , and the quadrupole coupling ( in Eq. 8) between any body in component and any body in component is the same. 4. 4.
Multi-body correlations are negligible, so the evolution can be described by a mean-field model. This assumption is expected to be valid if is sufficiently large. 5. 5.
The total number of bodies, the mass of each body, and their total vector angular momentum are conserved, except in Section 6, where the bodies are allowed to exchange angular momentum with a surrounding reservoir. 6. 6.
Either the total energy of the system (i.e., the total gravitational potential energy arising from pairwise interactions of orbits) is conserved or the VRR temperature is fixed (see Section 2.3). These assumptions correspond to the microcanonical and canonical ensembles, respectively.
After presenting the framework for a general multi-component model in this section, we will focus most of the paper on the simple case of a one-component system444In Section 7 we will discuss briefly how these results can be extended to multi-component systems in which the interactions between different components are much weaker than the interactions within a component., which exhibits a rich phase diagram and provides an analytically tractable starting point for comparisons with future numerical studies of multi-component systems.
2.1. Basic definitions
In the following, Greek indices label coordinates and Latin indices label stellar components. We use bold characters to denote 3-dimensional vectors, and normal (non-bold) symbols for their norm (). If we write out the Cartesian coordinates of some vector or tensor which has a stellar component index, we put the component index in parentheses in superscript to reduce clutter, e.g., denotes a Cartesian coordinate of the vector .
The system is characterized by the total number of bodies , the total angular momentum , and the total orbit-averaged interaction energy . In spherical coordinates, the number of bodies in the component with between and is , where is the solid angle, and therefore the number of bodies of the th component is
[TABLE]
where unless otherwise noted the integral is over the full unit -sphere. The total number of bodies is
[TABLE]
The angular momentum of one orbit in the component is
[TABLE]
and the total angular-momentum vector of all bodies in the component is
[TABLE]
The total angular momentum is
[TABLE]
The total interaction energy due to the VRR Hamiltonian up to the quadrupole order is given in Appendix A:
[TABLE]
with and
[TABLE]
The Hamiltonian (8) resembles that of the Maier-Saupe model for liquid crystals (Maier & Saupe, 1958; Plischke & Bergersen, 2006). We assume that are known constant parameters of the model, which are determined by the quadrupole potential energies of interacting time-averaged orbits. For example, for circular orbits
[TABLE]
see Appendix A for the more general case of eccentric orbits in a near-Keplerian potential. The array is symmetric.
Eq. (9) may be rewritten as
[TABLE]
where here and in the following we suppress the summation over the Greek indices (Cartesian coordinates), and we have introduced the traceless quadrupole moment tensor of angular momenta
[TABLE]
where is the Kronecker-. Let us introduce the mean-field sum
[TABLE]
where we define the ensemble average of over component as
[TABLE]
The orbit-averaged energy of a single body in the component with unit normal is, from Eq. (8),
[TABLE]
where we defined
[TABLE]
which is traceless. The total energy is
[TABLE]
The one-body energy (15) has the following properties.
It has inversion symmetry, . 2. 2.
Since has eigenvalues , the average of such matrices has eigenvalues between 0 and 1. Thus has eigenvalues between and ; similarly, the superposition has eigenvalues between and . The energy of each body is confined between these bounds with local minima (maxima) along the eigenvector of associated with its smallest (largest) eigenvalue and along the opposite directions, and saddle points along the eigenvector associated with the intermediate eigenvalue and the opposite direction. 3. 3.
The equipotential curves with fixed are generally ellipses on the unit sphere (i.e., the intersection of the unit sphere with an ellipsoid having the same center), which enclose the local minima and maxima. 4. 4.
The eigenvectors of are fixed points of the time evolution of for any body in component , provided that is constant. The eigenvectors corresponding to the smallest and largest eigenvalue are stable fixed points, and the intermediate eigenvector is unstable. 5. 5.
The time evolution of is integrable if is constant in time.
However the precession of the angular momenta typically changes and in time which usually leads to chaotic evolution. We expect, and shall assume, that this chaotic evolution causes the distribution function to relax toward a state of maximum entropy.
2.2. Statistical equilibrium
We calculate the equilibrium distributions that extremize the Boltzmann entropy (Jaynes, 1965)
[TABLE]
for fixed energy , fixed total angular momentum , and fixed number of bodies in each component . The use of this formula for the entropy in the mean-field approximation can be justified in the general case of the self-gravitating gas (Padmanabhan, 1990; Katz, 2003) (see also Miller 1973; Sormani & Bertin 2013 for critical analysis), and also for VRR equilibria as we show in Section 3.1.
We use the method of Lagrange multipliers. For perturbations about the equilibrium distributions, the extremum must satisfy
[TABLE]
where , , and are Lagrange multipliers corresponding to the constraints and denotes the first-order variation with respect to . We may identify the Lagrange multiplier with the inverse temperature , while is the thermodynamic variable conjugate to with respect to entropy. We emphasize that in this paper “temperature” refers to the VRR temperature, which is the inverse Lagrange multiplier enforcing conservation of VRR energy in Eq. (19) when maximizing the Boltzmann entropy; thus . It is also the inverse derivative of the Boltzmann entropy with respect to VRR energy for a series of equilibria.
For the quantity enforcing the conservation of total angular momentum, it will later prove to be convenient to replace by , which we define as
[TABLE]
where has units of angular velocity, and we refer to it as “rotation”555For an ideal gas, describes rotation in the sense that the mean velocity of the gas at any position is . However, in VRR is not simply related to the mean angular velocity. (see Section 6, and Votyakov et al. 2002b, a; de Martino et al. 2003). We will use this quantity to illustrate the analogy with paramagnetism and spin systems.
From the definition of , , and , we get to first order (for the second-order variation see Appendix C)
[TABLE]
where is the equilibrium distribution function. Eq. (19) becomes
[TABLE]
Since the variations are independent, the quantities in parentheses must vanish, which implies
[TABLE]
Using the constraint on , Eq. (3), we may eliminate to get finally
[TABLE]
where depends on the orbit normal through the tensor as given by Eqs. (12), (13), and (15). In the following, we drop the subscript “eq” for brevity.
Eq. (13) gives the self-consistency equations
[TABLE]
where is defined in Eq. (16). These self-consistency equations are subject to the constraints (7) and (17) for given and , which determine and . These equations define the equilibria of the system and may be solved numerically, or in many cases analytically (see Appendix E).
2.3. On inequivalence of statistical ensembles
In statistical physics the term canonical ensemble is used to describe a system in equilibrium with a large heat reservoir having a fixed temperature. A microcanonical ensemble, on the other hand, describes an isolated system with constant total energy. The two ensembles are equivalent in the thermodynamic limit for systems that are governed by so-called “short-range” interactions (Campa et al., 2014). The essential property of short-range systems is that they can be divided in an arbitrary manner into subsystems whose mutual interaction energy can be neglected with respect to the total energy of the system. Thus the subsystem energies are additive (Padmanabhan, 1990).
For systems with long-range interactions, such as the self-gravitating gas, the two ensembles may be inequivalent. The inequivalence leads to different stability properties for the two ensembles (see also Appendix C). A second condition must also be met for inequivalence to appear666More specifically, inequivalence may appear in regions where the entropy is not a concave function of energy for a series of equilibria.: the existence of a first-order phase transition in the canonical ensemble (Ellis et al., 2000; Barré et al., 2001; Ellis et al., 2004; Bouchet & Barré, 2005; Touchette & Ellis, 2005). The phase transition in the canonical ensemble is replaced by a stable region with negative specific heat in the microcanonical ensemble (Lynden-Bell & Wood, 1968; Padmanabhan, 1990; Campa et al., 2014). In the canonical ensemble the phase transition takes place in an out-of-equilibrium process, where the system absorbs or emits the whole latent heat from the heat bath needed for the transition. The two phases cannot coexist in equilibrium because the mixed phase has a higher free energy than a single pure phase in the canonical ensemble due to the interaction energy (Chavanis, 2006; Campa et al., 2014), and there cannot be a phase separation. In other words, there are negative specific heat equilibria in the phase transition region which are therefore unstable in the canonical ensemble777A system with negative specific heat cannot be in equilibrium with a heat bath. Any energy loss of the system to a colder heat bath will increase its temperature, making it lose even more energy, while any energy absorption of the system from a hotter heat bath will make the system colder causing a further energy absorption.. However, these equilibria are stable in the microcanonical ensemble, and the system can lie in any of those configurations between the two phases.
In the case of inequivalence, the equilibrium state depends on whether the system is under conditions of constant temperature (external environment has a strong influence which resembles a heat bath) in this state, in which case we refer to the canonical ensemble, or under conditions of constant energy (isolated system), in which case we refer to the microcanonical ensemble.
The self-gravitating gas is generally non-additive and presents inequivalence of ensembles (Padmanabhan, 1990; Campa et al., 2014). On the other hand, in VRR, the spatial and velocity distributions of bodies are not determined by the relaxation process, apart from the orientation of the orbits, leaving open the possibility for specific distributions to be approximately additive with respect to VRR energy. However, in more typical cases, VRR in a multi-component system is non-additive. For the one-component systems that are the focus of this paper, VRR is not only non-additive, but also presents a first-order phase transition and therefore inequivalence of ensembles. For this reason the microcanonical and canonical ensembles will be studied separately.
3. One-component systems
In the next three sections, we restrict ourselves to one-component systems; these are the simplest VRR systems yet they exhibit a remarkably rich phenomenology. The quadrupole mean-field VRR Hamiltonian of the one-component system is equivalent to the Maier-Saupe model of liquid crystals, which allows us to follow standard textbooks on the subject (Plischke & Bergersen, 2006)888The main difference is that we also require the conservation of total angular momentum.. First, we write down the one-component version of the relevant equations of Section 2, in particular the mean-field quadrupole moment, the total energy, the total angular momentum, the distribution function, and the self-consistency equation, and evaluate the entropy and the free energy. We solve the resulting system of equations in the following section.
We adopt a coordinate basis in which the axes are aligned with the eigenvectors of . In the next subsection 3.1 and Appendix D we show that at equilibrium and are parallel to one of these eigenvectors. Therefore, without loss of generality, we may choose the third coordinate axis to lie along . In this case we may write
[TABLE]
where we define
[TABLE]
Here are spherical coordinates in orthogonal axes aligned with the eigenvectors of and . We have introduced the angular-momentum vector direction coordinates , which are proportional to the real spherical harmonics999Specifically, and , where are real spherical harmonics, which form an orthonormal basis on the sphere, .. These coordinates are defined in the region
[TABLE]
From Eq. (15), the mean-field potential energy of a single body is simply
[TABLE]
All of our results are parameterized by , so its value need not be specified for this analysis. For concreteness is given for general near-Keplerian orbits in Appendix A, and for circular orbits Eq. (10) implies
[TABLE]
At equilibrium, the total energy from Eq. (17) is
[TABLE]
In Appendix B we show that the extrema of the entropy have the property that the Lagrange multiplier and the total angular momentum are parallel. Without loss of generality we may label the direction of as the positive -axis. Letting
[TABLE]
we get and the total scalar angular momentum from Eq. (6) is
[TABLE]
Thus, is the dimensionless total angular momentum of the system normalized to the configuration in which all angular momenta are aligned. This satisfies , where represents a razor-thin disk in physical space with all bodies orbiting counterclockwise as seen from the positive -axis, and represents configurations with an equal net angular momentum in clockwise and counterclockwise orbits.
The parameters and are bounded by the inequalities101010These follow from the bounds (since are Cartesian coordinates of a unit vector) and (due to the Cauchy–Schwarz inequality).
[TABLE]
and the total energy is bounded by the inequalities
[TABLE]
The mean-field equilibrium distribution function (27) is written as
[TABLE]
In these equations the angular dependence of is implicit in , , and . We now evaluate and using their definitions in Eqs. (33) and (34). In so doing we use the identity and simplify the integrals over the azimuth angle using modified Bessel functions
[TABLE]
The self-consistency equations (33)–(34) become
[TABLE]
where the arguments of the Bessel functions are . The moments and are the order parameters of the system. Configurations with are axisymmetric around the -axis while configurations with are axisymmetric around the - and -axes, respectively (recall that the positive -axis points along the total angular-momentum vector). Therefore may be regarded as a measure of the deviation from axisymmetry around the angular-momentum axis. Configurations with are isotropic.
The entropy of equilibrium states is calculated from Eqs. (18), (33), (34), and (45) to be
[TABLE]
From now on we drop the constant term. In this equation
[TABLE]
The argument of the Bessel function is given below Eq. (48). The total energy and angular momentum in Eq. (49) are given in Eqs. (40) and (42), respectively.
In what we call the canonical ensemble, the system can exchange energy but not angular momentum with a reservoir. The corresponding thermodynamic potential is the Helmholtz free energy . Using Eq. (49) we get
[TABLE]
In this case, the parameter appearing explicitly here and implicitly in (see Eq. 50) is to be determined by Eqs. (42) and (45). In what we call the -ensemble (see Eq. 20 and Section 6), the system can exchange both energy and angular momentum with a reservoir. Then the variables and conjugate to and are conserved during the evolution of the system. The corresponding thermodynamic potential, analogous to the Gibbs free energy, is
[TABLE]
Typically we would like to solve the system of equations (47)–(48) and (42) for , , and , given values of the angular momentum , and the energy (for the microcanonical ensemble) or temperature (for the canonical ensemble). It is helpful to observe that , , and only appear in the combinations (see Appendix E)
[TABLE]
Thus the ratio of Eqs. (47) and (48) gives
[TABLE]
where the argument of the Bessel functions is . Moreover the angular momentum constraint (42) can be written as
[TABLE]
This last equation is a monotonic and therefore invertible function of for given and , mapping onto the interval . Thus for any triple , we can solve this numerically, and substitute the resulting value of into Eq. (54) to get a relation between and , although this relation may have multiple solutions for at a given value of . For each pair we can evaluate the temperature, energy or other quantities using Eqs. (47), (53), and (40). Once we have determined the value(s) of corresponding to the desired temperature, the order parameters and the entropy, and the free energies follow from Eqs. (53), (49), (51) and (52. In Appendix E we simplify the integrals analytically in Eqs. (47), (50), and (55) for axisymmetric states (see Eqs. E15–E18) and derive the asymptotic behavior of macroscopic variables.
3.1. Positive-temperature equilibria
In this Section we calculate the positive-temperature VRR mean-field free energy (51) as the steepest-descent approximation to the partition function and we describe the equilibrium states and their stability properties. The negative-temperature equilibria are discussed in Section 3.2 and Appendix C.
We work in the TN-ensemble, that is we assume that the system is embedded in a heat bath with constant temperature and rotation , with which it can exchange VRR-energy and angular momentum but not bodies. Therefore, in this ensemble , and are held fixed (see also Section 6). The partition function is
[TABLE]
strictly the sum should not contain terms with but for any so these only contribute a constant factor to the partition function. Note that (i) the Latin indices run through different bodies in the single-component case and not through different components as in Section 2; (ii) for clarity, in this Section we explicitly write out sums over Cartesian coordinates indicated by Greek indices.
We will use the Hubbard–Stratonovich method to calculate this integral. We may write the first sum in Eq. (3.1) as
[TABLE]
Now let be an arbitrary matrix. For and we can write
[TABLE]
which can be shown by completing the square of on the right-hand side. Then, Eqs. (3.1) and (58) become
[TABLE]
Here we have used the fact that the integrals over are separable and the integral is over the unit sphere, . We have also used the definition (12) and in the last line we introduced
[TABLE]
In this formula we identify as the order parameters.
We now specialize to the thermodynamic limit . We note that the integral in Eq. (60) is invariant as for fixed as long as is rescaled as . With this rescaling is independent of . But since the integrand in (59) is , this can be evaluated asymptotically for large by the method of steepest descent. We have
[TABLE]
The equilibrium free energy is given by so by dropping the fractional correction of order we have
[TABLE]
Note that the extrema of occur when , and these conditions give exactly the self-consistency equations (28), that is, the same distribution function as specified by use of the Boltzmann entropy (18).
In Appendix D we show that a coordinate system aligned with can always be rotated in such a way that the off-diagonal elements of are zero at equilibrium and the system is stable with respect to perturbations in the off-diagonal elements in this coordinate system. Therefore we can finally write
[TABLE]
where
[TABLE]
which gives exactly Eq. (52).
More generally, a global minimum of free energy with respect to the order parameters is a stable equilibrium, while a local minimum is metastable. Saddle points and local/global maxima are unstable equilibria. A similar result holds for the free energy of the canonical ensemble , given by
[TABLE]
where and now is determined implicitly by the requirement that the total angular momentum is .
The top left panel of Figure 1 shows the free energy in the canonical ensemble as a function of the order parameters and for at a low temperature, . Points – are the five distinct equilibrium configurations found at this temperature; the corresponding angular-momentum unit vector distribution functions are shown below the panel. The dashed lines represent axisymmetric configurations. In particular,
- •
, and are axisymmetric equilibria. and are non-axisymmetric (even though they lie close to the dashed lines).
- •
is stable since it is the global minimum of the free energy. We call this the “ordered phase” of the system or “uniaxial”, since it possesses one axis of symmetry (following the terminology of liquid crystals; Gramsbergen et al. 1986; Luckhurst & Sluckin 2015).
- •
is a saddle point, which is unstable with respect to perturbations that are not axisymmetric with respect to the -axis.
- •
is a local maximum and hence an unstable equilibrium at this low temperature; however it is stable at higher temperature or at negative-temperature. It is nearly isotropic and we call it the “disordered phase”.
- •
is a saddle point and therefore an unstable equilibrium.
- •
is a local minimum and hence a metastable equilibrium. We call this state “biaxial”, since at low temperatures the bodies are organized into two distinct disks with different symmetry axes (Section 5.2). The state is long-lived for a significant range of temperatures and total angular momenta.
The top right panel of Figure 1 shows the canonical free energy at high temperature for the same mean angular momentum, . All equilibria merge at the point labeled which represents a stable disordered phase.
The geometry of the equilibrium states may be understood from the bottom panels in Figure 1, or from the eigenvalues of which from Eqs. (13) and (3)–(36) are , , and . For example, a razor-thin disk with all angular momenta pointing along the positive -axis has distribution function , where is the unit vector along the -axis, and so has eigenvalues . In contrast, an isotropic distribution has and eigenvalues . For the stable equilibrium in Figure 1 the eigenvalues are . In physical space, this represents a relatively thin, axisymmetric disk containing both prograde and retrograde bodies, oriented perpendicular to the -axis. The metastable equilibrium has eigenvalues which in physical space represents two disks with mutual inclination (see Eq. 71 below).
3.2. Negative-temperature equilibria
We remark that the internal energy given by Eq. (40) is bounded from above (Eq. 44). It is known that in this case, negative-temperature equilibrium configurations may exist for isolated systems (Ramsey, 1956; Pathria & Beale, 2012). Negative absolute temperature states were first introduced by Onsager (1949) in a statistical approach to turbulent flow aiming to explain long-lasting large vortices. This phenomenon may also arise in quantum systems with an upper energy bound; such a bound may arise for some degrees of freedom as long as they evolve in isolation from the motional degrees of freedom, as in the case of nuclear spins. Indeed, negative-temperature configurations were first produced in the laboratory in nuclear spin systems (Purcell & Pound, 1951; Ramsey & Pound, 1951; Ramsey, 1956). The spin-spin relaxation timescale is about six orders of magnitude smaller than that of the spin-lattice interactions in the system of Purcell & Pound (1951). A negative spin-temperature state was achieved for the spinorial degrees of freedom by cooling the system to very low temperature in the presence of a magnetic field and then rapidly reversing its orientation. The system maintained negative spin temperature for several minutes until relaxation of the motional degrees of freedom restored positive temperature (Ramsey, 1956).
Recently, a negative-temperature state was achieved in the laboratory for the motional degrees of freedom of ultra-cold quantum boson gases (Braun et al., 2013). This achievement triggered a debate (Schneider et al., 2014; Hänggi et al., 2015; Frenkel & Warren, 2015; Campisi, 2015; Cerino et al., 2015; Poulter, 2016), initiated by Dunkel & Hilbert (2014), on whether negative absolute thermodynamic temperatures are observable after all or maybe the Boltzmann’s entropy formula should be abandoned in favor of Gibbs’ entropy, which allows only positive temperatures. However, the arguments in favor of the validity of Boltzmann’s formula are more convincing at least for the case of ensemble-equivalence where it was proven recently that Boltzmann’s formula is appropriate and negative temperatures do occur (Buonsante et al., 2016).
Negative-temperature states in nuclear spin systems present a close analogy with the quadrupole VRR system discussed here. The VRR degrees of freedom, the unit normals , are similar to spinorial degrees of freedom. Orbits averaged over a precession period represent fixed density annuli which are characterized by their normals and relax similarly to spin vectors in quantum mechanics; the difference is that the interaction is a sum of terms proportional to for VRR and for spin systems. The analogy goes even further since , defined in Eq. (20), enters the distribution function through similar to a paramagnetic term, .
Negative-temperature equilibria also arise for an isolated thin circular disk of bodies undergoing VRR (Kocsis & Tremaine, 2011). That model is recovered in the limit of the model presented here. With this perspective, negative-temperature equilibria arise naturally for VRR.
At negative temperature, we find a single solution to the self-consistency equation (47)–(48) at fixed angular momentum . In Appendix C, we show that these negative-temperature equilibria are stable in both the canonical and microcanonical ensembles. These states are always axisymmetric (Appendix E.2), and they are similar to the large positive-temperature disordered state shown in the top right panel of Figure 1 although negative-temperature states always exhibit a population inversion: the occupation number is an increasing function of one-body VRR-energy (Eq. 38).
Before closing this section, let us introduce some quantities for further use. We denote by the number of orbits with angular-momentum vectors on the hemisphere (i.e. prograde orbits with respect to ) and by the number of orbits with (retrograde orbits with respect to )
[TABLE]
We also introduce the expectation value of a quantity over bodies on the hemisphere as
[TABLE]
and similarly for the hemisphere. In particular, we shall use to denote the mean angular momentum of the prograde and retrograde orbits with respect to the -axis.
4. Zero angular momentum
It is instructive to start with the configurations having zero total angular momentum, in Eqs. (42)–(45). In the one-component case, this is completely equivalent to the Maier-Saupe model of liquid crystals.
Figure 2 shows the free energy as a function of the order parameters and at the same temperature as in the left panel of Figure 1. We may observe a similar structure as in that Figure with correspondence , , , , . The main difference is that all equilibria are exactly axisymmetric ( or , see Eq. 3). The primed and double primed states may be obtained by a rotation of the unprimed distribution.
The left panel of Figure 3 shows the order parameter for a series of axisymmetric equilibria as a function of temperature (only positive temperatures are shown). There are three zero-temperature states , and with , , and 0. The state corresponds to the isotropic distribution function
[TABLE]
which satisfies the self-consistency conditions (42), (48), and (47) when for any inverse temperature ; thus is identical to . The state has , and since this is the maximum allowed value of by Eq. (37) all bodies in the system must have . In other words all of the orbits have unit normals so they form a razor-thin disk in the - plane with equal numbers of prograde and retrograde orbits and energy . This is the most extreme configuration of what we call the ordered phase. The state has and since this is the minimum allowed value of all bodies must have , that is, their orbit normals lie in the equatorial plane . The azimuths of the orbit normals are uniformly distributed. These three states are respectively analogous to , , and in Figure 2 for zero temperature, which indicates that is stable, while and are unstable. A mathematical description of these states is given in Appendix E. The right panel of Figure 3 shows the mean angular momenta of the clockwise and counterclockwise orbits respectively, as defined in Eq. (67). As we have already shown, the stable zero-temperature state corresponds to a razor-thin disk with equal numbers of bodies orbiting clockwise () and counterclockwise (). At higher temperatures most of the bodies remain in a disk, which thickens as the temperature increases. In addition a few bodies are found at higher inclinations forming a “halo” as shown in Figure 4. As Eq. (68) implies, the disordered phase corresponding to is isotropic at all temperatures for zero total angular momentum ().
Figure 3 marks the states , , , and , where stability properties change in the canonical ensemble, as shown in Figures 5 and 6 and in Appendix E.1111111For the case of zero total angular momentum discussed here, , , , and are identical: they all represent the isotropic configuration, . However, these states are distinct for nonzero angular momentum . The isotropic disordered state is unstable at low temperature, but in the canonical ensemble it becomes metastable for temperatures in the range where and . These states become stable for . The ordered phase (branch ) becomes metastable in the canonical ensemble for temperatures in the range , where . At , the system suffers a first-order phase transition from the ordered phase to the disordered phase . At the transition point , , and , while at , , , and . The distribution functions at and are shown in Figure 4.
To understand the nature of the phase transition, let us imagine slowly heating a one-component system from low temperature across the phase transition in the canonical ensemble121212This would occur, for example, if the system interacts with massive distant objects with a nearly isotropic angular-momentum distribution.. At low temperatures, the equilibria along the sequence starting with are minima of the free energy and therefore stable, while those along the sequences starting with and are saddle points and maxima, respectively, and therefore unstable. A comparison of Figures 2, 3, and 5 shows how the equilibria along these sequences change as the temperature increases: the sequences starting at and move toward the isotropic configuration, in the sense that the disk thickens and decreases for both. At , the unstable sequences starting from and intersect. As the temperature increases across , the equilibria on the isotropic sequence that started at change from maxima to minima of the free energy so the sequence becomes metastable, while the sequence starting from remains unstable. The stable ordered sequence that begins at remains stable until the temperature increases to . At , the free energies of the ordered and disordered states, and in Figure 3, become equal. When the temperature is increased above , a first-order phase transition occurs in the canonical ensemble, in which the isotropic disordered state becomes stable and the ordered state becomes metastable. As the temperature continues to increase, the sequences that began at and approach the same state, and they eventually coincide at , For there is no equilibrium other than the isotropic disordered state, which is stable.
The behavior near the phase transition is explored further in Figure 5. The left panel shows the free energy as a function of the order parameters and at the temperature of the phase transition, . Since the equilibria are all axisymmetric, we can set and plot the free energy as a function of one order parameter , which we do in the right panel. The free energy is shown for three temperatures. The bottom curve is for temperatures between and , where the minimum at corresponds to the stable ordered sequence starting at , the minimum at corresponds to the metastable disordered sequence starting at , and the intervening maximum corresponds to the unstable sequence starting at . The middle curve is at the temperature of the phase transition, where the free energies at and are equal, and the upper curve is for temperatures between and .
Figure 6 shows near the phase transition. At , the free energy is the same for the ordered and disordered phases ( in the left panel), but the energies of these states are different ( and in the right panel). This panel shows the caloric curve and Maxwell’s construction for the first-order phase transition. In a nonadditive system multiple phases cannot coexist in equilibrium (see Section 2.3) and so the dotted line in the right panel of Figure 6 is unphysical. However, phase separation can occur in separable multi-component systems, as described in Section 7; in this case individual components are either in or in but the full system can lie anywhere along .
This analysis is different in the microcanonical ensemble for a one-component system, i.e., an isolated system under conditions of constant energy. In that case, we find that the series of equilibria represent the highest entropy states at fixed energy, while the states along and at fixed energy have an entropy minimum. Thus, the branch is stable in the microcanonical ensemble, and a continuous second-order phase transition takes place at point in the microcanonical ensemble at temperature (see Figure 6). Branches and are unstable in both canonical and microcanonical ensembles.
5. Non-zero angular momentum
Let us turn to the more general case of a one-component system with non-zero total angular momentum, in Eq. (42). In this case, the angular momentum constraint can only be satisfied if the Lagrange multiplier is non-zero, which introduces a factor in the distribution function (Eq. 45). This is similar to the effect of a paramagnetic term in the Hamiltonian, and hence the distribution function, of a spin system in a magnetic field, where the role of magnetic field is played by , the magnetic moment is replaced by and the spin is replaced by .131313This is different in the Maier-Saupe model of liquid crystals, where an external magnetic field gives rise to a term proportional to where describes the orientation of the molecules (Wojtowicz & Sheng, 1974; Palffy-Muhoray & Dunmur, 1982)..
We work in a coordinate system in which the total angular momentum is parallel to the positive -axis. The equilibrium distribution function may be either axisymmetric around the -axis (e.g., , , in Figure 1) or else non-axisymmetric, with three distinct eigenvalues (i.e., and as in , in Figure 1). We study these cases separately.
5.1. Axially symmetric equilibria
Let us consider first axisymmetric configurations (e.g., , , in Figure 1). These constitute the most important cases, since global free energy minima at fixed temperature and global entropy maxima at fixed total energy are always axisymmetric as we will see below. We derive analytic expressions for the order parameters valid at any temperature and angular momenta in Appendix E.2.
Figure 7 shows the order parameter with respect to temperature for axisymmetric equilibria, at several values of the dimensionless total angular momentum. For small values of , three axisymmetric ground states exist at zero temperature, similar to the case with shown in Figure 3. The stable ordered state has at , for any . This state is a razor-thin disk in physical space, with a fraction of prograde orbits equal to . As increases, at moves toward and they merge at . This behavior is explained using the asymptotics of the partition function in Appendix E. Figure 8 shows the mean angular momentum and the ratio of prograde or retrograde orbits at the critical total angular momentum, . This is the maximum total angular momentum for which a phase transition occurs.
The first-order phase transition in the canonical ensemble presented in Figure 5 for is similar for any . As is increased, the jumps and shrink, and above there is no phase transition. This is illustrated in Figure 10, which shows the region in energy-angular momentum space that corresponds to the phase transition.
On the critical curve , the phase transition in the canonical ensemble becomes second-order. Figure 9 shows the order parameter with respect to temperature for several values of angular momentum below and above the critical value, and the free energy and the caloric curve are shown in Figure 11. The critical point is labeled with on the critical curve. In the left panel of Figure 9 we also plot the curve defined by the phase transition points (dashed line), the “coexistence curve”141414Although in the one-component model there cannot be phase coexistence in equilibrium, see Section 2.3., which is found empirically to follow a parabolic rule
[TABLE]
where is the temperature of the corresponding first-order phase transition and the temperature at the critical point. The right panel of Figure 9 shows the temperature-angular momentum phase diagram. In the vicinity of the critical point and on the critical curve, where the phase transition becomes second-order, we get the scaling
[TABLE]
These scalings are the ones predicted by mean-field theory (Stanley, 1987; Papon, 2002) and the critical point as described above is completely analogous to the critical points displayed not only by liquid crystals, but also by ferromagnetic and liquid-vapor systems (Stanley, 1987; Wojtowicz & Sheng, 1974; Palffy-Muhoray & Dunmur, 1982). The thermodynamic quantities , , of the one-component quadrupole VRR system discussed here correspond to , and , respectively, for liquid crystals in a magnetic field (see Palffy-Muhoray & Dunmur 1982 for the definitions), or , and for ferromagnets, or , and for liquid-vapour systems (see Stanley 1987).
The first-order phase transition shown for in Figure 11 is qualitatively similar to that for shown in Figure 6. Figure 12 (left panel, black curve) shows that the disordered phase is nearly isotropic (for all for which the phase transition is defined), while the ordered phase corresponds again to a disk+halo structure in physical space (cf. Figure 4 and see Section 3.1). In the ordered phase the system consists of a disk with two components, rotating in opposite directions, and a dilute halo. Due to our definition of the coordinate system, more bodies have than as shown by the asymmetry of the curve in the left panel of Figure 12. For high values of the system is a thin disk rotating in practically one direction only as depicted in Figure 13. Figure 14 shows the distribution function for states along the unstable branches.
For non-zero angular momentum, there is no phase transition in the microcanonical ensemble, but the system passes continuously from the ordered to the disordered phase as the total VRR energy is slowly increased. The caloric curve is shown in Figure 15 for the microcanonical ensemble, while the entropy with respect to energy is presented in Figure 16. The equilibrium configurations are identical in the microcanonical ensemble to those in the canonical ensemble apart from the phase-transition region of the canonical ensemble. In both ensembles, the system presents a disk+halo structure at low temperature and a nearly isotropic structure at high temperature for low total angular momentum. However, there are no stable configurations in the phase-transition region in the canonical ensemble between and and the system passes discontinuously from the one phase to the other, while in the microcanonical ensemble the corresponding configurations (branch of Figure 11) are stable.
5.2. Non-axisymmetric equilibria
In Figure 1, we have seen that VRR free energy extrema exist with and (, , and their rotations , ). For , these states represent non-axisymmetric configurations in the sense that the tensor has three distinct eigenvalues (see Eq. 3).
Figure 17 shows the order parameters of the non-axisymmetric equilibria as a function of temperature, for four values of the total angular momentum. For and sufficiently low temperature, there are two equilibria shown by solid and dashed lines, which correspond respectively to and in Figure 1. The solid lines represent local free-energy minima which are therefore metastable, and the dashed lines represent free-energy saddle points which are unstable (see Figure 1).
Figure 18 shows the two-dimensional probability density distribution of the orbit normals for the non-axisymmetric equilibria. When the angular-momentum vectors are concentrated near a single direction, as in the left panels, the distribution resembles a disk in physical space in which each object orbits in the same sense. Thus, in physical space, the metastable non-axisymmetric equilibria in the left panels are comprised of two misaligned but otherwise identical disks with axes along and ; half of the bodies are in each disk. The mutual inclination of the disks is approximately
[TABLE]
We call these “biaxial” equilibria.
Let us now perform a thought experiment in which we heat the system, and visualize how the distribution of angular momenta changes. At , the metastable state is (analogous to in Figure 1), consisting of two razor-thin disks in physical space with angular momenta pointing along directions 151515This configuration is metastable up to a global rotation of the coordinate system since the maximum entropy equilibrium state does not specify the direction of the eigenvectors of perpendicular to . Integrating Hamilton’s equations of motion for the VRR system shows that the two disks precess uniformly around the total angular-momentum vector (Kocsis & Tremaine, 2015).. As we heat the system, we increase the scatter of the angular-momentum vectors, or in physical space, the thickness of the disks. As the temperature continues to increase, the scatter of the angular-momentum vectors eventually becomes so large that the two angular-momentum direction distributions become connected, forming an inhomogeneous thick arc (or possibly a complete ring for small ) in the – plane with two density maxima. Eventually, above some maximum temperature , only axisymmetric states exist. A second non-axisymmetric sequence begins at with a razor-thin arc along the – plane, which is the unstable ground state analogous to in Figure 1. As the temperature increases the arc thickens. At point in Figure 17, the distribution function becomes bimodal and continues to thicken, until it finally connects with the sequence at .
Figure 17 shows that for any fixed total angular momentum, non-axisymmetric equilibria only exist below a maximum temperature corresponding to point in the figure. For both metastable and unstable configurations are allowed in the canonical ensemble. decreases with increasing or . These equilibria are similar to161616 and are equivalent in the limit , and similarly for and . the and branches of zero angular momentum axisymmetric equilibria for shown in Figure 3. At this maximum temperature an instability171717 The term “gravitational zeroth order phase transition” has been applied to analogous instabilities of the self-gravitating gas in de Vega & Sánchez (2002a) where it is used to describe the collapse of classical Newtonian self-gravitating gas and has been used in Chavanis (2002) to describe the condensation of a fermionic Newtonian self-gravitating gas.
arises in the canonical ensemble, which is also evident from the caloric curve shown in the left panel of Figure 19, and the system’s transition to the axisymmetric phase. Indeed, the left panel of Figure 19 shows that the stable axisymmetric equilibrium has lower free energy than the metastable non-axisymmetric equilibrium of the same temperature and angular momentum. Similarly, Figure 20 shows that axisymmetric equilibria have higher entropy than non-axisymmetric equilibria of the same energy and angular momentum.
The left panel of Figure 21 displays the region in the VRR temperature and total angular momentum space where biaxial equilibria exist. The maximum biaxial temperature is a decreasing function of , and there are no biaxial states above . The value of can be derived analytically by using the distribution function in angular-momentum direction space shown in Figure 18, and exploiting the fact that at . At , we may assume that the bodies are arranged in razor-thin disks with normals and :
[TABLE]
In particular, the biaxial state has with
[TABLE]
To determine whether this state is stable or unstable to splitting each peak of the distribution function to two peaks centered at slightly smaller and larger , we examine if the energy of those configurations is lower or higher than that of using Eqs. (72–76). At fixed , , and , we find that these perturbations result in an energy decrease exactly if corresponding to . Hence the mutual inclination between the two axes in the zero-temperature biaxial state must be larger than for metastability. We conjecture that is metastable at higher inclinations and smaller against arbitrary perturbations of the distribution function181818We find stability against a variety of perturbations including more general bifurcations into four modes with , for where and for the additional modes with arbitrary..
A rough estimate of the lifetime of the biaxial metastable states is obtained by assuming that the transition probability for each body in a unit VRR relaxation time (see Section 1) is of order , where is the free-energy barrier between the metastable biaxial state and the stable axisymmetric state . To find , note that the transition from to requiring the least free energy passes through , the unstable non-axisymmetric state (see Figure 1). In the most conservative estimate (Chavanis, 2005), the transition occurs when all objects climb the barrier coincidentally during their thermal motion, which happens in a time
[TABLE]
The right panel of Figure 21 shows . Based on this simple estimate, the lifetime of the biaxial metastable states increases exponentially with (recall that at fixed values of the order parameters and coupling constant and ), and thus the metastable state is very long-lived for . In the opposite extreme, the transition to the state could occur gradually with different objects jumping over the barrier in succession. This happens over time , which is much shorter. Numerical simulations are needed to determine the correct scaling with .
6. -ensemble
In Section 3.1 we introduced the -ensemble. The system is assumed to be in a heat bath, a much bigger system that surrounds our small one-component system, with which it can exchange VRR-energy and angular momentum but not bodies. Thus in this ensemble the temperature , the number of bodies , and the rotation parameter (Eq. 20) conjugate to angular momentum are held fixed. The generalized thermodynamic potential of this ensemble is given in equation (52), We focus our attention to the axisymmetric case here. We derive a parametric solution to the self-consistency equations in Appendix E.2. Similar to the canonical ensemble, we find a phase transition, but the values of the order parameters at the transition points are modified as shown in Figure 22 (cf. Figures 9 and 10). A first-order phase transition occurs for ; the transition becomes second-order for . This behavior resembles the canonical ensemble but the critical values are different with , and as depicted in Figures 22 and 23.
7. Separable multi-component systems
Let us define a separable multi-component system to be a system in which the energy of each individual component is dominated by its self-energy and the small interaction between components leads to thermodynamic equilibrium among different components. More specifically, we require that for all in Eq. (15), which is typically191919This does not hold if for some . satisfied if
[TABLE]
In such a system each individual component is subject to the one-component model developed above and thus the analysis (including all of the figures) applies to each one of them. In the special case where the mutual interaction of a particular component with all other components is exactly zero then this component behaves as an isolated system and it may have a different temperature from the rest of the system in equilibrium in a microcanonical ensemble. Otherwise, if the mutual interactions between components are small but non-zero, then different components settle at a common temperature and . However note that for each component, the equilibrium distribution function (Eq. 45) depends explicitly on the dimensionless quantities
[TABLE]
as shown in the figures above. Thus, different components may exhibit different phases (uniaxial, biaxial, or disordered) in equilibrium, depending on their respective values of and at the same and .
In the special case where for all and and either or for all and , all components’ phase transition occurs at the same temperature. Then depending on the microscopic initial conditions, some components lie in the ordered and the rest in the disordered phase. In this sense, during the phase transition, different phases coexist.
In multi-component separable systems with radially non-overlapping components, those with larger values of have smaller and therefore form thinner disks than components with lower (see Figures 3 and 7). In particular, if the radial number density follows and the stellar mass distribution is independent of radius then and (see Eq. 79). Thus if the radial number density is steeper than , as it is for the massive stars in the Galactic center (Bartko et al., 2009; Yelda et al., 2014), then the fractional thickness of the equilibrium disk increases outwards (i.e., the disk flares) and the distribution may be nearly isotropic (disordered) beyond a transition radius. Conversely, if the number density is shallower than then the disk thickness increases inwards and the system is nearly isotropic inside some transition radius. Similarly, higher mass objects form thinner disks. We emphasize however that the multi-component system observed in the Galactic center is probably non-separable and higher order multipoles beyond the quadrupole are likely to play a significant role in the dynamics. We leave further study of multi-component systems to future work.
8. Conclusions
In self-gravitating systems subject to a dominant central potential, such as that of a massive central object, the bodies typically follow bounded planar orbits about the center. Due to rapid in-plane precession, each planar orbit can be represented by an axisymmetric surface density profile (over timescales given by , see Eq. 1). On longer timescales the angular-momentum directions, which describe the orientation of the orbital planes, relax and attain thermal equilibrium long before other degrees of freedom do (see Section 1). This process is called vector resonant relaxation (VRR). We determined the mean-field thermodynamic VRR equilibrium states using the quadrupole approximation for the gravitational interactions between orbits. For a list of our assumptions see the introduction to Section 2. The equilibria exhibit a remarkable variety of behavior including three qualitatively different VRR-phases:
- (i)
a uniaxial “nematic” ordered phase which represents a disk in physical space containing both prograde and retrograde orbits, surrounded by a dilute halo (Figures 12 and 13); 2. (ii)
a biaxial “nematic” ordered phase consisting of two disks in physical space; the two disks have the same thickness and mass and their mutual inclination is between and (Figure 18); 3. (iii)
an axially symmetric disordered phase representing a nearly isotropic distribution of orbits in both physical space and angular-momentum space (Figure 12).
The term “nematic” highlights the analogy with liquid crystals, where the molecules specified by their symmetry axes are concentrated towards a symmetry axis , in the sense that is preferentially parallel or antiparallel to . In both liquid crystals and the quadrupole VRR model the Hamiltonian is invariant under the inversion , but the direction of the total angular momentum in the VRR model breaks this symmetry.
The ordered and disordered phases are stable at low and high temperatures, respectively202020See definition of temperature below Eq. (19.. The biaxial ordered phase, consisting of two highly inclined disks, requires the temperature to be limited to the range shown in Figure 21 and the mean angular momentum to be limited to where is the angular momentum of one object (Eq. 2). Biaxial states are metastable in both the canonical and microcanonical ensembles (i.e., in the canonical [microcanonical] ensemble they are a local minimum of the VRR free energy [negative of the entropy] but this minimum is larger than that of axisymmetric ordered states with the same temperature [energy]). Nevertheless, for practical purposes the lifetime of the two-disk state may be very long, especially if the disks are thin.
The canonical ensemble exhibits a first-order gravitational phase transition between the axisymmetric ordered and disordered states for a limited range of total angular momentum (where is the magnitude of the angular-momentum vector of a single object, see Figure 9). The phase-transition temperature depends on the total angular momentum (). The disordered state is only stable if either or . Negative-temperature states are allowed due to the fact that the VRR energy has a maximum possible value, which corresponds to the isotropic distribution for (see Section 3.2 and Figures 16 and 20).
At a critical total angular momentum the gravitational phase transition becomes second-order in the canonical ensemble and there is a smooth crossover between the ordered and disordered states for larger . For high angular momentum, equilibria resemble an axisymmetric disk in physical space for any positive temperature.
Due to the non-additivity of the quadrupolar mean-field model and the existence of negative heat-capacity equilibrium states, the canonical and microcanonical ensembles are inequivalent. In particular, the phase-transition of the canonical ensemble is replaced by a stable sequence of equilibria without a phase transition in the microcanonical ensemble including a region of negative specific heat (between and in Figure 11). Furthermore, negative heat capacity biaxial equilibria (between and in Figure 19) are metastable in the microcanonical ensemble but unstable in the canonical ensemble. Outside these regions, all VRR equilibrium states are identical in the microcanonical and canonical ensembles.
We furthermore introduced an ensemble, the -ensemble, in which the system is embedded in a much bigger bath with which it can exchange not only VRR energy but also angular momentum. This ensemble also exhibits a first-order phase transition and a critical point where it becomes second-order, but at different values of the thermodynamic variables (see Section 6).
The one-component quadrupolar VRR model discussed in this paper is reminiscent of the Hamiltonian Mean Field (HMF) model (Dauxois et al., 2002), although the VRR model has two degrees of freedom per body while the HMF model has only one, and the HMF model has a kinetic energy term in the Hamiltonian where VRR does not. Both the VRR and the HMF model exhibit ensemble inequivalence, first- and second-order phase transitions, and negative heat capacity, but the biaxial metastable equilibria and the negative-temperature equilibria do not appear in the HMF model.
There are strong similarities between the equilibrium configurations of VRR and liquid crystals. The Newtonian gravitational interaction between rapidly precessing elliptical orbits, which trace out axisymmetric punctured disks, is similar to the Coulomb interaction between axisymmetric molecules. In particular, the quadrupole mean-field Hamiltonian of VRR for a one-component system with zero total angular momentum is equivalent to the Maier–Saupe model of liquid crystals (Maier & Saupe, 1958), which exhibits a nematic-isotropic phase transition. In the nematic phase of VRR, the bodies are configured as a disk embedded in a dilute halo. The onset of the nematic-isotropic transition in VRR depends on the total angular momentum, just as the onset of the transition in liquid crystals depends on the external magnetic field, as can easily be verified by comparing our Figure 9 with the corresponding ones of Wojtowicz & Sheng (1974); Gramsbergen et al. (1986) for example.
However, there are important differences between the quadrupolar VRR system and liquid crystals. In contrast to the diamagnetic term that arises in the free energy due to an external magnetic field in liquid crystals, the term that arises in VRR due to the angular-momentum constraint is , which resembles a paramagnetic term of a spin system in an external magnetic field, .
We have restricted our attention in this paper to the mean-field approximation, in which each body is drawn independently from a distribution function and correlations are ignored. In the limit of zero temperature (), we find two distinct distributions that are in stable or metastable thermodynamic equilibrium: (i) a single razor-thin disk containing bodies with orbit normals that are both aligned and anti-aligned to the total angular momentum, with the relative numbers of each population depending on the total angular momentum (state ); and (ii) two razor-thin disks of equal mass, with normals inclined to the total angular-momentum vector by where depends on the total angular momentum and lies between and set by (state ). We call these states “uniaxial" and “biaxial", respectively.
The zero-temperature equilibria and are discrete configurations of angular-momentum vector directions (i.e., the distribution function has compact support on the unit sphere) that are time-invariant up to a rigid-body rotation at uniform angular speed. These are not the only such configurations. When other examples include (i) oriented along the vertices of a regular polyhedron, (ii) oriented along the vertices of a regular planar polygon in an arbitrary plane, (iii) oriented along the vertices of a regular planar polygon and either or both of the two directions perpendicular to the polygon’s plane. In the simplest of these configurations the mass in each component is equal. These configurations do not appear in our maximum-entropy analysis, but in the limit where the number of components is large, we may recover the axisymmetric continuous zero-temperature states derived earlier in this paper: for example, configuration (ii) .
Our analysis is based on two important simplifications in addition to the mean-field approximation (see Section 2 for a detailed inventory of our simplifying approximations). First, we approximated the gravitational torques between orbit annuli by their quadrupole component. Multipoles beyond the quadrupole may be important for systems with radially overlapping or closely spaced orbits. Second, we focused on a one-component model, in which all bodies have the same scalar angular momentum and all of the pairwise interactions have the same coupling coefficients (we briefly discussed a simple extension of this model to multi-component systems in Section 7). As we have shown in this paper, the statistical mechanics of this simplified model can be described completely yet exhibits a remarkable range of behavior. We expect that many of the features we have observed will also be present in more general models of stellar systems dominated by a central mass that do not depend on these approximations.
We are grateful to Julien Barré for generously helping us to clarify several aspects of the stability analysis and to Benjamin Beri for pointing out that the quadrupole mean-field Hamiltonian is described by the Maier-Saupe model of liquid crystals, and to the referee for thoughtful comments that substantially improved the paper. We thank Zoltan Rácz for useful discussions. This work was supported in part by the European Research Council under the European Union’s Horizon 2020 Programme, ERC Starting Grant #638435 (GalNUC), by the U.S. National Science Foundation through grant AST-1406166, and by NASA through grant NNX14AM24G.
Appendix A Hamiltonian for vector resonant relaxation
Here we describe the interaction Hamiltonian that governs VRR, that is, the interaction energy between Keplerian orbits after averaging over orbital phase and apsidal precession (equivalently, mean anomaly and longitude of pericenter). The most general form of this Hamiltonian may be written (Kocsis & Tremaine, 2015)
[TABLE]
where are Legendre polynomials and and label the bodies, which orbit around the central point mass with angular-momentum unit vectors . In this formula ,
[TABLE]
and
[TABLE]
Here , , denote the mass, semimajor axis, and eccentricity of the orbit of body around the central object, all of which are fixed during VRR, and “” and “” label the index or with the larger and the smaller semimajor axis, respectively, i.e., . Note that only terms with even contribute to the sum, and the term only contributes an unimportant constant to the Hamiltonian.
For circular orbits for all , and more generally, for eccentric orbits with (radially non-overlapping orbits)
[TABLE]
for . Here . For radially non-overlapping orbits the sum in Eq. (A1) converges exponentially as a function of and the term dominates the dynamics for arbitrary mutual inclinations. In contrast, for radially overlapping orbits, , the contribution of the multipole decays as , and multipoles up to order must be included when calculating the Hamiltonian (see Figure B1 in Kocsis & Tremaine 2015). However, if most orbits have relatively large inclinations and , the net torque on an object is still dominated by the quadrupole interaction. In particular, for a spherical cluster the contribution of the multipole to the net torque decays as fast as (see Appendix D in Kocsis & Tremaine 2015). For clusters with overlapping orbits, the effects of multipoles beyond the quadrupole may be significant.
For simplicity, let us keep only the term and assume that a large number of bodies have similar but different with several such components . Then the interaction energy is a sum over the components and the bodies therein,
[TABLE]
where and . In the second line the sums over and of components and respectively are written as integrals using the corresponding distribution functions and of the angular-momentum unit vectors. For circular orbits, , and this is Eq. (8) used in the main text. For eccentric, radially overlapping orbits is given analytically by Eqs. (B12) and (B31) in Kocsis & Tremaine (2015).
Appendix B Alignment of total angular momentum
In this appendix we show that, at equilibrium, the angular momentum and the Lagrange multiplier lie along one of the eigenvectors of the matrix . We work in a coordinate system aligned with these eigenvectors so is diagonal (Eq. 3) and we can write (there is no summation over here, that is, , where is the vector whose elements are the eigenvalues of ). The self-consistency condition (28) is then
[TABLE]
In the case , this condition simplifies to
[TABLE]
in the last equation we have introduced a spherical polar coordinate system. Now divide the interval into four quadrants: in the first quadrant, , use as the integration variable; in the second quadrant, , use ; in the third quadrant, , use ; and in the fourth quadrant, , use . Adding the sub-integrals we find
[TABLE]
Since the integration range is from 0 to , and are positive, so the integral is positive-definite if , negative-definite if is negative, and zero (as required) if and only if at least one of and is zero. Repeating this argument for , and , we conclude that at least two of the must be zero. Therefore must be aligned with one of the coordinate axes and thus with one of the eigenvectors of .
Without loss of generality we may assume that points along . Then the angular-momentum vector is
[TABLE]
If or 2, the integrand in the numerator is odd under the transformation so the integral must vanish. Thus only is non-zero so the angular momentum is parallel or anti-parallel to .
Appendix C Second-order variation
Here we calculate the second variation of entropy and free energy under general perturbations of the mean-field distribution function and show the stability of the negative-temperature configurations and the disordered phase at infinite temperature, labeled in Figure 3.
Let be the equilibrium distribution function,
[TABLE]
and let be a perturbation about this equilibrium. The entropy changes to
[TABLE]
Expand this to second order in . Then the change in entropy is
[TABLE]
Substituting Eq. (C1) and dropping terms higher than second order,
[TABLE]
Conservation of the number of bodies and the total angular momentum implies that
[TABLE]
Thus the equation for the entropy variation simplifies to
[TABLE]
The energy change can be written as
[TABLE]
which implies212121We denote .
[TABLE]
where the second line follows from the relation (Eq. 15). Thus if the energy is fixed
[TABLE]
Substituting in Eq. (C6) to eliminate the terms depending on , we find
[TABLE]
where is the second-order variation of entropy
[TABLE]
As it should, the entropy variation (C10) contains no first-order terms, since the perturbation is performed about the equilibrium distribution, at which the entropy has vanishing first-order variation by construction, if the constraints are satisfied (see Section 2.2).
In the canonical ensemble the equilibria are extrema of the free energy at constant temperature . Using Eqs. (C6) and (C9) we find the second-order variation of free energy to be
[TABLE]
The condition for stability of an equilibrium with respect to any perturbation is given in the microcanonical ensemble by the condition
[TABLE]
that is, the entropy must be a local maximum. In the canonical ensemble the stability condition is
[TABLE]
that is, the free energy must be a minimum if the temperature is positive, or a maximum if the temperature is negative.
Now, since , as is evident from (C11) and (C12), stability in the microcanonical ensemble is determined by the same inequality condition as in the canonical ensemble (Eqs. C13 and C13)). However, equilibria do not necessarily possess the same stability properties in the two ensembles, because the perturbations are subject to different constraints. In the canonical ensemble, the perturbations are subject only to constraints (C5), while in the microcanonical ensemble they are subject additionally to the fixed energy constraint (C9). Thus, modes that render the canonical ensemble unstable may not exist in the microcanonical ensemble. If this happens, the system presents inequivalence of ensembles, as described in Section 2.3.
Negative-temperature states
For negative temperature, , both of the terms in the entropy second variation (C11) are negative-definite and so for perturbations that keep , , and fixed. Therefore negative-temperature equilibria are local maxima of the entropy and hence they are (meta)stable in the microcanonical ensemble.
Similarly, in the canonical ensemble both terms in (C12) are positive-definite and so for perturbations which keep , , and fixed222222The temperature is defined by the angular momentum vector distribution function of the heat bath; it is constant because the heat bath has a much larger angular momentum than the system we are examining. Note that fixing does not impose any constraint on in Eq. (C12).. Therefore negative-temperature equilibria are local minima of and hence they are (meta)stable in the canonical ensemble.
State
The state is the disordered phase at infinite temperature and therefore corresponds to axisymmetric solutions with . Its stability is straightforwardly implied by Eq. (C11)
[TABLE]
Appendix D Stability of off-diagonal elements
We show here that for positive temperature, there can always be found a coordinate system with one axis aligned with , such that the off-diagonal elements of do not give rise to instabilities and are zero in equilibrium. We follow the notation of Section 3.1.
The dominant contributions to the integral (59) for the partition function of the -ensemble come from the minima of , which occur at a subset of the extrema where . The extrema are located at given by the implicit equation
[TABLE]
here the angle brackets denote the average
[TABLE]
We now describe some properties of the matrix . (i) Since we have , that is, is symmetric. (ii) Since , we have , that is, is traceless. (iii) Since is a tensor (Eq. 12), Eq. (D1) implies that also transforms as a tensor under rotations. Therefore without loss of generality we can choose the coordinate system in the -integral so that is diagonal232323Thus systems with are an example of spontaneously broken symmetry.. The whole system is free to change its principal axes perpendicular to . (iv) Using similar arguments to those in Appendix B we may then show that lies along one of these axes, which we may choose to be the -axis.
Given these results, all of the elements of are either zero or linear combinations of the parameters and defined in Eqs. (3)–(36). It is therefore useful to replace the nine coordinates by nine new coordinates defined by
[TABLE]
The Jacobian . In the new coordinates
[TABLE]
Here and , as in Eqs. (35) and (36). It is straightforward to confirm that the extrema of occur at . An extremum is a minimum if the Hessian is positive definite (i.e., all of its eigenvalues are positive) and a maximum if it is negative definite; otherwise the extremum is a saddle point. It is straightforward to confirm that all off-diagonal elements of the Hessian are zero in the 7 rows and columns corresponding to and that
[TABLE]
Thus is a minimum at the extrema as a function of these seven coordinates.
Appendix E Analytic results for the equilibria
Here we derive analytic expressions for the thermal equilibria. We define the moment generating function
[TABLE]
where , , and , the integral is over the unit sphere, and the parameters , , and are defined by Eq. (53). Once is known, all statistical quantities follow straightforwardly242424We suppress the constants from , , and .:
[TABLE]
[TABLE]
and for the -ensemble we have
[TABLE]
Eqs. (E9) follow from Eq. (53) assuming and . In the special case , self-consistency requires that . In this case is a trivial solution for arbitrary , and . This is the isotropic distribution , which is an equilibrium for any temperature and is shown by the curve in Figure 3.
E.1. Axisymmetric equilibria with zero total angular momentum
Systems that are axisymmetric must have two of the diagonal elements of (Eq. 3) equal. This requires that either or . Since the total angular momentum we are free to choose the -axis to be aligned with any one of the three eigenvectors of so without loss of generality we can assume that . Then by Eq. (53) and by Eq. (55). Eq. (E1) simplifies to
[TABLE]
Note that this is real for either positive or negative . Eqs. (E2) and (E9) yield the parametric solution
[TABLE]
As increases from to , increases monotonically from to . Both limits correspond to so the two limits are the zero-temperature states and in Figure 3. As increases from the temperature grows monotonically from to at , then it decreases monotonically for higher , approaching zero at . At , , Point in Figure 3 has , , and .
E.2. Axisymmetric, rotating equilibria
Next consider systems with non-zero angular momentum. Since is an even function of , implies (see Eq. E3). These states are axisymmetric around the angular-momentum axis with and . We evaluate (E1) at
[TABLE]
which is real for all . The quantities , , and may be obtained from Eqs. (E4), (E2), and (E9), which yields the parametric solution
[TABLE]
For any fixed , Eq. (E16) defines a monotonic function of mapping onto . For any fixed , the temperature assumes all values from to since the expression for is singular at (state ). In the canonical and microcanonical ensembles with fixed , is a monotonically increasing function of , but the temperature has three local extrema as a function of for , one local maximum for and no extrema for . Note that at (state ), (state ), and at a finite for (state ). Note that at for due to Eq. (E17).
In the -ensemble, and Eq. (E18) defines a monotonic function of for fixed , which can be inverted numerically to find for the equilibrium states between and for fixed . Substituting in Eqs. (E17) and (E18), is a monotonic function of for fixed , while has two local extrema as a function of for and no local extrema for . The heat capacity is negative between the two local maxima and positive otherwise.
Next we provide asymptotic expressions near , , and . We utilize the asymptotics of the error function
[TABLE]
In particular as and with fixed (state of Figure 7)252525The expressions below contain factors of . The factor , which is proportional to defined in Eq. (20), is retained because at fixed angular momentum , as according to Eq. (E23).
[TABLE]
which imply that
[TABLE]
and for and (state of Figure 7)
[TABLE]
As (state of Figures 15, 16, and 20) the temperature diverges,
[TABLE]
These results show that for axisymmetric states generally increases from to as changes from to , the endpoints being the and states (see Figure 7). The temperature approaches zero at both of these endpoints. For very small negative , the temperature is positive for and negative for . For intermediate values of (these are not shown by the asymptotics), we find that increases monotonically as a function of for fixed and the temperature assumes all values between and with up to three local maxima as a function of , i.e., one with for and two with for . In the limit , two of the local maxima of approach points and the third approaches point in Figure 3.
E.3. Non-axisymmetric rotating equilibria
For arbitrary one of the integrals over the two polar angles may be evaluated in Eq. (E1), which gives (see Eq. 50)
[TABLE]
The zero-temperature non-axisymmetric equilibria correspond to . We use
[TABLE]
for . We obtain
[TABLE]
The asymptotics near state in Figure 17 may be obtained from Eq. (E34) in the limit and such that
[TABLE]
is fixed. Laplace’s method262626 For any twice differentiable function with a unique minimum in , the following integral (if it exists) approaches
(E36)
We apply Eq. (E36) in Eq. (E34) with , , and .
yields
[TABLE]
Substituting in Eqs. (E2)–(E4) gives
[TABLE]
Substituting in the self-consistency equation (E9) gives a relation between and
[TABLE]
We may now eliminate . Next solve for using Eq. (E38) and substitute back into Eqs. (E9) and (E39)–(E40) to get the asymptotics near state parameterized by for any given :
[TABLE]
The state corresponds to the limit .
The asymptotics near may be obtained from Eq. (E34) in the limit that approaches a finite value while . In practice, we find numerically that for , for , and for any fixed , approaches a finite value. We derive an analytic approximation for the asymptotic behavior for by expanding Eq. (E34) in around 0 and using the identity (46),
[TABLE]
where is the modified Bessel function272727These equations become inaccurate for . For , we find numerically that , which implies that accurate analytic expressions exist in this regime (not shown), which may be derived with the Laplace method as in Eq. (E36).. Similarly, from Eqs. (46), (E34), and (E2)–(E4), we get
[TABLE]
We may substitute in the self-consistency equation (E9) to get a relation between and
[TABLE]
Now eliminate from Eqs. (E48)–(E50)
[TABLE]
[TABLE]
Therefore at zero temperature, the and equilibria depend on , while and are independent of . The latter series of equilibria depend on for at first and second beyond leading order in for and , respectively. In the limit, Eqs. (E43)–(E46) show that the order parameters are in the range and depending on as long as it satisfies . Outside of this range, for all , hence the assumption cannot be satisfied, and there is no state. The asymptotics near (Eqs. E52)–(E55) show that and , independent of .
In the limit, the non-axisymmetric equilibria near and (Eqs. E52–E55 and E43–E46) reduce to the axisymmetric asymptotics near and (Eqs. E20–E22 and E26–E28), respectively, in a rotated coordinate system (see Eq. 3)
[TABLE]
The energy of non-axisymmetric equilibria is bounded between
[TABLE]
where
[TABLE]
and for all where is the upper energy bound in Eq. (44) of the main text (see Figure 19).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abbott et al. (2016) Abbott, B. P., Abbott, R., Abbott, T. D., et al. 2016, Ph Rv L, 116, 061102
- 2Antonov (1962) Antonov, V. A. 1962, Solution of the problem of stability of stellar system with Emden’s density law and the spherical distribution of velocities (Vestnik Leningradskogo Universiteta, Leningrad: University)
- 3Arad & Lynden-Bell (2005) Arad, I., & Lynden-Bell, D. 2005, MNRAS, 361, 385
- 4Aronson & Hansen (1972) Aronson, E. B., & Hansen, C. J. 1972, Ap J, 177, 145
- 5Axenides et al. (2012) Axenides, M., Georgiou, G., & Roupas, Z. 2012, Ph Rv, D 86, 104005
- 6Bar-Or & Alexander (2014) Bar-Or, B., & Alexander, T. 2014, CQG Ra, 31, 244003
- 7Barré et al. (2001) Barré, J., Mukamel, D., & Ruffo, S. 2001, Ph Rv L, 87, 030601
- 8Bartko et al. (2009) Bartko, H., Martins, F., Fritz, T. K., et al. 2009, Ap J, 697, 1741
