Flattened stellar systems based on distribution functions depending on actions
Raffaele Pascale, James Binney, Carlo Nipoti

TL;DR
This paper investigates the construction of flattened stellar systems using distribution functions dependent on action integrals, addressing continuity constraints and presenting an algorithm for model creation.
Contribution
It introduces a new algorithm to ensure physically consistent distribution functions for flattened stellar systems based on actions.
Findings
Developed an algorithm for constructing flattened models
Ensured velocity distribution continuity near symmetry axes
Created various self-consistent flattened stellar models
Abstract
We address an issue that arises when self-consistently flattened dynamical stellar systems are constructed by adopting a distribution function (DF) that depends on the action integrals. The velocity distribution at points on the symmetry axis is controlled by the the dependence of the DF on just one action, while at points off the symmetry axis two actions are involved. Consequently, the physical requirement that the velocity distribution evolves continuously in the neighbourhood of the symmetry axis restricts the functional forms of acceptable DFs. An algorithm for conforming to this restriction is presented and used to construct a variety of flattened models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStellar, planetary, and galactic studies · Astronomy and Astrophysical Research · Cosmology and Gravitation Theories
Flattened stellar systems based on distribution functions depending on actions
Raffaele Pascale1,2, James Binney3 and Carlo Nipoti1
1Dipartimento di Fisica e Astronomia, Università di Bologna, via Piero Gobetti 93/2, I-40129 Bologna, Italy
2Osservatorio di Astrofisica e Scienza dello Spazio, via Piero Gobetti 93/3, I-40129 Bologna, Italy
3Rudolf Peierls Centre for Theoretical Physics, Clarendon Laboratory, Oxford OX1 3PU, United Kingdom E-mail: [email protected]
(Submitted, 21 July 2019)
Abstract
We address an issue that arises when self-consistently flattened dynamical stellar systems are constructed by adopting a distribution function (DF) that depends on the action integrals. The velocity distribution at points on the symmetry axis is controlled by the the dependence of the DF on just one action, while at points off the symmetry axis two actions are involved. Consequently, the physical requirement that the velocity distribution evolves continuously in the neighbourhood of the symmetry axis restricts the functional forms of acceptable DFs. An algorithm for conforming to this restriction is presented and used to construct a variety of flattened models.
keywords:
celestial mechanics - galaxies: kinematics and dynamics - galaxies: structure
- Galaxy: kinematics and dynamics - Galaxy: structure
1 Introduction
Model stellar systems with known distribution functions (DFs) are powerful tools for the interpretation of observations because such a model predicts the outcome of any observation. For example, the system’s surface brightness can be obtained by integrating times the luminosity per star over velocities and the line of sight; a map of any velocity moment can be obtained by including an appropriate power of in the integral. If the system’s stars are resolved, the likelihood of the data given the model can be computed regardless of how many phase-space coordinates have been measured and with what precision.
In general it is essential to consider a stellar system to comprise several components. For example, a globular cluster will contain a range of stellar masses and include significant numbers of massive, dark, remnants. In addition to these components, galaxies are thought to contain a population of dark-matter particles that are distributed more extensively in phase space than the stars, and they generally contain several stellar populations that differ by age and/or chemical composition.
A small number of model systems are known that have analytic expressions for density and potential and also analytic expressions for the DF. All such systems are spherical and the only multi-component models are the rather specialised and complex two-component models described by Ciotti (1999). A wider range of models can be obtained by considering systems with DFs that are analytic functions of integrals of stellar motion but have density and potential distributions that have to be obtained numerically. Traditionally the integrals of stellar motion used as arguments of the DF have been the energy and the magnitude of the angular momentum . However, the key to producing multi-component and aspherical systems is to take the DF to be a specified function of the action integrals . Advantages of using actions as arguments of the DF include
- •
The mass of any component is specified by the DF before the system’s density and potential have been determined.
- •
The addition of an extra component of mass changes the density distributions of other components in an intuitive way.
- •
The self-consistently generated potential can be solved for by a stable and rapidly convergent iteration.
Early uses of DFs that depended exclusively on actions examined the structure predicted by DF in assumed potentials (Binney, 2010, 2012). The potential self-consistently generated by a DF was first obtained by Binney (2014), but only for one-component systems. Piffl et al. (2015) solved for the potential self-consistently generated by a multi-component DF designed to model our Galaxy. Posti et al. (2015), Williams & Evans (2015) and Pascale et al. (2018) explored spherical models of early-type galaxies that are defined by DFs designed to model both dark haloes and the stellar content of elliptical and dwarf spheroidal galaxies.
In the case of a spherical potential, the actions comprise two components and of the angular momentum , and the radial action . is the component of about some chosen axis, and quantifies the inclination of the orbit with the respect to the chosen axis. The radial action quantifies the amplitude of a star’s radial oscillations.
If the part of that is an even function of depends on and only through the combination , the system’s real-space structure will be spherical. It may, however, have net rotation around the axis that defines : rotation around this axis is encoded in the part of that is a odd function of . If the part of that is even in depends on and other than through the combination , the model will be aspherical. If decreases with increasing faster than it does with increasing , the model will be oblate.
In an oblate potential, the existence of actions is not guaranteed, but numerical orbit integrations in plausible oblate potentials reveal that the great majority of orbits are quasiperiodic (Binney & Tremaine, 2008), which implies the existence of three action integrals. For the majority of orbits these actions prove to be minor modifications of the actions familiar from the spherical case (Binney & McMillan, 2016). A minority of ‘resonantly trapped’ orbits have actions that are not simply related to the spherical actions, although they can be computed using first-order perturbation theory formulated in terms of the usual (Binney, 2016). To a good approximation, the existence of trapped orbits can be neglected when model building since an ensemble of trapped orbits generate very similar predictions for most observables to an ensemble of untrapped orbits (Binney, 2018).
Subject to this proviso regarding the treatment of resonantly trapped orbits, any non-negative function of three variables for which the integral through the positive octant of Cartesian space is finite defines a stellar model of mass , because, given such a function, one can normalise it such that and can then solve for the potential that satisfies
[TABLE]
This computation is rendered feasible by the Stäckel Fudge (Binney, 2012), which, given any plausible axisymmetric potential , provides approximate formulae for . Equation (1) can be solved in iterations, starting from any plausible initial potential , by taking the potential to be that on the left of the equation with used in the Stäckel Fudge on the right (Binney, 2014, hereafter B14).
While any non-negative, normalisable function defines a logically possible model, B14 already noted that unless candidate functions are subjected to restrictions, the final model is liable to display physically implausible structure near the origin and/or symmetry axis. Moreover, Piffl et al. (2015) discovered that the simplest DFs for our Galaxy’s dark halo predicted implausibly cusped velocity distributions. The goal of this paper is to elucidate conditions on that ensure that it will generate a physically plausible model.
In Section 2 we explain the physical origins of the restrictions on and suggest a way of satisfying them. In Section 3 we illustrate the effectiveness of our proposal by presenting a variety models with and without implementation of our proposal. In Section 4 we ask why galaxies are restricted in the distribution of stars in action space, and Section 5 sums up.
2 Restricting the DF
2.1 Physical motivation
Along the symmetry axis of an axisymmetric galaxy, the two directions that run parallel to the equatorial plane are physically equivalent. If we call these the and directions, it follows that at any point on the symmetry axis, the distributions of and must be identical.
Only orbits with can reach the symmetry axis, so when considering the velocity distribution at points on that axis we can confine attention to the plane of action space, which has axes and . The orbits in this plane fall into two families: boxes (with small values of ) and loops (with above a threshold), as illustrated by Fig. 1. The loop orbits do not reach a central section of the symmetry axis; this section is reached by box orbits, which do not visit the part of the symmetry axis that is visited by loops.
In the central section of the axis, , which quantifies the height of a box orbit, largely quantifies , while quantifies and . Outside the central section of the symmetry axis, , which quantifies the radial excursions of a loop orbit, largely quantifies while quantifies and . In both sections of the symmetry axis, the distributions of and are guaranteed to be identical because they are set by the way depends on the same argument: in the central section and further out. If we wish to avoid rapidly changing velocity distributions as we move between the central and outer sections of the symmetry axis, we should relate the way depends on and along the line in the plane that has boxes on one side and loops on the other.
Since the velocity distribution should be a continuous function of position, the distributions of and should be nearly the same if we move a small distance along the axis from the symmetry axis. Once we are off the symmetry axis, orbits with non-zero contribute to the kinematics. Fig. 2 shows two orbits with the same small value of . The orbit on the left approaches the symmetry axis away from its central section, while the orbit on the right approaches just this central section. If we start from the central section of the symmetry axis and move parallel to the axis, it will be orbits like that on the right of Fig. 2 that contribute to the kinematics. A change to of such an orbit changes only , while a change to changes both and . Unless we restrict the way depends on and , there is no guarantee that the distributions of and will be similar at our new location and on the symmetry axis.
If we move parallel to the axis from a point on the symmetry axis that lies outside the central section, it will be orbits like that shown on the left of Fig. 2 that contribute to the kinematics. A change to on an orbit of this type only varies (which controls the amplitude of oscillations perpendicular to the symmetry axis), while a change to mainly changes . Hence the dependence of on and must be restricted if the condition of approximate isotropy just off the symmetry axis is to be satisfied.
In summary, these arguments show that
- •
At the derivatives of with respect to and should be related along a line in the plane .
- •
At small , the derivatives of with respect to and should be related in the limit .
- •
At larger , the derivatives of with respect to and should be related in the limit .
In Section 3 we will show that a DF that does not guarantee near isotropy all along the minor axis, generates unphysical density distributions.
2.2 Essential restrictions
Velocity isotropy would be guaranteed if the DF were a function of the Hamiltonian. Then the derivatives of with respect to and would be in the ratio of orbital frequencies
[TABLE]
For definiteness, we restrict ourselves to cored models. Orbits that are confined to the core will be essentially harmonic, with the consequence that for these orbits . We conclude that we can ensure that the velocity distribution parallel to the equatorial plane tends smoothly to the mandatory central isotropy by requiring that
[TABLE]
It is not hard to see that satisfaction of the very similar condition
[TABLE]
ensures that the velocity distribution in the plane tends smoothly to isotropy as one approaches any point on the central section of the symmetry axis.
Points on the symmetry axis and outside the central section are reached by orbits with but significantly non-zero . In this region, quantifies the vertical velocity component, which is unrestricted, while and quantify the two tangential components of velocity, which should have nearly identical distributions. By the same chain of argument we deployed above, we infer that the condition of approximate isotropy in and will be satisfied if
[TABLE]
The limiting frequency ratio required here is unity, as one may convince oneself in two ways: (i) use the Torus Mapper (Binney & McMillan, 2016) to compute the frequencies of orbits for diminishing ; or (ii) recall that is the frequency at which the orbital plane precesses, and that symmetry requires this frequency to be zero for an orbit that passes right over the pole of the potential. Thus we conclude that velocity isotropy near the symmetry axis requires
[TABLE]
2.3 Implementing the restrictions
Posti et al. (2015) describe a general procedure for constructing DFs that depend on the actions through a function
[TABLE]
that is linear and homogeneous in . Without loss of generality, the coefficient of can be taken to be one (B14). Posti et al. (2015) confined their attention to the case in which and therefore become functions of . The model that then generates is spherical. B14 had earlier shown in the special case of the isochrone (Hénon, 1960) that if one chooses , the model generated by is flattened. This idea was later exploited by Das & Binney (2016) to model the flattened inner stellar halo of our Galaxy.
Since for a DF
[TABLE]
adopting constant values of and is not consistent with the restrictions derived in Section 2.2. Hence we replace the coefficient of in by a function :
[TABLE]
We require to be a continuous function of such that
[TABLE]
We further require
[TABLE]
so that, when , depends on and through the total angular momentum and the generated model is spherical.
We satisfy the conditions (10) and (11) by writing
[TABLE]
where is a smooth function such that
[TABLE]
and is a smooth function such that
[TABLE]
Finally, we require the function in equation (12), to which tends as , to satisfy
[TABLE]
Functions that satisfy conditions (13) to (15) are
[TABLE]
where is a scale action characteristic of the system’s core. The form of specified by equations (16) satisfies the conditions (15): (i) it ensures a continuous transition between flattened models () and spherical models (); (ii) in the case of even small flattening, it quickly tends to . Fig. 3 plots as a function of for given values of . Substituting for and from equations (16) allows one to rearrange equation (12) to
[TABLE]
where and are dimensionless actions.
3 Worked examples
We illustrate the benefit of using (equation 19) rather than as the coefficient of in the function by computing some flattened models derived from the DF that Pascale et al. (2018) introduced to model the Fornax dwarf spheroidal galaxy – Pascale et al. (2019) already explored the range of spherical models that this DF generates. The DF is
[TABLE]
where is a positive, dimensionless constant that primarily controls the model’s density profile, is a scale action that sets the size of the model’s core, and is a normalising constant that ensures that . Here we assume . In the interests of generality, in equation (20) we define
[TABLE]
where
[TABLE]
We will refer to the old models, with constant as coefficient of in equation (7), as , and to the new models, with as coefficient of (equation 19), as .
Fig. 4 plots various quantities for a model computed with . The top panel shows the density profiles along the model’s major (red) and minor (black) axes, with distances scaled to the core radius , defined as the distance down the major axis at which
[TABLE]
The middle panel of Fig. 4 shows isodensity contours in the meridional plane, and the bottom panel shows the axis ratios of these contours that one obtains by fitting ellipses to the contours as explained in Appendix A. These plots reveal unphysical features that derive from the use of in equation (21).
At the left edge of the middle panel, the contours are sloping down to the left, reflecting a depression in the density along the minor axis, and, worse still, a discontinuity in the direction of the normals to isodensity surfaces where they cut the symmetry axis. The bottom panel shows a rapid decrease in the model’s flattening as one approaches the centre, which sends the axis ratio through unity to values indicative of prolateness before the centre is reached. This feature can also be seen in the top panel, where, when approaching the center, the density along the -axis becomes larger than the density computed along the -axis of the meridional plane. These unphysical features arise from the failure of the DF (20) with rather than (equation 21) to respect restriction on the velocity distribution along the minor axis that we derived in Section 2.2.
Fig. 5 compares six models computed with different : in the models shown in the upper row , while in the models of the lower row . The models have similar flattenings but their radial bias increases from left to right: their parameters are = (0.5,1), (0.7,1.4) and (1,2) and flattening increases with the ratio , while increasing both and increases the radial bias of the velocity distribution. All models have .
The isodensity contours plotted in the upper panels of Fig. 5 have clearly discontinuous slopes across the minor axis. The contours plotted in the lower panels do not show this unphysical discontinuity, so using rather than banishes cuspy isodensity surfaces. The left and middle panels of Fig. 6 show the iso-density contours along the minor axis on an enlarged scale for models with . The right column in this figure shows the variation with of the gradients of contours
[TABLE]
at three values of : , with equal to the model central density. Positive values of indicate a depression along the reference isodensity contour when approaching the minor axis. A depression is not unphysical – ‘peanut’ bulges of disc galaxies have such depressions – but a non-zero value of as is unphysical. Blue curves show when , while red curves show when . We see that use of rather than ensures that the slopes of isodensity contours are only slightly positive near the axis and vanish on the minor axis.
Fig 7 shows density profiles and axis ratios for models with flattenings that increase from left to right. Panels in the top row show density profiles along the major (red) and minor (black) axes for models computed with , while below them we show the corresponding models with . In the top row, the red curves fall below the black curves at , implying that these generally oblate models have prolate cores. In the panels of the middle row, the red and black curves approach together. The bottom row shows plots of the axis ratios as a function of semi-major axis. The models with have central axis ratios significantly greater than unity, whereas the more flattened models with have axis ratios that are always less than unity. In the least flattened of the models that uses (extreme left panel), does exceed unity at , but this model is so nearly spherical that this tendency to central prolateness is of little significance. The model just to its right becomes marginally prolate at , where the density is extremely close to the central density and even a tiny angular variation in density generates a significant value of .
Further exploration of any tendency to prolateness in the core is facilitated by defining the diagnostic
[TABLE]
The small panels of Fig. 7 are plots of in models in which . Clearly, in an oblate model should be negative, but the figure shows that in the core to an extent that varies with (). Fig. 8 plots for several values of the peak value of as a function of the ratio that controls a model’s flattening. The top panel is for models that use and the bottom panel is for models that use . In the top panel, the largest values of , and therefore the most prolate cores, occur in models with i.e., nearly spherical models, as is to be expected, and in models with the largest values of and therefore the greatest radial velocity bias. In the most radially biased model (), the peak in reaches and this model is prolate throughout the core. In the bottom panel, even in the most radially biased and least flattened model. In the vast majority of models it is much smaller. Thus replacing with essentially resolves the issue of prolate cores in addition to banishing cusped isodensity contours.
3.1 Distribution of
Piffl et al. (2015) encountered a problem with the distribution of azimuthal velocity components that occurs already in spherical models of the B14 type: when the model has a radial bias, a cusp around appears in the distribution because
[TABLE]
When the model has tangential bias, the cusp is replaced by a dimple, so the above limit is positive. To eliminate this problem, they forced the dependence of on the actions in the limit to mirror that of the Hamiltonian. To achieve this goal with defined by equation (9), we would have to make depend on the actions such that it tends to as . Fig. 9 explores the consequences of our failure to take this step by showing distributions at several locations in nine models of varying flattening and radial bias. The approximately isotropic and radially biased models have unexceptionable distributions. At the distributions of the tangentially biased models do have anomalous central shapes. The double humped nature of these profiles simply reflects the absence of a part of the DF that is odd in ; such a component would reinforce one hump at the expense of the other, yielding the skew distribution of a rotating model. The anomalous features that one would ideally eliminate by the method of Piffl et al. (2015) are the central spikes displayed by two of the tangentially biased models.
The difference between the anomalous features in these distributions and those presented by Piffl et al. (2015) probably arises because the latter were adapting the DFs of Posti et al. (2015), which generate models with central density cusps, rather than building cored models.
4 Discussion
Action space is simply a way of cataloguing orbits, and one may ask why one is not at liberty to populate those orbits as one pleases by setting the DF to an arbitrary, normalisable, non-negative function of . The proper response to this claim is to say ‘yes you can populate the orbits as you please, but there are two reasons why the schemes of population occurring in real systems are restricted.’
The first reason is the requirement for self-consistency. It is possible that certain DFs are not consistent with an essentially integrable self-consistent potential. For example, some of the potentials encountered during an attempt to iterate from to a self-consistent might have significant chaotic zones. Stars would diffuse through these zones, irreversibly changing .
The second reason why the DFs of real systems will be restricted relates to the manner in which stars are distributed in phase space. Imagine stars being scattered like confetti, or shrapnel from a shell burst at a large number of locations. Then the initial conditions will be smooth functions of and of and the resulting DF will not be one that induces discontinuities in velocity distributions. Star formation will likewise create stars with a density that is a continuous function of .
5 Conclusions
While any non-negative, normalisable function specifies a self-consistent stellar system, the system it generates will have unphysical features unless the DF satisfies the constraints (4) and (6) on its derivatives. These constraints arise from the requirement that the velocity distribution at points near the symmetry axis should differ little from the (axially symmetric) velocity distributions on the symmetry axis. If the DF is a function only of the Hamiltonian , these constraints are automatically satisfied.
We have experimented with an algorithm that generates DFs for flattened and possibly radially biased models that are consistent with the DF tending to a function of the Hamiltonian as approaches zero. We have shown that the resulting models are free of the unphysical features near the symmetry axis that disfigure models based on the simpler DFs proposed by B14. The new DFs provide a promising basis for modelling different components of globular clusters, dwarf spheroidal galaxies and galactic bulges.
Appendix A Quantifying flattening
Each iso-density contour is characterised by a collection of points, with , such that . For that contour we define
[TABLE]
where and are, respectively, the semi-major and the semi-minor axes for an oblate model, viceversa for a prolate model. We find the best () as the solution of
[TABLE]
which is
[TABLE]
and
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Binney (2010) Binney J., 2010, MNRAS , 401, 2318 · doi ↗
- 2Binney (2012) Binney J., 2012, MNRAS , 426, 1324 · doi ↗
- 3Binney (2014) Binney J., 2014, MNRAS , 440, 787 · doi ↗
- 4Binney (2016) Binney J., 2016, MNRAS , 462, 2792 · doi ↗
- 5Binney (2018) Binney J., 2018, MNRAS , 474, 2706 · doi ↗
- 6Binney & Mc Millan (2016) Binney J., Mc Millan P. J., 2016, MNRAS , 456, 1982 · doi ↗
- 7Binney & Tremaine (2008) Binney J., Tremaine S., 2008, Galactic Dynamics: Second Edition
- 8Ciotti (1999) Ciotti L., 1999, Ap J , 520, 574 · doi ↗
