Secular dynamics of binaries in stellar clusters I: general formulation and dependence on cluster potential
Chris Hamilton (1), Roman R. Rafikov (1,2) ((1) DAMTP, Cambridge, (2), IAS)

TL;DR
This paper develops a general theoretical framework for understanding the long-term orbital evolution of binary systems in axisymmetric stellar clusters, accounting for various potential shapes and orbital configurations.
Contribution
It derives a universal, quadrupole-order secular Hamiltonian for binaries in axisymmetric potentials, extending previous models and unifying different dynamical effects.
Findings
Reproduces known secular effects like Lidov-Kozai and Galactic tides.
Introduces parameters A and Gamma that encapsulate cluster potential effects.
Provides a basis for analyzing binary evolution in diverse stellar environments.
Abstract
Orbital evolution of binary systems in dense stellar clusters is important in a variety of contexts: origin of blue stragglers, progenitors of compact object mergers, millisecond pulsars, and so on. Here we consider the general problem of secular evolution of the orbital elements of a binary system driven by the smooth tidal field of an axisymmetric stellar cluster (globular, nuclear, etc.) in which the binary orbits. We derive a secular Hamiltonian (averaged over both the inner Keplerian orbit of the binary and its outer orbit within the cluster) valid to quadrupole order for an arbitrary cluster potential and explore its characteristics. This doubly-averaged 'tidal' Hamiltonian depends on just two parameters, which fully absorb the information about the background cluster potential and the binary's orbit within it: a dimensional parameter setting the secular timescale, and a…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21
Figure 22
Figure 23
Figure 24
Figure 25
Figure 26
Figure 27
Figure 28
Figure 29
Figure 30
Figure 31
Figure 32
Figure 33
Figure 34
Figure 35
Figure 36
Figure 37
Figure 38
Figure 39
Figure 40| Orbit of | Potential | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| (I) | |||||||||
| (II) | |||||||||
| (III) |
| Orbit | Potential | ||||
|---|---|---|---|---|---|
| (IV) | |||||
| (V) | |||||
| (VI) | |||||
| (VII) |
| Orbit | Potential | ||||
|---|---|---|---|---|---|
| (VIII) | |||||
| (IX) | |||||
| (X) | |||||
| (XI) |
| Type of the potential/orbit | range | range | |
|---|---|---|---|
| General axisymmetric potential | |||
| Spherical potential (assuming and finite mass) | |||
| Midplane of a thin disk | |||
| Vertical cylindrical potential | |||
| Axisymmetric harmonic potential | |||
| Spherical harmonic potential | |||
| Keplerian potential | |||
| Spherical cusp potential (density ) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Secular dynamics of binaries in stellar clusters I: general formulation and dependence on cluster potential
Chris Hamilton1 and Roman R. Rafikov1,2
1Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Wilberforce Road, Cambridge CB3 0WA, UK
2Institute for Advanced Study, Einstein Drive, Princeton, NJ 08540, USA E-mail: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract
Orbital evolution of binary systems in dense stellar clusters is important in a variety of contexts: origin of blue stragglers, progenitors of compact object mergers, millisecond pulsars, and so on. Here we consider the general problem of secular evolution of the orbital elements of a binary system driven by the smooth tidal field of an axisymmetric stellar cluster (globular, nuclear, etc.) in which the binary orbits. We derive a secular Hamiltonian (averaged over both the inner Keplerian orbit of the binary and its outer orbit within the cluster) valid to quadrupole order for an arbitrary cluster potential and explore its characteristics. This doubly-averaged ‘tidal’ Hamiltonian depends on just two parameters, which fully absorb the information about the background cluster potential and the binary’s orbit within it: a dimensional parameter setting the secular timescale, and a dimensionless parameter which determines the phase portrait of the binary’s inner orbital evolution. We examine the dependence of and on cluster potential (both spherical and axisymmetric) and on the binary orbit within the cluster. Our theory reproduces known secular results — such as Lidov-Kozai evolution and the effect of the Galactic tide on Oort Cloud comets — in appropriate limits, but is more general. It provides a universal framework for understanding dynamical evolution of various types of binaries driven by the smooth tidal field of any axisymmetric potential. In a companion paper (Hamilton & Rafikov 2019b) we provide a detailed exploration of the resulting orbital dynamics.
keywords:
gravitation – celestial mechanics – stars: kinematics and dynamics – galaxies: star clusters: general – binaries: general
††pubyear: 2019††pagerange: Secular dynamics of binaries in stellar clusters I: general formulation and dependence on cluster potential–F
1 Introduction
Orbital motion of two bound point masses perturbed by external forces represents one of the oldest problems in celestial mechanics (Murray & Dermott, 1999). It naturally emerges in a variety of astrophysically relevant situations: classical secular Laplace-Largange evolution of planetary systems (Murray & Dermott, 1999), Lidov-Kozai oscillations forced by a distant companion in a triple system (Lidov, 1962; Kozai, 1962), evolution of Oort Cloud comets driven by the Galactic tide (Heisler & Tremaine, 1986), and so on. Due to the weakness of the external perturbation, the evolution is usually secular in nature and operates on long time scales.
Dense stellar systems — globular and open clusters, nuclear star clusters, galaxies themselves (hereafter collectively called ‘clusters’) — represent ideal environments for perturbing binaries. Historically, these perturbations were considered predominantly in the context of encounters with individual passing stars (Heggie, 1975; Hut & Bahcall, 1983; Hut, 1983), which should occur rather frequently in dense clusters (although not always in the perturbative limit). In particular, secular changes of binary orbital elements caused by a passage of a single star were calculated by Rasio & Heggie (1995), Heggie & Rasio (1996), and Hamers (2018).
At the same time, the gravitational effect of the smooth mass distribution of the full cluster on binary orbital evolution has been explored mainly in the context of Oort Cloud formation and evolution driven by the Galatic tide (Heisler & Tremaine 1986, hereafter HT86). In this case the ‘binary’ is the Sun-comet system, and Galactic tides can be at least as important as stellar encounters for the orbital evolution of Oort Cloud comets. Effects of the spatially smooth Galactic tide have also been studied in the context of wide stellar binaries (Jiang & Tremaine, 2010) and long-period planetary systems in the Galactic bulge (Veras & Evans, 2013a).
Interest in binaries in stellar clusters has been renewed recently by the discoveries of mergers of compact-object binaries by the LIGO collaboration (e.g. Abbott et al. 2016, Abbott et al. 2017). A number of channels for the production and orbital evolution of merging binaries in dense stellar systems have been proposed following these discoveries (Wen, 2003; O’Leary et al., 2006; Miller & Lauburg, 2009; Bae et al., 2014; Rodriguez et al., 2015; Rodriguez et al., 2016a; Rodriguez et al., 2016b; Antonini et al., 2016; Askar et al., 2017; Chatterjee et al., 2017; Petrovich & Antonini, 2017; Hamers et al., 2018; Hoang et al., 2018; Rodriguez et al., 2018; Samsing, 2018; Fragione & Kocsis, 2018). Some of them involve Lidov-Kozai (Lidov 1962; Kozai 1962; hereafter LK) coupling with a third distant companion111Silsbee & Tremaine (2017) and Antonini et al. (2017) also considered Lidov-Kozai cycles in isolated triples to explain LIGO events., which is either captured from the cluster environment (e.g. Antonini et al. 2016) or is effectively represented by a central super-massive black hole (Antonini & Perets, 2012; Petrovich & Antonini, 2017; Hoang et al., 2018). Under certain conditions, LK oscillations can naturally drive the binary orbit to become highly eccentric, boosting gravitational wave emission and substantially speeding up binary coalescence. Similar ideas (with different sources of dissipation at periapsis) have been previously considered for explaining the origin of other exotic objects typically found in stellar clusters, such as blue stragglers (e.g. Knigge et al. 2009; Perets & Fabrycky 2009).
In this work we consider the general problem of secular evolution of binary orbital elements driven by the tidal field that arises due to the smooth mass distribution of an axisymmetric cluster in which the binary moves. We do this for an arbitrary axisymmetric cluster potential and set no constraints on the type of orbit that the binary can have in the cluster. This allows us to formulate the most general framework for treating tide-driven secular evolution of binaries, applicable to a variety of astrophysical systems.
In doing this we neglect the stochastic effect of individual stellar passages on the orbital elements of the binary. Separating the cumulative effect of multiple stellar encounters from the smooth Galactic tide is a non-trivial exercise, as demonstrated previously by Collins & Sari (2010) and Jiang & Tremaine (2010). Nevertheless, for the purposes of clarity, we prefer to focus here on the effects of the mean tidal field due to the smooth mass distribution inside the cluster — effects of encounters with individual stars will be incorporated later.
This paper (the first in a series) is devoted to the derivation of the general Hamiltonian governing secular orbital evolution of a binary in an arbitrary axisymmetric cluster potential, as well as to exploring the dependence of some characteristics of this Hamiltonian on the properties of the cluster potential and the binary’s orbit within it. It is structured as follows. In §2 we derive the tidal Hamiltonian for the dynamical evolution of binary orbital elements due to any axisymmetric perturbation when expanded to quadrupole order. In §§3-5 we average the tidal potential over both the binary’s inner orbit and then over many orbits of the binary around the cluster, arriving at a simple doubly-averaged (secular) Hamiltonian which describes long-term evolution of the binary’s orbital elements. The coefficients entering this secular Hamiltonian depend on the potential of the host system and the binary’s barycentric orbit within this potential, and we explore this dependence in detail in §6. We verify the time-averaging procedure numerically in §7. In §8 we discuss the limitations of our theory, and show how our general results are connected with various special cases already explored by others (also in Appendices B & C).
In a companion paper (Hamilton & Rafikov 2019a, hereafter ‘Paper II’) we provide a complete study of the dynamics that result from the secular tidal Hamiltonian derived in this work, calculate the timescale and amplitude of eccentricity oscillations, and verify our theory with direct N-body integration.
2 Hamiltonian with cluster tides
Let us consider a binary system with semi-major axis and eccentricity , consisting of point masses and . Binary components gravitationally interact with each other and a fixed smooth background potential of a much more massive system, which we will later take to be axisymmetric. The application we have most readily in mind is that of binary stars in the mean field potential of a globular or nuclear star cluster, and for this reason we will frequently refer to and as ‘stars’ and to the background system as ‘the cluster’. However it should be borne in mind that our analysis works for any system of two gravitationally bound objects (binary black holes, comet-Sun system, etc.) moving in any axisymmetric potential (galaxy, open cluster, young stellar cluster, etc).
Throughout this work we will refer to the binary’s orbit around the cluster as the ‘outer orbit’, while the orbit of the binary components about their common barycentre will be called the ‘inner orbit’, to coincide with the standard terminology in LK studies (e.g. Naoz 2016). To describe the outer and inner orbits we set up two coordinate systems — see Figure 1 for illustration.
The first, given by , has its origin at the centre of the cluster. In this coordinate system, the radius vector of the outer orbit, i.e. from the cluster centre to the barycentre of the binary is given by . The second (non-inertial) coordinate system has its origin at , and its axes are fixed to be aligned with those of the first system, so only its origin moves. The position of star in the non-inertial system is then given by . The position of star relative to the centre of the cluster is and the barycentre is at .
The equation of motion of star is then
[TABLE]
for , where the subscript on derivatives means that we evaluate the derivative at .
Defining the relative position , and , one obtains from (1)
[TABLE]
which is the general equation of relative motion of the binary components.
2.1 Tidal approximation
We now employ the tidal approximation, which means that in equation (2) we expand the potential around . The Cartesian components of the vector at position are
[TABLE]
where , are the coordinate indices, so that e.g. and represent the components of and respectively. We expect that terms will be subdominant because the distance to the centre of the cluster () is much greater than the binary separation . In this approximation we find
[TABLE]
with the components of . Keeping in mind that, since the axes of the two coordinate systems are aligned, we may interchange with , we can also write this as
[TABLE]
With as our canonical coordinates and as the corresponding momenta, these equations of motion may be derived from the time-dependent Hamiltonian
[TABLE]
where
[TABLE]
and . To compress the notation we write so that the perturbing (‘tidal’) Hamiltonian reads
[TABLE]
where we sum over .
The dominant part of the Hamiltonian corresponds to the motion of an isolated binary star about its own barycentre, and has no explicit time dependence. The perturbing term takes into account the tidal effects of the external potential, which will drive the secular evolution of the binary orbital elements. It implicitly depends on time through , which we look at next.
According to equation (1) the evolution of is governed by
[TABLE]
The small correction terms on the right hand side of this equation mean that, in general, the motion of in the cluster does not coincide exactly with that of a test particle. However, at the level of accuracy needed in this work we can neglect this difference and assume that coincides with the ‘guide’ radius vector , which evolves according to the equation of motion of a test particle in the cluster potential,
[TABLE]
In other words, in the following we set and calculate using equation (11).
Our neglect of the terms quadratic and higher order in in equation (3) is equivalent to the so-called ‘quadrupole approximation’ in the hierarchical three-body problem. Keeping the next (quadratic) term in the expansion would correspond to the ‘octupole approximation’, and so on. In Appendix E we describe the extension of our tidal Hamiltonian to octupole order and provide a connection to the LK problem in the octupole approximation.
2.2 Orbital elements and Delaunay variables
We now introduce standard orbital elements in the frame of the binary. The reference direction is taken to be the direction and the reference plane the plane (see Figure 1; we will later take the axis to be the symmetry axis of the potential but the assumption of axisymmetry is not needed at the moment). We define binary argument of pericentre , inclination , longitude of ascending node and mean anomaly relative to this reference plane and direction. When written in orbital elements the relative coordinates become (Murray & Dermott, 1999):
[TABLE]
[TABLE]
[TABLE]
and where is the eccentric anomaly. It is important that the orbital elements are defined with respect to a reference frame with axis directions fixed in time (cf. Brasser 2001; Veras & Evans 2013a; Correa-Otto et al. 2017). In the limit of the cluster tide going to zero these orbital elements stay constant.
For dynamical studies it is often more convenient to use Delaunay variables, in which the actions
[TABLE]
are complemented by their conjugate angles , , . We will use them extensively in Paper II.
Since Delaunay variables are angle-action variables, we can easily identify the conserved quantities in the Hamiltonian. The dominant part of the Hamiltonian (7) reads
[TABLE]
while the perturbing Hamiltonian is given by equation (9) with and given by equations (12), (13) and (14) respectively (or their Delaunay equivalents).
3 Averaging the tidal Hamiltonian
Dynamics of binaries in stellar clusters benefits from a natural separation of scales. For example, a Solar mass binary with has an inner orbital period of years, while its outer orbit around a globular cluster might have a period of years. As we show in Paper II, the resulting secular evolution of the binary’s orbital elements due to the tidal potential of the cluster may take years.
This naturally allows us to simplify our Hamiltonian (9), first by integrating out the fast evolution of the mean anomaly of the inner orbit (‘single-averaging’, see §3.1), and then by also integrating over many (outer) orbits of the binary around the cluster (‘double averaging’, see §3.2).
3.1 Singly-averaged Hamiltonian: averaging over the mean anomaly
We begin by averaging over the shortest timescale in the problem, namely over the inner orbital motion of the binary components around their common barycentre. Our singly-averaged Hamiltonian is
[TABLE]
where the average of a quantity over the mean anomaly is defined as
[TABLE]
The coefficients depend on time only through , which is a ‘slow’ variable, so
[TABLE]
For reference, the full algebraic expressions for in terms of orbital elements are given in Appendix A. The singly-averaged Hamiltonian (19) incorporating these expressions is completely general and can be used to describe orbital evolution of binaries moving in an arbitrary external potential.
Obviously we have eliminated the angle , therefore the conjugate action is conserved, and so the binary’s semi-major axis is constant. When written in Delaunay variables, the singly-averaged Hamiltonian is a function of and the time through the time-dependent coefficients .
3.1.1 Example: orbits in a harmonic potential
For illustration, as well as to connect to subsequent results, we consider a binary orbiting in a globular cluster with a triaxial constant-density core. For orbits in this core the potential is that of a three-dimensional harmonic oscillator with frequencies , namely . Then so the singly-averaged Hamiltonian (19) becomes
[TABLE]
Let us now consider an axisymmetric core where the symmetry axis is the axis. Then and the binary’s outer orbit fills a section of a cylindrical surface with elliptical cross-section (aligned with the axis). Using equations (59)-(61) we end up with
[TABLE]
where .
Note that the dependence on the longitude of the ascending node has dropped out of this Hamiltonian, so the -component of binary angular momentum is conserved. That is, in the reference frame of a binary orbiting an axisymmetric harmonic potential, the perturbation due to the tidal field of the cluster effectively becomes axisymmetric after averaging only over the inner orbit of the binary (single-averaging), not its outer orbit around the cluster (double-averaging). This is despite the fact that the outer orbit itself is not axisymmetric in general, even after averaging over a long time interval (its projection onto the plane is an ellipse centred at the origin). This property does not hold for arbitrary potentials.
Things simplify further if we assume the core to be spherically symmetric. Without loss of generality we can then assume the outer orbit of the binary to be in the plane, and put all frequencies equal to (so that ). We find that
[TABLE]
This singly-averaged Hamiltonian is now also independent of the argument of pericentre . As a result, in this case there is no evolution of eccentricity or inclination of the binary. The only variation of its orbital elements is apsidal precession at the rate
[TABLE]
independent of the orientation of the binary orbit (i.e. , , ). Here is the Keplerian mean motion of the binary.
3.2 Doubly-averaged Hamiltonian: averaging over time
As we already mentioned, binary orbital elements change significantly on timescales that are much longer than the outer orbital period of the binary around its host system. For that reason, it makes sense to average over many outer orbits. Indicating such time-averages with an over bar, we write:
[TABLE]
where the doubly-averaged perturbing Hamiltonian differs from its singly-averaged predecessor (equation (19)) only in that each is now replaced by its time-averaged value :
[TABLE]
This works because the outer orbit only enters and not .
Equation (25) is the doubly-averaged perturbing Hamiltonian and is the main result of this section. It describes the secular evolution of the orbital elements of any binary perturbed by a smooth external potential . However in its current abstract form it is not of much use. In the following section we show how the time-averages may be calculated for cluster potentials possessing certain symmetries, culminating in the expressions (42), (43).
4 Time-averaging in axisymmetric potentials
So far we did not need to specify anything about the potential . However, we will now focus on binaries orbiting in fixed axisymmetric potentials (§§4.1-4.2). We describe the time-averaging procedure in spherical clusters (§4.3) and then extend it to general axisymmetric potentials (§4.4).
4.1 in cylindrical coordinates
In an axisymmetric cluster we can choose the symmetry axis of the potential to be the axis (like in Figure 1). Then it makes sense to write down the derivatives in the cylindrical coordinate system with origin at the centre of the cluster, where and . The axisymmetric potential may then be expressed as , and we find
[TABLE]
Here is the azimuthal coordinate of , namely , and again the subscripts on derivatives mean ‘evaluated at position ’. The coefficients have certain symmetry properties which will become important when we consider their time-averaged values in §4.2.
4.2 Orbit families and non-commensurable frequencies
We now consider which orbit families are possible in general axisymmetric potentials. Numerical orbit integration confirms that most orbits in axisymmetric potentials are regular and appear to respect a third integral of motion alongside energy and the -component of angular momentum (Binney & Tremaine, 2008; Merritt, 2013). The vast majority of these regular orbits form a tube, or ‘torus’, around the symmetry axis: in an oblate potential they are short-axis tube orbits, while in a prolate potential they are (inner- or outer-) long axis tube orbits. Other possibilities include near-resonant regular orbits and irregular (chaotic) orbits, but both of these are typically scarce compared to the tubes.
We will ignore chaotic orbits in what follows since they are very rare in axisymmetric potentials (Regev, 2006). We are left with tube orbits and (near-)resonant non-tube orbits. The resonant family corresponds to having commensurable frequencies. Mathematically, if we denote the frequencies of motion of in each direction by the vector , we must consider whether there exists any triple of integers such that
[TABLE]
The role of commensurabilities and near-commensurabilities will be discussed in §§7-8.
If the frequencies are non-commensurable (i.e. relation (32) does not hold), then the outer orbit of the binary will trace out a non-repeating path around the cluster. Over time this path will densely fill a 3D axisymmetric torus whose symmetry axis is . We may therefore replace the time-average of a function following an orbit with a weighted (by the time the orbit spends at a given point) volume-average over the torus.
Since the torus is axisymmetric, time-averaging over non-commensurable orbits inevitably involves integrating the expressions (4.1)-(31) over azimuthal angle . As a result, time-averages of become
[TABLE]
see equations (4.1)-(31). In practice, vanishing of , , typically requires averaging over many outer orbits — see §7.
4.3 Time-averages in spherical potentials
In spherical potentials the outer orbit of the binary remains in the same plane, which can be chosen to coincide with the plane. In this plane the coefficients and vanish identically. In other words, equation (36) holds true even without averaging over the outer orbit. At the same time, asymptotically tends to zero only upon averaging over many orbits, as we will see later in §7.1.
In the plane the path of is a rosette, assuming it has non-commensurable radial and azimuthal frequencies; see Figure 7 for illustration. Over time the rosette densely fills an axisymmetric annulus with inner and outer radii corresponding to the pericentre and apocentre of the outer orbit . When discussing spherical potentials we will sometimes refer to this as the ‘axisymmetric annulus approximation’.
In this case it is easy to write down an analytical formula for the averages in terms of and , as averaging over can be replaced with averaging over via , where is the radial velocity. Specific energy and angular momentum of the outer orbit in a spherical potential can be explicitly expressed as function of and as follows:
[TABLE]
With this in mind, we can write the time-average of any radially-dependent function as
[TABLE]
Remembering that only the azimuthally-averaged versions of provide non-zero contributions (see §4.2 and equations (33)-(36)), we see that in spherical potentials the time-averages can be calculated in a straightforward fashion via integration over radius .
4.4 Time-averages in axisymmetric potentials
We would like to generalise the approach of §4.3 to axisymmetric potentials . This would involve averaging over the cross-section of an axisymmetric torus filled by the outer orbit of the binary — see the central columns of Figures 10 & 11 for examples of such cross-sections. However, there are several difficulties with this approach.
First, each element of the cross-section enters the averaging procedure with a certain weight proportional to the time the orbit spends in it. Calculating this weight is not trivial and involves the knowledge of the meridional velocity (, ) structure. For a general axisymmetric potential this calculation cannot be done analytically.
Second, even the shape of the outer boundary of the cross-section cannot be determined analytically for a general axisymmetric potential. The difficulty is that the knowledge of and -component of angular momentum (which are integrals of motion in a general axisymmetric potential) is not sufficient to determine the shape of the meridional cross section of the torus: one also needs to know a third integral of motion . The exact shape of the torus cross-section is known only for orbits in Staeckel potentials (Binney & Tremaine, 2008), since only for those do we have analytic expression for the third integral . Even then, writing down a formula for the time-averaged coefficients is tiresome because of the awkward confocal-ellipsoidal coordinate system involved (Binney & Tremaine, 2008) and the complicated functional form of the third integral.
For these reasons, in this work we usually222There are special cases in certain axisymmetric potentials where we can compute (semi-)analytically, see §6.1 & §6.3.1. compute and in axisymmetric potentials by directly integrating the orbit of a guide numerically using equation (11) for a given set of initial conditions , where is the velocity in the direction of increasing , etc. This orbit is then used to carry out the time-average of using a method outlined in Appendix F.
Note that, unlike in the spherically symmetric case, and no longer vanish identically due to a symmetry of the potential. Nevertheless, equations (35)-(36) are still fulfilled upon averaging over many outer orbits.
5 The secular Hamiltonian
Despite the fact that in general axisymmetric potentials we cannot write down a useful analytic expression for time-averages, we can still continue our derivation of the secular Hamiltonian owing to the symmetries of the coefficients (equations (33)-(36)). These symmetry properties allow us to greatly simplify the doubly-averaged perturbing Hamiltonian (25) so that it reads:
[TABLE]
Let us define the quanitities
[TABLE]
(note that the constants and are not related to the Oort constants!) and write in terms of orbital elements using equations (59)-(61). Then the Hamiltonian (40) becomes
[TABLE]
and is the ‘dimensionless Hamiltonian’
[TABLE]
Note that involves only a single dimensionless parameter , while depends on (which has units of (frequency)2). In Paper II we will see that determines the phase space structure of the Hamiltonian while sets the timescale for secular evolution. All the information about the cluster properties and the characteristics of the (outer) orbit of the binary enter the Hamiltonian through these two parameters only. In §6 we investigate in detail how these key quantities depend on the form of the background potential and the outer orbit of the binary within the potential.
The dependence of the Hamiltonian upon the longitude of ascending node has dropped out under time-averaging and so the -component of angular momentum is conserved, as we would expect for an axisymmetric perturbation (which the cluster potential looks like from the binary frame upon averaging over many outer orbits). The dimensionless quantity is an integral of motion, which implies that variations of eccentricity must come at the expense of changes in inclination and vice versa, just as in the LK mechanism (Lidov, 1962; Kozai, 1962; Naoz et al., 2013).
Finally, we note that the doubly-averaged perturbing Hamiltonian (42) appears very similar to the singly-averaged one derived for the example of an axisymmetric harmonic potential in §3.1.1 (equation (21)). Indeed, comparing equations (21) and (42) one might be tempted to say that axisymmetric harmonic potentials have . However, this correspondence is a mathematical coincidence: the assumption of non-commensurability (§4.2) does not apply to harmonic potentials, for which all orbits are closed non-precessing ellipses. Despite their similarities the Hamiltonians (21) and (42) are different objects derived under different approximations.
5.1 Orbits in a Kepler potential: link to the Lidov-Kozai mechanism
Another example of such a mathematical coincidence is represented by the well known test particle quadrupole Lidov-Kozai (LK) problem (Lidov, 1962; Kozai, 1962). The Hamiltonian for this problem takes the form (43) if we were to set . However, we have derived this Hamiltonian under the assumption that fills an axisymmetric torus, while in the LK case moves in a Keplerian ellipse, which in the test particle limit does not precess. Nevertheless, it is known that for elliptical orbits the time-averaged tidal Keplerian potential is exactly axisymmetric to quadrupole order (e.g. Katz et al. 2011; Naoz et al. 2011), and so (42) does in fact hold.
In Appendix B we show explicitly how the LK Hamiltonian is recovered in the ‘test particle quadrupole’ approximation from the singly-averaged equation (19) in the limit that the background potential in which the binary orbits arises from a point mass at the origin. We recover the LK Hamiltonian exactly if we set in (43).
5.2 Epicyclic orbits in a disk: link to Heisler &
Tremaine (1986)
For wide binaries in the solar neighbourhood, the tidal potential of the Galactic disk can provide the dominant torque, as shown by Heisler & Tremaine (1986) for the Oort Cloud comets. Averaged over many orbits of the Sun around the Galaxy, the Galactic disk provides an axisymmetric tide onto the binary. In Appendix C we show how to calculate in the case where performs epicyclic motion in a disk. It is then easy to recover the tidal Hamiltonian of HT86 from (42). We reproduce the dimensionless version of HT86’s Hamiltonian by setting in (43).
6 Dependence of Hamiltonian coefficients and on the cluster potential and binary orbit
All of the information about the effect of the tidal potential on secular dynamics of the binary is contained in the two crucial quantities and , which are constructed from the time-averages and , see equation (41).
The quantity is a direct measure of the strength of the potential. Its influence on the dynamics is trivial: it enters the problem only as a proportionality constant of the Hamiltonian (equation (42)), and therefore merely sets the (inverse of the) secular timescale. In addition, is also a fairly intuitive quantity: if a binary is in a strong tidal potential we expect it will have a large .
The meaning of is less intuitive than although its influence upon the system is quite profound. In Paper II we will see that the phase portrait of the Hamiltonian undergoes several bifurcations as we change the value of , altering the dynamics completely. In particular, we show that there are four qualitatively different regimes — (i) , (ii) , (iii) , and (iv) . The value of is so important because, for instance, high-eccentricity excitation is ubiquitous for binaries in regime (i), whereas it is much less readily achieved by binaries in regime (ii). Hence, most effort in this section will be directed towards understanding which cluster potentials and outer binary orbits give rise to which values of . So far we have seen that for any orbit in a Keplerian potential, and that for epicyclic orbits in a thin disk. In this section we explore in more detail how the values of (and ) depend on the form of the background potential and the binary’s outer orbit within it.
We start by stating some general properties of and in §6.1. We then discuss the behavior of these parameters in certain spherical (§6.2) as well as axisymmetric (§6.3) potentials.
6.1 General properties of and
Writing down Poisson’s equation in cylindrical coordinates
[TABLE]
and using equations (33)-(34) & (41), one can easily show that in a general axisymmetric potential
[TABLE]
where is the cluster density in the vicinity of the binary time-averaged over many outer orbital periods. Then, defining the dimensionless ratio
[TABLE]
we can write quite generally as
[TABLE]
The function is plotted in Figure 2.
In principle there are no limits on the values can take, although in practice, achieving values of less than (and hence ) may require rather contrived orbits. An example of such an orbit with and is given in Appendix D (see Figure 13). Note that necessarily implies that , see equation (46).
Somewhat stronger statements can be formulated for realistic spherically symmetric cluster potentials, as we show in Appendix D. In particular, one can demonstrate that in such potentials , , and . In non-spherical potentials negative values of become possible for certain binary orbits as we will show in §6.3.
It is instructive to consider the values of for some specific potentials .
- •
In the case of a Keplerian cluster potential, i.e. a spherical point mass potential with vanishingly small density outside the centre, one has , , and (see §5.1).
- •
In a spherical harmonic potential, symmetry dictates that so that and (see §3.1.1).
- •
In a spherical cluster with a cusped density distribution with (having finite mass at the centre) we have , see Appendix D.1.
- •
In a thin galactic disk, assuming that dominates over other spatial derivatives in Poisson’s equation, one has ; hence we find and (see §5.2).
- •
In the opposite limit of a ‘cylindrical’ (or highly prolate) potential with no -dependence, one has , and .
The values (or ranges) of and for these and some other types of cluster potential are summarized in Table 4. We stress again that even though applying the ‘axisymmetric annulus’ approximation gives the correct results for Keplerian and spherical harmonic potentials, this is a mathematical coincidence unless the outer orbit of the binary in these potentials is circular (see §3.1.1 & 5.1).
6.2 Behavior of Hamiltonian characteristics in some spherical potentials
In spherical potentials the values of that enter and are computed using the analytic expression (39), which for a fixed potential depends only on the peri/apocentre of the binary’s outer orbit . We can define the outer orbit’s generalised semi-major axis and generalised eccentricity in terms of the peri/apocentre as
[TABLE]
These reduce to the usual orbital elements in the case of a Keplerian potential. These variables fully characterize the outer orbit of the binary in a given spherical potential.
In any spherical potential with scale radius and total mass we can also construct the dimensionless parameter ; this normalization arises because is constructed from the second derivatives of the potential, which are of order333Note that is roughly , the characteristic azimuthal orbital period of the binary around the cluster, so that . , see equation (41). This allows us to estimate
[TABLE]
Both and are dimensionless numbers which, for a given potential, depend only on and . In the following we will explore the dependence of (rather than , which also depends on the cluster mass and size) and on the shape of the potential and the binary orbit in it.
We use the following spherically symmetric potentials:
(i) the isochrone potential (which has a constant-density core and half mass radius )
[TABLE]
(ii) the Plummer potential (also has a core and )
[TABLE]
(iii) the Hernquist potential (has no core, and )
[TABLE]
(iv) the NFW potential (has no core and has a divergent mass profile)
[TABLE]
The NFW potential arises from a density distribution
[TABLE]
In equation (53) the quantity is not the mass of the model (which is formally infinite), just a constant with units of mass.
For illustration, in Figure 7 (left panels) we show three examples of numerically integrated orbits in some of these potentials. (Each orbit was integrated for azimuthal periods; the first few periods are highlighted in red). The first two (‘I’ and ‘II’) orbit the isochrone potential (50), which has a constant density core for . The third (‘III’) orbits the Hernquist potential (52), which is coreless. In Table 1 we list the peri/apocentre , semi-major axis , eccentricity , azimuthal period , and the values of and calculated using equation (39). We also provide values of obtained by direct averaging of along each numerically integrated outer orbit (see Appendix F), to which we will return when discussing the validity of the axisymmetric averaging approximation in §7.
6.2.1 Behavior of
In Figure 3 we plot in the plane for the potentials (50)-(53). We see that is a strong function of but a weaker function of . The dependence on emerges predominantly for orbits with ; it is rather weak at all for orbits with , where is the scale radius of the potential in question. The difference in radial behavior between different potentials is most pronounced for orbits with . In this region there is a sharp increase in in the uncored (Hernquist and NFW) potentials, but a much shallower gradient in the cored potentials (isochrone and Plummer).
We can make the comparison more quantitative by examining the radial profile of for circular outer orbits (of radius and eccentricity ). Then is a function of only, and is plotted in Figure 4. We see that becomes significantly larger than for in the case of non-cored potentials, but reaches a maximum of in the isochrone case. For those potentials with finite total mass we can construct the density-weighted average
[TABLE]
still assuming a circular outer orbit. We find and for isochrone, Plummer and Hernquist potentials respectively. The isochrone model has by far the smallest .
6.2.2 Behavior of
Figure 5 shows in the plane for the same four potentials. We see that for in cored potentials. For the coreless potentials is always positive, as expected. We see that for , the value of is quite sensitive to the outer orbit eccentricity in all four potentials. At fixed , increasing corresponds to a decrease in . For the cored potentials this is because high- orbits start probing the cluster core where the potential is roughly spherical harmonic (for which is effectively zero, see §3.1.1 and §6.1), which tends to lower .
Meanwhile, increasing at fixed tends to increase . At large all finite mass potentials reduce to a Keplerian potential for which . In the NFW potential, the profile is shallow because the potential decays slowly, namely as for . Hence it never becomes sufficiently Keplerian to reach .
To better illustrate this convergence at large , in Figure 6 we show for circular outer orbits in the same four potentials as in Figure 5. We see that the convergence does occur for all potentials that are asymptotically Keplerian as , although in some cases one has to go to radii to observe it satisfactorily. Additionally, at very small radii the NFW density profile can be approximated as a power-law cusp, , see equation (54). Using the result listed in §6.1 (and derived in Appendix D.1) we expect to find as , and indeed this is reflected in Figure 6.
We note that some of the orbits in the potentials (50)-(53) will have commensurable (or almost commensurable) radial and azimuthal frequencies. For these orbits, i.e. at some points in space, equation (39) is not valid, because its derivation relies upon orbits densely filling an axisymmetric annulus, see §4.2. This is particularly true of potentials with a core at small , where the potential is close to harmonic (c.f. §7). Nevertheless, Figures 3 and 5 give a good idea of how and change as we consider different orbits within the cluster.
6.3 Behavior of Hamiltonian characteristics in axisymmetric potentials
For axisymmetric potentials it is difficult to make rigorous mathematical statements about and . In Appendix D.2 we show that in this case, in principle, can take any value ; however, to achieve extreme negative values of , or , may require very unusual orbits. (Two such examples are given in Appendix D.2). Meanwhile, in this section we focus on the most typical orbits in axisymmetric potentials via numerical examples. The and values in this section are calculated numerically using the procedure outlined in Appendix F, and are therefore denoted .
We will use two axisymmetric potentials in our numerical examples. The first is the flattened power-law potential (Evans, 1994):
[TABLE]
where is the central potential, is a core radius and is the oblateness parameter: corresponds to an oblate potential which can be used to model elliptical galaxies and galactic bulges (Evans, 1994). The natural definition of in this case is . We choose and , meaning that this potential is only slightly flattened. One can derive a number of useful analytical results in such weakly non-spherical potentials; we defer this investigation to a future study. Here we simply demonstrate that even in the case of a weakly flattened potential large departures from the behaviour typical for purely spherical potentials described in §6.2 become possible.
The other potential we will use is the Miyamoto-Nagai potential (Miyamoto & Nagai, 1975):
[TABLE]
where is the scale length and is the scale height. As one changes the value of , the Miyamoto-Nagai potential smoothly transitions from the Kuzmin potential of a razor thin disk () to the spherical Plummer potential frequently used to model globular clusters () (Binney & Tremaine, 2008). In Figure 8 we plot contours of constant density in the plane for two Miyamoto-Nagai models used in this paper, namely and . The natural definition of in this potential is .
6.3.1 Orbits in the midplane of an axisymmetric potential
The simplest non-spherical case to consider is when the binary’s outer orbit is confined to the midplane of an axisymmetric potential. Then still traces a planar rosette with a fixed peri/apocentre just as in the spherical case, so we can easily compute and as in §4.3. In Appendix C we show how to compute in the case of epicyclic outer orbits in a disk-like potential. We find , where is the vertical epicyclic frequency at the guiding radius; therefore .
In fact, we already deduced in §6.1 that will hold for any orbit which is confined to the plane of a very thin axisymmetric disk. This follows from the fact that the curvature of the potential is by far greatest in the direction at any given position in the disk, so that . Then and so . In Figure 9 we confirm this prediction using three very different orbits in the midplane of a thin () Miyamoto-Nagai potential. The values are (a) , (b) and (c) , all very close to .
6.3.2 Orbits that are far from coplanar
As we show in Appendix D, we always have in realistic, finite-mass spherical potentials. For to fall below zero the potential must be non-spherical, but also, according to definitions (41), the outer orbit of the binary must have . Qualitatively, this implies that the average ‘radial curvature’ of the potential over the orbit needs to be greater than the average ‘vertical curvature’. This is not going to be the case while the orbit is confined near a single plane, as we have just seen. However, this situation is naturally realised in potentials that are highly prolate in the direction (asymptotically ‘cylindrical’, with ). In such potentials (see §6.1). Also, to probe the negative regime we can consider orbits in non-spherical potentials that make large excursions ‘out of the plane’, i.e. in the direction.
This is demonstrated in Figure 10, in which we plot four Orbits (‘IV’-‘VII’) in the flattened power-law potential (56) with and . These four Orbits are initiated with exactly the same initial conditions except for their initial azimuthal velocity ; the full details of the initial conditions, as well as the resulting and values, are given in Table 2. In the top row of Figure 10 we have Orbit (IV), with initial . Orbit (IV) is certainly not planar but the typical excursions in are fairly small compared to the excursions in . As a result the value is less than but still significantly greater than zero. As we move down the page we decrease the initial azimuthal velocity each time, so that Orbits (V)-(VII) initially have respectively (while keeping all other initial conditions the same). The radial excursions decrease as the initial azimuthal velocity decreases, until they become comparable to the vertical excursions. Eventually moves below zero in Orbit (VII), see Table 2. The values grow as we move down the page since the binary samples a stronger potential when it is closer to the origin.
For our final set of examples we compare four more Orbits (‘VIII’-‘XI’) in the Miyamoto-Nagai potential (57) with , the intial conditions for which are given alongside their resulting values in Table 3. At large distances this potential is significantly flatter than the flattened power-law potential. Orbits (VIII)-(XI) are plotted in the left hand column of Figure 11. Each Orbit has exactly the same initial conditions except we change the initial vertical coordinate , using and respectively. Increasing the initial thickens the orbit. Orbit (VIII) (top row, initial ) is almost coplanar and has . The value decreases as move down the page, reaching a minimum value of for Orbit (XI) (bottom row, ) which is thicker vertically than it is radially. Meanwhile, as we move from top to bottom the value decreases because the orbit spends more time away from the midplane where the tidal potential is strongest. An even more extreme orbit in this potential, with the same initial conditions except (resulting in ), is presented in Figure 12.
Finally, note that Orbit (VIII) is very similar in appearance to the orbit in Figure 9a: both are roughly epicyclic, so they should obey equations (75) and (76). The difference between them is that in the case of Orbit (VIII) the potential felt by the binary is significantly less flattened, since this Orbit resides predominantly in the quasi-spherical core of the potential (). As a result, does not dominate over and hence ends up being significantly less than (unlike the orbit in Figure 9a which explores much more flattened version of the Miyamoto-Nagai potential with ).
7 Validity of secular Hamiltonian
In §6 we focused on understanding the typical values of for various types of orbit in different potentials. However the Hamiltonian (42) is only valid if the symmetry conditions (33)-(36) for the time-averages are satisfied. In addition it is reasonable to require that converge to fixed values (say to within a few percent) on timescales significantly shorter than the timescale for secular evolution (which will be derived in Paper II). In this section we check the validity of these assumptions numerically. The procedure for calculating numerically is given in Appendix F.
7.1 Spherical potentials
In a spherical potential, orbits that have non-commensurable frequencies densely fill an axisymmetric annulus. If this is true then the time-averaged coefficients should obey the symmetry properties (33), (35).
To verify this we use Orbits (I)-(III). Their initial conditions are given alongside their values in Table 1. The right panels in Figure 7 show the corresponding running average (from to current time) of numerically computed . As the number of completed orbits grows, the time-averaged derivatives of the potential tend to converge towards fixed values.
Orbit (I) has rather large semi-major axis and small eccentricity, so that it stays far from the core at all times, filling its annulus densely. The ‘axisymmetric annulus’ approximation works very well in this case, so the (semi-)analytic and numerically computed values agree: and to within accuracy.
We have picked rather extreme examples in Orbits (II) and (III) in order to demonstrate behaviour of orbits that are both very radial and tightly bound near the centre of the cluster. Orbit (II) spends a lot of time in the isochrone potential’s constant density core where its frequencies are almost commensurable (); as a result it precesses slowly, so that there are unfilled gaps left in its annulus even after . This issue does not arise in the uncored Hernquist potential, so Orbit (III) fills its annulus more efficiently than Orbit (II). Nevertheless, the axisymmetric approximation is still very successful in both cases, with a maximum discrepacy of arising between the and values of Orbit (II).
However, we notice that while the converged symmetry properties of the (see equations (33), (35)) are well established after for Orbits (I) and (III), they are less well established for Orbit (II) even at . Again this is because Orbit (II) does not fill its annulus efficiently. This can be problematic because if the fail to converge on a timescale shorter than the secular evolution timescale, the doubly-averaged theory can break down, as we will see in Paper II.
7.2 Axisymmetric potentials
In a (non-spherical) axisymmetric potential, orbits that have non-commensurable frequencies densely fill an axisymmetric torus and so the time-averaged coefficients should obey the symmetry properties (33), (35), and (36). To verify this numerically we use Orbits (IV)-(VII) in the flattened power-law potential (56) with and (see Table 2 and Figure 10), and Orbits (VIII)-(XI) in the Miyamoto-Nagai potential (57) with (see Table 3 and Figure 11).
Some features of the convergence plots are similar to the spherical case. For example, in Figure 10 the derivatives converge rather slowly in the bottom panel because the Orbit (VII) fills its torus rather sparsely, owing to the large fraction of time it spends in the almost-harmonic potential of the core.
Note that the rightmost columns in Figures 10 and 11 also show the convergence of and . This is different from Figure 7 since now we are dealing with non-planar orbits so that these derivatives are no longer identically zero. Although the corresponding time-averages do indeed converge to zero in all of our axisymmetric examples as expected, in most cases their convergence takes significantly longer than that of the other coefficients. This is what we would expect from looking at the dependence of equations (4.1)-(31): the derivatives , and fluctuate twice as rapidly with respect to compared to . Slower convergence of and seems to be especially apparent for strongly non-coplanar orbits, i.e. orbits which make large excursions in the direction. Orbits that inefficiently fill their torus (i.e. on timescales longer than the secular evolution timescale) can render the doubly-averaged theory inaccurate, as discussed in detail in §7 of Paper II.
8 Discussion
In deriving the secular Hamiltonian (equations (42), (43)) we relied on several approximations. First, we assumed that the outer orbit-averaging procedure used for computing potential derivatives converges rapidly when compared to the timescale for secular evolution. The rate of convergence of various components was explored in §7. In Paper II we will use direct numerical integrations of binaries orbiting in stellar cluster potentials to study how well this double averaging procedure works in practice.
Second, we truncated our expansion of the tidal Hamiltonian in §2.1 at the quadrupole order. However, studies of the LK mechanism have shown the importance of higher order — ‘octupole’ — terms for the dynamics of triples in certain situations (Lithwick & Naoz, 2011; Li et al., 2014). This raises a question of whether octupole terms can be important for the secular dynamics of binaries in external tidal fields. While we derive octupole-level corrections to our doubly-averaged Hamiltonian in Appendix E, in practice they are unlikely to be important. This is because in realistic situations the ratio of the semi-major axis of the inner binary orbit ( AU) is much smaller than the size of its outer orbit ( pc, comparable to the cluster size), rendering the timescale on which octupole-level effects may manifest themselves too long (we will see in Paper II that a characteristic timescale of secular evolution driven by quadrupole terms in a typical globular cluster is at least tens of Myrs).
Third, our calculation assumes a spatially smooth and time-invariant tidal potential. This approximation neglects the granularity and stochastic variability of the cluster potential caused by encounters with other stars, which are very important in dense environments of clusters (Heggie, 1975; Hut & Bahcall, 1983; Hut, 1983; Heggie & Rasio, 1996; Collins & Sari, 2008). The cumulative effect of a large number of such encounters is what eventually contributes to the smooth tidal field of the cluster (Collins & Sari, 2010); thus, one hopes that in the long run our framework should provide a qualitatively accurate picture of binary evolution in clusters. Nevertheless, in the future we plan to extend our calculation by including the effects of individual stellar encounters on evolution of the binary inner orbit (Weinberg et al., 1987). We will also explore the role of strong encounters (responsible for the formation and disruption of binaries in clusters, Heggie 1975) in resetting the whole course of secular evolution of the binary.
An effect that can modify the binary’s outer orbit in a stellar cluster is resonant relaxation. Rauch & Tremaine (1996) showed that in quasi-Keplerian systems (such as nuclear clusters dominated by a central super-massive black hole) angular momentum is efficiently exchanged between stellar orbits that have commensurable frequencies. When applied to a binary in a quasi-Keplerian cluster, precession of the binary’s outer orbit due to resonant interactions with other stars (so-called ‘vector resonant relaxation’) can alter its inclination relative to the inner orbital plane, potentially bringing an initially low-inclination binary into a high-inclination regime and triggering LK oscillations (Hamers et al., 2018). While this effect has not been explored for binaries in non-Keplerian potentials, vector resonant relaxation can indeed operate in non-Keplerian systems such as globular clusters (Meiron & Kocsis, 2018), as can more general forms of resonant relaxation allowing for exchange of both angular momentum and energy of the outer orbit (Hamilton et al., 2018).
Additionally, the fact that a binary is typically heavier than the average star in a cluster means that it will tend to sink towards the centre of the cluster via dynamical friction (Binney & Tremaine, 2008). Moreover, the global properties of the cluster itself may evolve as a result of two-body relaxation leading to core collapse. All of these effects can change the values of and for a given binary over long time intervals. We will explore their impact upon binary evolution in future work.
Finally, an important assumption that lies at the foundation of our time-averaging procedure is that different frequencies characterizing binary motion in the cluster are not commensurable with each other (see §4.2). If this condition is violated, the outer orbit no longer fills an axisymmetric torus inside the cluster uniformly in azimuth, rendering the equations (33)-(36) invalid. This issue is addressed in more detail next.
8.1 Commensurable frequencies
Orbits in realistic spherical potentials obey the following relation between the radial () and azimuthal () frequencies (Binney & Tremaine, 2008):
[TABLE]
Thus, in spherical potentials there are infinitely many rational values of in the interval . However, even the orbits with rational values of should still be described (at least roughly) by the ‘filled annulus’ approximation as long as the integers . For these systems, equation (42) should be approximately valid.
Of course, a small number of low-order resonant points in the plane will have with integer a few. Binaries on these outer orbits will not satisfy the axisymmetric averaging approximation. In particular, is always rational in the harmonic and Keplerian potentials. If the potential is purely harmonic we can treat it as in §3.1.1. Moreover, we show in Appendix B that Keplerian potentials are perfectly described by our doubly-averaged formalism with . However it should be stressed that we have not used the axisymmetric averaging approximation in either of these cases: the harmonic potential just happens to be effectively axisymmetric after single-averaging, and the Keplerian potential is known to be axisymmetric under double averaging to quadrupole order. In neither case do orbits ‘fill their annulus’444The Keplerian potential is not axisymmetric to octupole order, see Appendix E..
Another problematic case is when the binary experiences a potential that is almost harmonic or almost Keplerian, so that the outer orbit precesses apsidally, but not quickly enough to fill a circular annulus on a secular timescale and thereby qualify for an axisymmetric treatment. For example, orbits that spend a lot of time in a constant-density core of the cluster potential experience an almost harmonic potential and so tend to fill their annulus very slowly (see Figure 7c).
8.2 Relation to previous work
Many previous studies have looked at secular evolution of binaries perturbed by external potentials. The effect of an arbitrary quadrupole perturbation upon a binary has been briefly considered by Mikkola & Nurmi (2006). In particular, their equation (20) gives the quadrupole potential experienced by a binary in a star cluster consisting of a large number of point masses . Our perturbing Hamiltonian is recovered from their result in the mean-field limit (i.e. by replacing the exact potential of the cluster, , with the smooth potential ). However, Mikkola & Nurmi (2006) did not explicitly convert to orbital elements, perform any averaging, or develop any secular theory as we do here. In a similar vein, a short paper by Katz & Dong (2011) considered the secular dynamics of a binary perturbed by a generic quadratic potential and included axisymmetric potentials as a special case. They did convert to orbital elements but did not go much further; in particular they did not provide any prescription for computing the coefficients of the averaged perturbing potential.
Studies of tidal effects of the Galactic disk on wide binaries (Heisler & Tremaine, 1986; Byl, 1986; Yabushita, 1989) represent an important limit () of our general theory, see §5.2 and Appendix C. Since HT86, Galactic tides have been included in many studies of cometary orbits (e.g. Matese & Whitman 1989; Matese & Whitmire 1996; Breiter et al. 1996; Wiegert & Tremaine 1999; Brasser 2001; Fouchard 2004; Fouchard et al. 2005, 2006; Breiter et al. 2007), as well as planetesimal orbits (e.g. Higuchi et al. 2007).
Veras & Evans (2013a) considered a very general form of the perturbed two-body problem, allowing for both position- and velocity-dependent tidal forces to act upon the binary. Their equations (25)-(29) are more general versions of our singly-averaged equations (c.f. our singly-averaged Hamiltonian (19)), and our equations are recovered if one sets the velocity-dependent forces to zero. However they did not derive any analogues of our doubly-averaged equations. Veras & Evans (2013b) noted that Galactic forces may impact the evolution of exoplanetary systems around stars near the bulge of the Galaxy where the Galactic tide is much stronger than it is in the Solar neighbourhood.
Another interesting and obvious limit of our theory, — which is, however, rather distinct, see §5.1 and Appendix B — has been explored in numerous studies of Lidov-Kozai dynamics (Lidov, 1962; Kozai, 1962; Fabrycky & Tremaine, 2007; Naoz, 2016) and its extensions. One interesting extension to the LK problem was made by Petrovich & Antonini (2017), who considered the effect of an axisymmetric (non-spherical) nuclear cluster potential on compact-object binaries that are themselves orbiting a central super-massive black hole (SMBH). The non-spherical part of the cluster potential was considered to drive nodal precession of the binary’s quasi-Keplerian outer orbit around the SMBH (continuously changing the relative inclination in the triple system composed of the binary and SMBH, which is important for the operation of LK cycles in this sub-system), while the dominant spherical part drove apsidal precession of the outer orbit. Our doubly-averaged formalism covers this problem in the case where the characteristic timescales for nodal and apsidal precession of the outer orbit are much shorter than the secular timescale, so that the outer orbit fills its torus. Our singly-averaged equations cover it in all cases. However, unlike Petrovich & Antonini (2017), we also account for the direct effect of the tidal torque due to the potential of the cluster on the orbital elements of the inner orbit of the binary.
Several authors have considered the problem of a star in orbit around a SMBH in a nuclear cluster (e.g. Sridhar & Touma 1999; Ivanov et al. 2005; Löckmann et al. 2008; Šubr et al. 2009; Chang 2009; Haas et al. 2011; Merritt 2013; Li et al. 2015; Iwasa & Seto 2016, 2017). The SMBH-star system effectively forms a binary. The binary’s Keplerian orbital elements may then evolve on secular timescales due to some combination of (i) the mean field nuclear cluster potential, (ii) GR pericentre precession, (iii) an infalling massive black hole on a slowly decaying circular orbit, (iv) a circumnuclear ring of material, etc. While this class of problems is reminiscent of our work, they are not quite the same because in the SMBH-star case the barycentre of the binary does not move, and so there is no clean separation between single- and double-averaging. In some cases, e.g. for a binary that sits at the centre of a spherical cusp, there is even no well-defined tidal expansion of the potential. Instead, averaging of the potential is incorporated into the averaging over the stellar Keplerian orbit around the SMBH, which is different from our approach.
One of the most interesting recent applications of secular dynamics has been the possibility of substantial shrinking of binary orbits by LK cycles with dissipative effects. Such applications include the origin of hot Jupiters (Fabrycky & Tremaine, 2007; Naoz et al., 2011; Petrovich, 2015; Hamers, 2017), formation of blue stragglers (Perets & Fabrycky, 2009; Knigge et al., 2009), white-dwarf mergers (Thompson, 2011; Katz et al., 2011; Toonen et al., 2018), and compact-object binary mergers in globular or nuclear star clusters (Antognini et al., 2014; Rodriguez et al., 2015; Naoz, 2016; Silsbee & Tremaine, 2017; Petrovich & Antonini, 2017; Leigh et al., 2018). Binary evolution driven by cluster tides explored in our work represents a different evolutionary scenario that may lead to similar outcomes (without invoking a nearby third companion). This possibility is explored in a separate study (Hamilton & Rafikov, 2019b).
9 Summary
This work explores secular evolution of binary systems orbiting in axisymmetric stellar clusters. We derive a Hamiltonian describing this evolution for an arbitrary form of the smooth cluster potential, average it over the (inner) orbital motion of the binary, and then average it again over the (outer) orbit of the binary around the cluster assuming the potential is axisymmetric. Our results can be summarized as follows.
- •
When the doubly-averaged Hamiltonian is cast in dimensionless form, all the information about the tidal potential is contained in a single dimensionless parameter , which depends on the background potential and the orbit of the binary in the cluster. The value of this parameter determines the phase portrait of the binary evolution, which is explored in a companion study (Paper II).
- •
The timescale of secular evolution is set by another (dimensional) parameter , which, like , depends on the cluster potential and the binary’s outer orbit.
- •
In certain cases and can be calculated (semi-)analytically. Such cases include (a) orbits in spherical potentials, (b) orbits confined to the midplane of an axisymmetric potential, and (c) epicyclic orbits near the midplane of an axisymmetric potential. We demonstrate how our calculations reproduce the known results for Lidov-Kozai evolution, evolution of Oort Cloud comets due to the Galactic tide, and so on.
- •
We map out the behavior of and in different spherically symmetric potentials as a function of size and radial elongation of the binary orbit. We find that is small in the central regions of clusters with cored potentials, but tends to unity in clusters with finite mass as the orbit size increases. In general, in realistic finite-mass spherical potentials.
- •
In general axisymmetric potentials, can easily attain negative values, in particular for highly inclined (i.e. non-coplanar) orbits.
- •
The accuracy with which our doubly-averaged Hamiltonian characterizes binary evolution deteriorates for highly non-coplanar orbits in axisymmetric potentials. Commeurability of orbital frequencies in the cluster potential may also present a problem for application of our theory at a quantitative level.
These results will be extensively used in Paper II, where we systematically explore the dynamics of binaries driven by the tidal field of clusters for different values of . There we verify numerically the predictions of the secular theory based on our doubly-averaged Hamiltonian, and derive timescales for secular eccentricity oscillations. In future papers in this series, these calculations will be applied to understanding secular evolution of binaries in dense clusters and exploring their relevance for the formation of compact-object mergers (Hamilton & Rafikov, 2019b), blue stragglers, and so on.
Acknowledgements
We thank John Magorrian for pointing out some literature and Eugene Vasiliev, Fabio Antonini and Almog Yalinewich for useful comments. The numerical orbit integration was done with galpy (http://github.com/jobovy/galpy, Bovy 2015). CH is funded by a Science and Technology Facilities Council (STFC) studentship.
Appendix A The singly-averaged Hamiltonian in orbital elements
In this Appendix we give the full algebraic expressions in terms of orbital elements for the terms that enter the singly-averaged Hamiltonian (19). They are
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Appendix B Recovering the Lidov-Kozai quadrupole Hamiltonian
To derive the LK Hamiltonian we take equation (19) and average it over time using , and with describing a Keplerian ellipse with the focus at the origin.
First, it is obvious that for the this potential, . Then we must average the right hand sides of equations (4.1)-(29) which requires that we average the quantities and .
Without loss of generality we may choose the argument of pericentre of the ellipse to be zero, so that , the true anomaly. Then with being the eccentric anomaly, so for an arbitrary function we can write
[TABLE]
and we can convert between and using
[TABLE]
The answers are
[TABLE]
Using these identities, we find a remarkably simple relation between the time-averaged coefficients:
[TABLE]
and as expected . Note that the regime is exactly that of an axisymmetric perturbing potential (see e.g. Katz et al. (2011)). The resulting perturbing Hamiltonian will therefore be the same as (42), with the added simplification that and . Making these substitutions we find
[TABLE]
This is precisely the dimensionless test particle quadrupole Lidov-Kozai Hamiltonian (Lidov, 1962; Kozai, 1962; Kinoshita & Nakai, 2007; Lithwick & Naoz, 2011; Antognini et al., 2014). It describes the secular evolution of a hierarchical triple system in which the outer orbit dominates the angular momentum budget.
Appendix C Epicyclic orbits
In this Appendix we look at the behavior of and in the case of a binary performing epicyclic motion in an axisymmetric disk, to connect with the results of Heisler & Tremaine (1986) (HT86), who calculated the secular effect of the Galactic tide on the Oort Cloud comets.
Let the guiding centre of the binary’s orbit be a circle of radius in the plane of the potential. The potential experienced by the binary can then be approximated as555We implicitly assume that the disk is symmetric about its midplane, , so that at . Otherwise the binary would not remain in the midplane anyway.
[TABLE]
Using this expression and equations (33)-(34) it is easy to show that
[TABLE]
while all other ; here
[TABLE]
is the angular frequency of the guiding centre, while
[TABLE]
are the radial and vertical epicyclic frequencies of respectively. Hence
[TABLE]
Near the midplane of a galactic disk, and in particular in the HT86 case of the solar neighbourhood of the Milky Way, it is almost always the case that . Thus to a very good approximation and . Plugging these results into our doubly-averaged Hamiltonian (42), written in Delaunay variables, we find
[TABLE]
where we have eliminated in favour of the density in the Solar neighbourhood using Poisson’s equation . This is precisely the Hamiltonian arrived at by HT86 (c.f. their equation (14)) when considering the effect of the Galactic tide on the Oort Cloud comets.
Appendix D Signs and sizes of and
Here we provide some technical details about the statements on the signs and values of and made in §6.1. Also, in Table 4 we summarize some information about these coefficients for certain potentials. We provide two examples of orbits with extreme values of .
D.1 Spherical potentials
Consider a spherically symmetric potential , where is the sphericl radius. According to our convention, the outer orbit of the binary always lies in plane of the associated cylindrical coordinate system. Then it is a simple matter to show that
[TABLE]
Using these conversions as well as equations (33), (34), (41) we find
[TABLE]
We can now prove that for (almost) any realistic spherical potential, and thereby show that for such systems . In a spherical potential we have
[TABLE]
where is the cluster mass enclosed inside radius . Also, Poisson’s equation reads
[TABLE]
allowing us to rewrite equation (80) as
[TABLE]
Since and at all radii, this inevitably results in .
Finally, since
[TABLE]
equation (81) can be rewritten as
[TABLE]
For any spherical system in which the density is a non-increasing function of radius, for any and hence .
Let us now focus on spherical systems with . If the cluster has a constant density core, then as and so (equation (86)). Hence if orbits entirely inside the constant density region, . A potential without a core will always have a non-zero value of for orbits at small radii.
At the other extreme, as we have , and usually the enclosed mass const (although see below for potentials arising from power-law cusp density profiles). Hence where is the total mass of the cluster, and in this limit we get . Thus an orbit that spends its time exclusively at very large radii compared to the scale radius of the cluster will have . This is precisely the Lidov-Kozai limit: for potentials that are Keplerian as (i.e. those with finite mass), orbiting far from the core is equivalent to orbiting a point mass at the origin.
Finally, for any orbit in a spherical cluster with a power-law density cusp with (so that the mass is finite at the centre) one naturally has . Then we find from equations (84), (86) that
[TABLE]
and so and .
D.2 Axisymmetric potentials
In a general axisymmetric potential there is no constraint on how negative the parameter (defined by equation (46) and plotted in Figure 2) can be. Non-spherical potentials naturally feature regions with , especially near the poles; choosing a highly inclined (with respect to the equatorial plane of the potential) orbit with large radius so that is vanishingly small, one can drive strongly negative , thereby achieving extreme (positive or negative) values of .
In Figure 12 we give an example of an orbit with . We use the Miyamoto-Nagai potential with and precisely the same initial conditions as Orbits (VIII)-(XI) in the main text (see Table 3), except that we now take the initial coordinate to be . All three panels show of data. The large initial value of means that the orbit spends a lot of time near the poles of the potential where .
In Figure 13 we provide an example of a very polar orbit resulting in (). We use the Miyamoto-Nagai potential , and initial conditions
[TABLE]
In Figure 13a we display only the first vertical periods of the orbit in the meridional plane. When integrated for a long time, the orbit remains almost polar but precesses very slowly around the axis until it eventually fills an axisymmetric torus after a few thousand . Figure 13a shows that the convergence of the coefficients in this case is very slow and takes . In practice, unless the binary is very tight, secular theory is unlikely to work well for such an orbit. Indeed, for a relatively wide binary the secular evolution timescale is likely to be much shorter than , meaning that our assumption that the binary fills its torus (and hence the converge) in much less than a secular timescale is violated. We will explore this issue in more detail in Paper II, §7.
Appendix E Octupole Hamiltonian
The Hamiltonian derived in §2 is correct to quadratic order in , the so-called ‘quadrupole approximation’. We can attempt to derive a more accurate Hamiltonian by keeping the higher order terms in the series expansion of equation (3). The next (‘octupole’) term that we would include is
[TABLE]
We then use the fact that and to write
[TABLE]
As a result, the equation for the relative motion (equation (5)) can be written purely in terms of . The next order (‘octupole’) correction to the Hamiltonian (42) is
[TABLE]
Note that the octupole term vanishes for equal-mass binaries ().
The corresponding doubly-averaged perturbing octupole term is then simply
[TABLE]
E.1 Time-averaging over an axisymmetric torus
As in the quadrupole case, it turns out that when time-averaged over an axisymmetric torus the coefficents satisfy various symmetry properties. After a little algebra one can show that
[TABLE]
and all other except for . Hence
[TABLE]
Writing out the factors in terms of orbital elements we have
[TABLE]
[TABLE]
Equations (94)-(96) provide a general framework for accounting for the octupole contribution to the tidal Hamiltonian in an arbitrary axisymmetric potential. Note there is no dependence in the octupole Hamiltonian, so is still conserved to octupole order.
E.2 Link to the test particle octupole LK Hamiltonian
Note that one cannot recover the test particle octupole term of the doubly-averaged Lidov-Kozai Hamiltonian by putting in (94), because equation (94) is derived under the axisymmetric approximation. The time-averaged potential of a perturber on a Keplerian orbit is only axisymmetric at the quadrupole level, and the symmetry is broken by octupole terms. Instead one must integrate over the outer Keplerian orbit exactly, as in Appendix B.
In general there are independent time-averaged coefficients to consider. We choose the outer orbit to be in the plane so we can immediately eliminate four of these, . We can also choose the pericentre of the outer orbit to be on the axis without loss of generality, so that the ellipse traced by the outer orbit is symmetric under . Then all that contain an odd number of derivatives will be antisymmetric under , so their time-averages over this ellipse will vanish: . This leaves us with only three non-zero terms in the doubly-averaged octupole LK Hamiltonian:
[TABLE]
For reference we now write down the terms that make up equation (97). First we write down the necessary coefficients in terms of cylindrical coordinates and :
[TABLE]
The time-averages of these coefficients are
[TABLE]
The mean-anomaly averaged quantities are
[TABLE]
[TABLE]
[TABLE]
Plugging the results (E.2)-(101) in to (97), the resulting doubly-averaged test particle octupole Lidov-Kozai Hamiltonian is
[TABLE]
where . Equation (107) is precisely the result found in standard LK literature (e.g. Ford et al. 2000; Lithwick & Naoz 2011; Naoz 2016).
Appendix F Numerical prescription for computing time-averages
To calculate the time-averages numerically we use the orbit integrator in galpy (Bovy, 2015). Given the initial position and velocity of the binary’s outer orbit around the cluster, we integrate its equation of motion (11) numerically in the smooth cluster potential . We use a constant timestep so that after timesteps the time elapsed is . Then the running time-average of a quantity is
[TABLE]
In nearly all numerical examples shown in this paper we used where is the azimuthal period of , and integrated the outer orbit for approximately . The exception is Figure 13, where we used ( is the vertical period of ) and integrated the outer orbit for .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abbott et al. (2016) Abbott B. P., et al., 2016, Physical Review Letters, 116, 061102
- 2Abbott et al. (2017) Abbott B. P., et al., 2017, Physical Review Letters, 119, 161101
- 3Antognini et al. (2014) Antognini J. M., Shappee B. J., Thompson T. A., Amaro-Seoane P., 2014, Monthly Notices of the Royal Astronomical Society , 439, 1079 · doi ↗
- 4Antonini & Perets (2012) Antonini F., Perets H. B., 2012, The Astrophysical Journal, 757, 27
- 5Antonini et al. (2016) Antonini F., Chatterjee S., Rodriguez C., Morscher M., Pattabiraman B., Kalogera V., Rasio F., 2016, Astrophysical Journal , 816, 65 · doi ↗
- 6Antonini et al. (2017) Antonini F., Toonen S., Hamers A. S., 2017, The Astrophysical Journal, 841, 77
- 7Askar et al. (2017) Askar A., Szkudlarek M., Gondek-Rosińska D., Giersz M., Bulik T., 2017, MNRAS , 464, L 36 · doi ↗
- 8Bae et al. (2014) Bae Y.-B., Kim C., Lee H. M., 2014, MNRAS , 440, 2714 · doi ↗
