Varying the Horndeski Lagrangian within the Palatini approach
Thomas Helpin, Mikhail S. Volkov

TL;DR
This paper investigates the effects of varying the Horndeski Lagrangian within the Palatini approach, revealing conditions under which the resulting metric-affine theories are ghost-free and classifying their cosmological solutions.
Contribution
It provides a detailed analysis of metric-affine Horndeski theories, identifying ghost-free subclasses and exploring their cosmological implications.
Findings
Ghost-free subclass identified for linear-in-connection theories.
Classified homogeneous and isotropic cosmologies within these theories.
Non-linear connection terms generally introduce ghosts, but some can be mitigated.
Abstract
We analyse what happens when the Horndeski Lagrangian is varied within the Palatini approach by considering the metric and connection as independent variables. Assuming the connection to be torsionless, there can be infinitely many metric-affine versions of the original Lagrangian which differ from each other by terms proportional to the non-metricity tensor. After integrating out the connection, each defines a metric theory, which can either belong to the original Horndeski family, or it can be of a more general DHOST type, or it shows the Ostrogradsky ghost. We analyse in detail the subclass of the theory for which the equations are linear in the connection and find that its metric-affine version is ghost-free. We present a detailed classifications of homogeneous and isotropic cosmologies in these theories. Taking into consideration other pieces of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Varying the Horndeski Lagrangian within the Palatini approach
Thomas Helpin
Mikhail S. Volkov
Abstract
We analyse what happens when the Horndeski Lagrangian is varied within the Palatini approach by considering the metric and connection as independent variables. Assuming the connection to be torsionless, there can be infinitely many metric-affine versions of the original Lagrangian which differ from each other by terms proportional to the non-metricity tensor. After integrating out the connection, each defines a metric theory, which can either belong to the original Horndeski family, or it can be of a more general DHOST type, or it shows the Ostrogradsky ghost. We analyse in detail the subclass of the theory for which the equations are linear in the connection and find that its metric-affine version is ghost-free. We present a detailed classifications of homogeneous and isotropic cosmologies in these theories. Taking into consideration other pieces of the Horndeski Lagrangian which are non-linear in the connection leads to more complex metric-affine theories which generically show the ghost. In some special cases the ghost can be removed by carefully adjusting the non-metricity contribution, but it is unclear if this is always possible. Therefore, the metric-affine generalisations of the Horndeski theory can be ghost-free, but not all of them are ghost-free, neither are they the only metric-affine theories for a gravity-coupled scalar field which can be ghost-free.
1 Introduction
The discovery of the cosmic acceleration [1, 2] has invoked a large number of field-theory models of the Dark Energy. Most of them introduce a scalar field, as in the Brans-Dicke, quintessense, -essence, etc. theories (see [3, 4] for reviews), while the others, as for example the gravity [5, 6], although looking different, are equivalent to the theory with a scalar field. Some of these models were actually introduced long ago in the context of the inflation theory [7]. In view of this interest towards theories with a gravitating scalar field one may ask: what is the most general theory of this type described by second order equations of motion? The answer was obtained already in 1974 by Horndeski [8] (and more recently rediscovered in [9, 10]): this theory is determined by the action
[TABLE]
where, using the parameterization of Ref.[11],
[TABLE]
Here and , denote matrices with components
[TABLE]
while the brackets denote the trace, so that for example . This theory incorporates all previously studied models with a single gravity-coupled real scalar field. The coefficient functions in (1) can be arbitrary, and depending on their choice the properties of the theory can be different.
There is a special subset of the theory, sometimes call Kinetic Gravity Brading (KGB) theory [12], [13], [14] defined by the following choice of the coefficient functions:
[TABLE]
while and can be arbitrary. The speciality of this choice is that it defines theories in which the gravitational waves (GW) propagate with the speed of light around all backgrounds, as demonstrated in [12]. If the property (1.4) is not respected then the GW speed is not constant and the corresponding theories are disfavoured [15, 16, 17, 18] since the recent GW170817 event shows that the GW speed is equal to the speed of light with very high precision [19].
The Horndeski theory can be generalised to the so-called DHOST models containing higher order derivatives in the equations in such a way that the number of propagating degrees of freedom is still three [20, 21, 22, 23, 24]. However, if one restricts only to theories with second order equations of motion, then the Horndeski Lagrangian (1) is the most general one to produce such theories within the metric formulation, that is assuming the connection to be determined by the metric and the covariant derivative of the latter to vanish.
In what follows we shall study theories obtained from the Horndeski Lagrangian (1) without imposing the metricity condition. Specifically, we adopt the Palatini approach and vary the Lagrangian independently with respect to the metric , the scalar field , and the connection that we assume to be symmetric, . The equations for are algebraic111Unless ; see the remark after Eq.(1.17). hence the connection is non-dynamical, therefore the number of propagating degrees of freedom is the same as in the original Horndeski theory, unless the ghost emerges.
In fact, we require the connection to be symmetric just for simplicity, in order that the metric-affine theory be maximally close to the original Horndeski theory. Relaxing this condition would give torsion-full generalisations of the theory, and torsion should certainly be taken into account within the most-general metric-affine setting [25, 26, 27]. At the same time, the vector part of the torsion can be gauged-away in projectively-invariant theories [28], whereas requiring the theory to be projectively invariant seems to be necessary for removing the ghost [29]. In this work we shall mainly focus on the part of the Horndeski theory, whose metric-affine version contains the torsion in the vectorial form that can be gauged away. As a result, choosing the connection to be torsionless from the very beginning does not actually restrict the generality of our analysis. This issue will be further discussed in [30] (see also [31]).
The Ricci tensor in (1) is viewed as function of ,
[TABLE]
the Ricci scalar and the Einstein tensor in (1) are understood as
[TABLE]
while the covariant derivatives should be computed with respect to ,
[TABLE]
Making these replacements in (1),(1.3) gives us the metric-affine version of the original Horndeski Lagrangian,
[TABLE]
and this defines the Palatini action
[TABLE]
Of course, this reduces back to the original Horndeski action if the connection is set to be Levi-Civita,
[TABLE]
Any additional matter can be included into the theory by adding to the action the matter term , where collectively denotes all matter fields. If the matter couples also to the connection and hence the matter term depends on as well, this may generate a non-zero torsion.
One should say that this metric-affine version of the original theory is not unique. For example, adopting instead of (1.7) the definition
[TABLE]
would give a different Lagrangian and a different action that also reduce back to and when . However, varying and would not give the same equations. It is clear that the two definitions (1.7) and (1.11) differ from each other by the term containing the covariant derivative of the metric – the non-metricity tensor
[TABLE]
which does not vanish in general. Using this tensor one can construct generalisations of the Lagrangian:
[TABLE]
where
[TABLE]
Here , , can depend on and the dots stand for all possible terms containing higher powers of and higher derivatives of . As a result, there can be infinitely many different versions of the Palatini Lagrangian. All of them coincide when the non-metricity vanishes, but otherwise they lead to different theories. This ambiguity in defining the theory may actually be important for removing the ghost. However, below we shall be considering only the simplest version of the theory for which it is sufficient to choose
[TABLE]
More complex cases will be reported separately [30] (see also [31]).
Therefore, in what follows we shall vary the Palatini action defined by (1.9). We do not expect to get the same equations as those obtained from the metric action , since already for the theory the metric formulation and Palatini formulation give different results [5]. The same is expected to happen also for the Horndeski theory.
We find that the resulting theory obtained from can show quite different properties, depending on whether the Lagrangian respects or not the KGB condition (1.4). If this condition is respected then but the non-metricity contributions can be grouped into additional terms in the effective energy-momentum tensor of the scalar field, and the field equations can be represented in the form containing only the ordinary metric covariant derivatives. Remarkably, these equations turn out to be identically the same as those for a metric KGB theory corresponding to a specific choice of the coefficients in the Lagrangian. Therefore, the Palatini approach yields in this case a theory which is still in the same Horndeski class.
This is a non-trivial result. It can be traced to the fact that the connection in our theory is non-dynamical and can be integrated out, which transforms the metric-affine theory to a metric theory containing additional interactions [33, 32]. When the connection is integrated out, we obtain a metric theory that is again of Horndeski type but with additional scalar-tensor interactions whose structure, however, may in general be not the same as that in the Horndeski theory. As a result, this metric theory may be outside the Hordeski family and may contain ghost. It is therefore rather non-trivial that, when the procedure is applied to the KGB theory, the ghost does not appear and the additional interactions again have the KGB structure. One could say that the KGB family is “stable” under the metric-affine treatment. This, together with the fact that the GW speed is constant, confirms again the special status of these theories. More general subsets of the Horndeski family are not stable in the same sense – their metric-affine versions contain higher derivatives.
If the Lagrangian does not respect the condition (1.4) then the equations contain higher derivatives. In some cases they are not dangerous as the theory turns out to be of the DHOST type, however this is not always the case. For example, choosing
[TABLE]
and in (1.13), one finds that the ghost is absent if . In some other cases it can be removed via adjusting in (1.13) but it remains unclear if such a procedure always works.
The metric-affine versions of theories with
[TABLE]
remain by far almost totally unexplored because the connection then becomes dynamical. Specifically, the connection enters the action algebraically, apart from the terms with . Varying the latter with respect to the connection produces result of the form with constructed from and derivatives of . After integrating by parts, this gives rise to term in the equations. If then does not depend on the connection and the resulting equations are algebraic in . If then contains and therefore depends on hence contains derivatives . As a result, the equations for are algebraic if but they become differential if . The connection starts propagating in the latter case, which should considerably change the physical contents of the theory.
The rest of this text is organised as follows. In Section 2 we perform the Palatini variation of the piece of the Lagrangian respecting the KGB condition (1.4), and in Section 3 we show that the resulting equations actually correspond to one of the metric Horndeski theories. Therefore, varying the same KGB action in the metric approach and in the Palatini approach gives two different theories from the same metric KGB class. In Sections 4,5 we study their solutions to see how much these two theories differ from each other. One should stress that, although both of these models are in the same KGB class and hence their tensor modes propagate with the speed of light, the properties of the scalar mode are not necessarily the same. Cosmologies in these two models are not the same and their stability properties are different. In Section 4 we consider small perturbations of homogeneous and isotropic backgrounds, which gives us conditions for the absence of ghosts and tachyons in the scalar sector. In Section 5 we specify the subclass of theories invariant under shifts and describe all homogeneous and isotropic cosmologies in these theories. The spectrum of these solutions is surprizingly rich, and we make a comparison with the solutions previously described in the literature [12], [13], [14]. In Section 6 we briefly describe what happens if the condition (1.4) is not respected – a more detailed analysis will be reported separately [30]. We make some concluding remarks in Section 7. The Appendix contains the lengthy expression for the connection arising in the -subset of the theory.
The metric-affine formulation for the scalar-tensor theories was recently studied also in [34], [35], [36], [37], [38]. However, to the best of our knowledge, the entire Horndeski family has not been systematically analysed from this viewpoint.
2 Varying the KGB part of the Palatini action
Imposing the KGB condition (1.4), the action (1.9) reduces to
[TABLE]
with the Ricci scalar defined according to (1.5),(1.6); our signature is . We assume the connection to be symmetric, , but the Ricci tensor will not in general be symmetric, unless is a Levi-Civita connection. The other quantities in the action are the squared gradient of the scalar field and covariant d’Alembertian,
[TABLE]
Let us vary the action (2.1) independently with respect to , , and . To vary with respect to , we notice that the only connection-dependent terms in the action are and . The variation is a tensor that induces the variations,
[TABLE]
Injecting this to (2.1), integrating by parts and remembering that the metric is not necessarily covariantly constant with respect to , we obtain
[TABLE]
with
[TABLE]
The variation of the action will vanish if
[TABLE]
It follows that , which yields
[TABLE]
Taking this condition into account, Eq.(2.6) reduces to
[TABLE]
Since one has
[TABLE]
one obtains after simple manipulations the following expression for the covariant derivative of the metric,
[TABLE]
This can be resolved to obtain the connection,
[TABLE]
Here and in what follows we use the functions related to in the action via
[TABLE]
It is worth noting that the first and second terms on the right in (2.11) correspond to the Kristoffel symbols for the conformally related metric . However, the last term in (2.11) does not have the Levi-Civita structure.
Injecting the expression for to (1.5) gives the Ricci tensor,
[TABLE]
where and are the standard Ricci tensor and covariant derivative constructed from while . We note that the last term on the right in (2.13) is antisymmetric under .
Let us now vary the action with respect to . One has
[TABLE]
Injecting this to the action and integrating by parts yields
[TABLE]
where
[TABLE]
To compute the expression in the third line we set , inject to (2.7), and use (2.9) to obtain
[TABLE]
Since for any vector one has
[TABLE]
it follows that
[TABLE]
Collecting everything together, the variation of the action with respect to the scalar field is
[TABLE]
with
[TABLE]
Here and below the following two functions are used,
[TABLE]
where the prime denotes differentiation with respect to .
Varying the action with respect to the metric is straightforward and yields
[TABLE]
where
[TABLE]
with defined in (2.12) and \gamma_{\mbox{\tinyX}}=\partial_{X}\gamma, \kappa_{\mbox{\tinyX}}=\partial_{X}\kappa. The Ricci tensor is given by (2.13) and tracing it yields . One has
[TABLE]
with defined in (2.2), hence
[TABLE]
Summarizing the above discussion, the action will be stationary if and . This yields the field equations which can be rewritten solely in terms of the ordinary metric covariant derivatives. The conditions reduce to
[TABLE]
where is the Einstein tensor for while the effective energy-momentum tensor
[TABLE]
with defined in (2.22). The condition yields the equation for the scalar field,
[TABLE]
where are defined by (2) with given by (2.26) and obtained by tracing in (2.13). A direct verification shows that the differential consequence of (2.27), the covariant conservation condition,
[TABLE]
indeed follows from Eqs.(2.27)–(2.29).
3 Relation to the metric version of the theory
Let us now return to the action (2.1) and assume that is the Levi-Civita connection determined by . The action then reduces to the Horndeski action . Varying it with respect to and yields the equations
[TABLE]
where
[TABLE]
and also
[TABLE]
Surprisingly, a direct verification shows that equations (2.27), (2), (2.29), (2) of the metric-affine version can be obtained from equations (3.1)–(3) of the metric version by simply replacing in the latter
[TABLE]
with given by (2.22). Therefore, the Palatini theory derived from the action (2.1) is actually equivalent to the metric theory derived from the action
[TABLE]
with the new k-essence term
[TABLE]
The explanation of this is as follows. Let us return to the Palatini action (2.1) and inject into it the on-shell value of the connection,
[TABLE]
given by (2.11). Using and expressed by (2.13) and (2.26) then yields
[TABLE]
so that the metric action (3.5) is indeed recovered. This implies that the equations derived from both actions should coincide. Indeed, let us vary the scalar field, . This induces the variations
[TABLE]
Since the connection is assumed to have the on-shell value, one has
[TABLE]
therefore
[TABLE]
hence the scalar field equation derived from the Palatini action coincides with the one obtained from the metric action . The same applies for equations obtained by varying the metric, hence theories derived from the Palatini action (2.1) and from the metric action (3.5) are equivalent. A similar equivalence holds for all other Horndeski models with as well, because a non-dynamical connection can always be integrated out and the metric-affine theory reduces to a metric theory.
Summarizing, varying the same action (2.1) within the metric approach and within the Palatini approach yields two different theories from the same metric KGB class. In the former case one obtains the theory with coefficient functions , , while in the latter case one obtains theory with coefficients , with defined by (5.14). Both theories are ghost-free and the GW speed is equal to one. Below we shall study solutions of these two theories to see how much they differ from each other.
The change expressed by (3.6) can be viewed as a “duality relation” between different theories, and one can look for its interpretation, for example within the effective hydrodynamical description developed in Ref.[13]. If the theory is invariant under shifts then the current in (3) is conserved, which can be expressed as
[TABLE]
with the “fluid 4-velocity” , “acceleration” , “chemical potential” , “density” where the “expansion” , and with the “diffusivity” (not to be confused with our ). The right-hand-side in (3.12) can be interpreted as the “diffusion term” [13].
Now, (3.6) does not change and the diffusivity , but it changes (equation of state) and the density . If repeated many times, it changes more and more, but it becomes identity if the theory is chosen such that and hence . Unfortunately, this condition does not have non-trivial solutions in the shift-invariant theory222The relation can be realized in a non-trivial way if the theory includes non-metricity terms of the type (1.14) [31]. . The duality does not change the right-hand-side of (3.12), but since changes, the whole diffusion is affected. Eq.(3.12) assumes the standard diffusion form in the limit of small and , with the diffusion coefficient [13], but since the duality only affects higher powers of , remains invariant in this limit. The impact of the duality on the diffusion away from the weak field limit should probably be analyzed separately.
4 Stability of the scalar sector
In Section 5 we shall study cosmologies in the Palatini-derived KGB theory and compare them with the those in the metric-derived theory. Even though these two theories belong to the same metric KGB class, hence they are free from Ostrogradsky ghost and contain three propagating degrees of freedom, their scalar sector can be unstable. We therefore study in this Section small perturbations of cosmological backgrounds. Decomposing the perturbations into tensor and scalar parts, one finds, as expected, that tensor modes always propagate with the speed of light. However, depending on the background, the scalar sector may contain ghost and tachyon instabilities. Their absence requires positivity of the kinetic coefficient and sound speed squared derived in Eqs.(4.14),(4). These conditions will be analysed in Section VI.
Let us assume the spacetime metric to be homogeneous and isotropic,
[TABLE]
while the scalar field to depend only on time,
[TABLE]
The Einstein equations (2.27) of the Palatini version of the theory reduce to
[TABLE]
whose consequence is the scalar field equation (2.29). Here and the prime denotes differentiation with respect to .
Suppose one finds a solution of Eqs.(4) (examples will be given below) describing a homogeneous and isotropic background (4.1),(4.2). Consider small perturbations of this background,
[TABLE]
In the linear approximation, the perturbations fulfill the equations obtained by perturbing the background equations,
[TABLE]
The metric perturbations can be decomposed into the scalar, vector, and tensor parts via
[TABLE]
where
[TABLE]
The spatial dependence is given by the plane waves where the wave vector can be oriented along the z-axis, so that the scalar modes are
[TABLE]
the vector amplitudes are chosen as
[TABLE]
while for the tensor modes the only non-trivial components of are
[TABLE]
Inserting everything into the perturbation equations (4.5) splits them into three independent groups for the scalar, vector, and tensor modes. These equations determine the effective action which also spits into three independent terms,
[TABLE]
where the bar denotes complex conjugation. One obtains in the tensor sector
[TABLE]
where the kinetic coefficient (not to be confused with ) is always positive while the sound speed . Therefore, the gravity waves propagate with the speed of light as expected.
The analysis in the vector sector shows that the vector modes have no kinetic term and , hence vector modes do not propagate.
The analysis in the scalar sector is more involved but facilitated by the fact that one can impose the gauge where (unless ). The equations then imply that the scalar amplitudes , , can be expressed in terms of and the effective action reduces to
[TABLE]
with
[TABLE]
where
[TABLE]
Both the kinetic term and sound speed squared should be positive for the system to be stable. Summarizing, the theory shows two propagating modes in the tensor sector and one scalar mode. The tensor modes propagate with the speed of light, as expected, while properties of the scalar mode depend on the background. The above formulas apply for the Palatini-derived theory described by (2.27), (2), (2.29), (2). The corresponding formulas in the metric theory described by (3.1)–(3) are obtained by making in (4), (4) the inverse to (3.4) replacement : .
5 Cosmologies
In order to study concrete solutions, we must specify the functions , , . We assume them to be independent of ,
[TABLE]
in which case the theory is invariant under shifts
[TABLE]
As the simplest option, we assume and to be linear in , hence
[TABLE]
where are constant parameters, so that
[TABLE]
Eqs.(2.27) then become
[TABLE]
with the energy-momentum tensor
[TABLE]
while the scalar field equation (2.29) becomes total derivative,
[TABLE]
with the current
[TABLE]
The equations of the corresponding metric version of the theory are obtained by simply omitting in (5.6),(5.8) the terms proportional to .
We shall now study homogeneous and isotropic cosmologies of this shift-invariant theory, first within its Palatini version and then within its metric version. Even though these two theories belong to the same KGB class, their solutions are different since the equations are not the same and the stability conditions derived in Section 4 are also not the same, since they are background-dependent.
It turns out that the problem reduces to a single algebraic equation (5.24) which determines algebraic curves whose position and critical points determine the properties of the cosmologies. One finds in this way many different solutions: self-accelerating cosmologies, recollapsing cosmologies, and bounces. We give below their detailed classification, analyse their stability conditions, identify stable branches, and make comparison with the previously known results [12], [14].
5.1 Master equations
Assuming the homogeneous and isotropic ansatz (4.1),(4.2) for the fields, the Einstein equations (5.5) reduce to
[TABLE]
with . These equations can also be obtained by injecting (5.3) to (4). The only non-trivial component of the scalar current (5.8) is
[TABLE]
and the scalar field equation (5.7) reads
[TABLE]
which implies that
[TABLE]
where is the integration constant – the scalar charge.
The simplest solution of these equations is and . This solution is stable, although the general stability analysis carried out above does not apply in this particular case since the gauge cannot be imposed if . One should repeat the analysis keeping and then one finds in the scalar sector .
For solutions with one can use the general formulas (4.14) for the kinetic term and the sound speed, which now reduce to
[TABLE]
If does not vanish then (5.13) can be resolved with respect to the Hubble parameter,
[TABLE]
Injecting this to (5.9) yields the algebraic relation between and ,
[TABLE]
while injecting to (5.10) determines the derivative of ,
[TABLE]
Eqs.(5.15),(5.16),(5.17) are invariant under
[TABLE]
which provides the one-to-one correspondence between solutions of two theories which differ by the sign of . Therefore, one can assume without loss of generality that . The equations are also invariant under the time reversal , which changes the sign of the first derivatives and of the current, but not of the second derivatives, hence
[TABLE]
This swaps the expanding solutions and contracting solutions.
It follows from (5.15) that if approaches zero then either the Hubble rate should diverge, or, if it remains finite, then the scale factor should diverge. This corresponds either to the initial singularity or to future infinity. Therefore, between these two extremities cannot vanish and should be sign definite, either everywhere positive or everywhere negative.
One can absorb the parameters and by expressing in terms of dimensionless333Assuming the spacetime coordinates to have the dimension of length, , our normalisation of the action (2.1) implies that are dimensionless while . quantities via
[TABLE]
where the Hubble scale is determined by the length scale ,
[TABLE]
Here if , while if then is an arbitrary dimensionless parameter. The variable in (5.20) must be non-negative while can be positive or negative.
Injecting (5.20) to (5.14)–(5.17) yields,
[TABLE]
where the sign of should be chosen such that , hence different values of correspond to different solutions whose scale factors are “homothetic” to each other. As a result, if then one can assume without loss of generality that either or , depending on sign of . One obtains also
[TABLE]
while Eq.(5.16) reduces to
[TABLE]
with
[TABLE]
Eq.(5.17) yields
[TABLE]
The kinetic term and the sound speed in (5.14) become
[TABLE]
Eqs.(5.23)–(5.1) determine the solutions and their stability.
5.2 Currentless solutions
Let us first consider solutions with vanishing scalar charge, , in which case, according to (5.13), the current is zero. If then, according to (5.20), one has either hence the system is in vacuum, or and then Eq.(5.24) reduces to
[TABLE]
hence is constant, and are constant as well, and the geometry is de Sitter. If then . If then diverges for and has a minimum in between, hence for exceeding some minimal value there are two different solutions of (5.27), and (see Fig.1).
Let us assume first that hence . Then for , for example, one finds two solutions of (5.27) with the following properties:
[TABLE]
Notice that for the first of these solutions hence it is unstable.
If decreases to the negative region then the two values and approach each other and merge for , in which case
[TABLE]
The Hubble rate vanishes for this solution and the geometry is flat, even though . This solution exists only for but it is unstable since . This does not mean that flat space is always unstable in the theory, because the flat space solution can also be obtained in a different way: by setting and , in which case it is stable (as was mentioned above, the formulas (5.1) do not apply if ).
Let us determine the stability region. If then and defined by (5.1) reduce to
[TABLE]
It follows that if hence all solutions with show either ghost or gradient instability. If then is always positive while will be non-negative if , hence if (see Fig.1)
[TABLE]
Solutions with always violate this condition hence they are all unstable. Solutions with fulfill this condition if
[TABLE]
To recapitulate, the currentless solutions are characterised by a constant value of the scalar field gradient and by a constant Hubble rate; their geometry is de Sitter. For they exist if only and they are stable for . All of such solutions for or are unstable (we shall later see that if the current does not vanish then stable solutions exist for any ).
5.3 Solutions with a non-zero current
If then the current is hence the amplitude defined in (5.20) does not vanish. However, since for , it follows that approaches zero at late times and the solutions then approach the described above configurations with constant and de Sitter geometry. It follows from the above analysis that if approaches zero then must approach either , in which case the product becomes negative and the solution becomes unstable, or approaches and then the solution is stable if .
For Eq.(5.24) can be resolved yielding two different solutions, or . From now on and till the end of the next sub-section we set , then
[TABLE]
These functions are defined only in the region where . This region must contain a zero of since we want the solution to approach for one of the de Sitter backgrounds described above. If then vanishes at which point is known to be unstable, whereas vanishes at , and we know that this point is stable. Therefore, we choose
[TABLE]
assuming that and hence for . For finite values of the universe size, when , one chooses , in which case one has . According to (5.22), the scale factor is proportional to
[TABLE]
where the sign of should be chosen such that . This implies that a for and a for . Injecting (5.35) to (5.23) yields the Hubble parameter,
[TABLE]
and similarly injecting to (5.1) yields and .
As a result, Eqs.(5.35),(5.36),(5.37) provide the solution in the parametric form, with being the parameter. Inverting a in (5.36) to obtain , the solution can be expressed in terms of the scale factor as shown in Fig.2.
One can see that, as the scale factor a varies from zero to infinity, the gradient squared of the scalar field, , decreases from infinity to the asymptotic value . The Hubble function decreases from infinity to the constant asymptotic value defined by (5.23) with and . The important point is that the kinetic term and the sound speed remain positive for all values of a, as seen in Fig.2, hence the solutions are always stable.
To determine the behaviour near the initial singularity, we notice that when is large and a is small, then one has from (5.35),(5.36),(5.37)
[TABLE]
hence
[TABLE]
As a result, the system behaves as a perfect fluid with the effective equation of state , which is somewhere in between the dust and radiation .
To recapitulate, the system admits cosmological solutions with a non-zero scalar current. Close to the initial singularity, the squared gradient of the scalar field is , which mimics a perfect fluid with the equation of state . As the size of the universe grows, and the Hubble rate approach constant values. These solutions are stable.
It is worth noting that these solutions can describe both the expansion and contraction of the universe, according to the choice of sign in Eq.(5.20). Choosing the plus sign yields , hence the expansion, in which case one should choose since . Choosing the minus sign gives the contraction with , and then one should choose . The two cases are related by the symmetry (5.19).
5.4 More general solutions
These are defined by the algebraic curve subject to (5.24). To study this curve, we plot together the functions and defined by (5.34), which allows us to distinguish different solution types. Depending on value of , these solutions can be classified as follows.
5.4.1
Type I. An example of such solutions is shown in Fig.3 for . Both and are everywhere negative and defined only in the region where the argument of the square root in (5.34) is positive.
In the limit the square root vanishes and the and branches merge at the point marked on the left panel in Fig.3. Nothing special happens at this point: the solution simply passes from the lower branch to the upper branch in the direction indicated by the arrow in Fig.3. The direction is determined by the fact that at the lower branch the derivative , as shown in Fig.3, hence decreases towards the minimal value , while at the upper branch the derivative and increases.
The scale factor obtained from (5.36) increases along the lower branch and the corresponding Hubble parameter is positive, , as shown on the right panel in Fig.3. After passing to the upper branch, the scale factor first continues to increase up to a maximal value, then the Hubble parameter changes sign and the universe starts shrinking.
Therefore, the universe starts from zero size at and , then it expands first along the branch and next along the branch, then the scale factor reaches a maximal finite value, after which the universe shrinks back to zero size along the branch. The sound speed becomes negative at the branch, hence the solution is unstable.
The solution remains qualitatively the same for any , when both and remain negative, but the maximal value of approaches zero from below when increases.
5.4.2
Type II. The curve remains qualitatively the same as before but the branch touches zero from below at the point O indicated on the left panel in Fig.4. The position of this point is described by Eq.(5.30) above. Since vanishes at this point, the universe size (5.36) becomes infinite. Therefore, there are actually two different solutions in this case. The part of the -curve on the left panel in Fig.4 which is on the left from the point O describes the universe expanding from zero size to infinity. The part of the curve on the right from O describes the universe shrinking from infinite size to zero.
Since at the point O the sound speed squared becomes negative (see Eq.(5.30)), both of these solutions show a gradient instability.
5.4.3
The curve remains qualitatively the same as before but shifts upwards and crosses zero twice at and with defined by Eq.(5.27). This is the case for the , , and curves on the left panel in Fig.4. Since , it follows that (see (5.36)), hence the single curve determines three different solutions of types III, IV, V described below.
Type III. This solution corresponds to the left part of the curve where (the , , and curves in Fig.4). This determines the universe expanding from zero to infinity. This solution can be stable or unstable, depending on the position of the point of the merging of the and . If the merging point is below the -axis (as for the -curve in Fig.4) then the solution is unstable. The solution becomes stable for when the merging point is at the -axis (the -curve in Fig.4), and it remains stable when moves further up (the -curve in Fig.4). The profiles of the solution in the latter two cases are similar to those shown in Fig.2.
Type IV. This solution corresponds to the part of the curve interpolating between and (the , , and curves in Fig.4). This describes a bounce – the universe shrinking to a finite size and then expanding back to infinity. This solution is always unstable since it contains the point , which is known to be unstable.
Type V. This solution corresponds to the right part of the curve where , (the , , and curves in Fig.4). This also corresponds to the universe expanding from zero to infinity (or contracting, depending on the sign choice in (5.20)), and it is always unstable because it contains the unstable point , .
5.4.4
The curve moves further upwards and develops a disjoint part – a small loop, as illustrated by the -curve shown on the right panel in Fig.4. The curve therefore splits into two disconnected subsets – the compact part (the loop) and the non-compact part. The non-compact part corresponds to three different solutions of Types III–V described above; Type III solution always being stable. The compact part corresponds to a new solution type with the following properties.
Type VI. The small loop shown in Fig.4 touches the vertical axis at the point where the universe size diverges. In the vicinity of this point one has
[TABLE]
where . The evolution along the loop corresponds to the universe starting from an infinite size in the past, then shrinking to a finite size, bouncing back and expanding again to an infinite size . These solutions show ghost.
5.4.5
If exceeds the value then the two disjoint pars of the curve interconnect to form one connected manifold, as illustrated by the curve in Fig.4. This corresponds to four different solutions. The two parts of the curve where correspond to solutions of Types III and V; Type III always being stable. The parts of the curve where correspond to two different solutions of the following new type.
Type VII. The two parts of the -curve which interpolate between points and or between and correspond to bounces – the universe starts from and ends up with an infinite size. These solutions are unstable.
Summarizing, the only stable solutions in the above classification are those of Type III; they exist only for . They are qualitatively the same as those previously described in sub-section 5.3.
5.5 Solutions with
Solving Eq.(5.24) for or yields
[TABLE]
which should be injected to (5.36), (5.37) and to (5.1) to determine the solutions. The behaviour of is shown in Fig.5, and this time one finds only two qualitatively different solution types. First, if then is illustrated by the curve shown on the left panel in Fig.5. This gives rise to Type I solutions described above, they are always unstable.
If then both and curves touch the vertical axis at the point with coordinates , which corresponds to (see Fig.5), hence the solution splits in two. One solution is generated by curves which emanate from downwards. These solutions are stable. The other solutions are generated by curves emanating from towards increasing values of , all of them are unstable.
Let us describe the stable solutions. If and then the Hubble parameter at the point becomes (see (5.23)), which corresponds to the behaviour (5.40). Therefore, the curves for shown in the lower left corner on the left panel in Fig.5 describe the universe starting from zero size in the past and expanding in the future as with . These solutions are stable.
If and then one has for small
[TABLE]
Therefore, the curve for in the left lower corner on the left panel in Fig.5 describes the universe starting from zero size in the past and entering in the future the regime corresponding to the equation of state.
If but then at small one has
[TABLE]
where . Therefore, the curves for shown in the lower left corner on the right panel in Fig.5 correspond to the universe starting from a zero size in the past and approaching asymptotically the de Sitter phase. The squared gradient of the scalar field asymptotically approaches zero. These solutions are stable.
If then for any one has
[TABLE]
therefore at all times the universe exactly follows the equation of state
[TABLE]
This type of behaviour we have already seen in (5.39) close to the singularity, but this time it holds everywhere. This solution is stable.
Summarizing, stable for solutions exist for and are generated by the curves residing in the lower left corners in the diagrams in Fig.5. They describe universes expanding from zero size to infinity. If then the universe approaches in the future the de Sitter phase with the Hubble rate if , while for it expands at late times according to the equation of state. For and the universe approaches the de Sitter phase with the same Hubble rate if , whereas for and it expands at all times according to the equation of state.
5.6 Solutions in the metric version of the theory
Let us now compare studied above solutions in the Palatini version of the theory with those arising in the metric version of the theory. As was mentioned, the equations of the metric version can be obtained from (5.9), (5.10), (5.11), (5.13) by omitting terms proportional to . Applying the rescaling (5.20) then yields the modified version of Eqs.(5.23), (5.24), (5.25), (5.1). The equation for becomes
[TABLE]
and one has
[TABLE]
The properties of perturbations are read-off from (4.14),(4), after replacing in these formulas . This yields the kinetic term and sound speed:
[TABLE]
The procedure is then the same as before: first one solves (5.46) to obtain with
[TABLE]
This determine algebraic curves shown in Figs.6,7 (in the online version of Figs.4–7 the and amplitudes are shown, respectively, in dark-blue and dark-red). The interpretation of these curves is obtained by injecting to (5.47) and (5.48): for example, points where either crosses the horizontal axis or touches the vertical axis correspond to the infinite size of the universe. As a result, the curves in Figs.6,7 corresponds either to universes expanding from zero to infinite size, or to universes expanding only up to a finite size and then shrinking, or to bounces.
The stable solutions are again only those generated by the parts of the curves located under the horizontal axis in the left lower corner of the diagrams in Figs.6,7. Such solutions exist for any but only for . Their profiles are qualitatively similar to those shown in Fig.2. The overall conclusion is that, despite their surprising variety, solutions in the Palatini-derived theory and those of the metric theory are qualitatively similar to each other.
5.7 Summary and comparison with the previously studied case
We presented above all homogeneous and isotropic cosmologies in two special shift-symmetric KGB models, one of which is the standard metric theory with the coefficient functions
[TABLE]
The other one is its Palatini version, which, according to the discussion in Section 3, can also be viewed as the standard metric theory with
[TABLE]
As we have seen, the classification of solutions in these two theories is rather complicated, but the solutions are sensitive only to and to the sign of , while only changes the scale. Despite some differences between the two theories seen in the figures above, the properties of the solutions turn out to be essentially the same in both cases and the solutions are always found to be either expanding or recollapsing universes, or bounces.
The most interesting physically are the stable expanding cosmologies. They exhibit the effective equation of state near singularity and at late times they approach the currentless de Sitter phase with a constant Hubble rate. If or then the Hubble rate is expressed in terms of in the usual way, , but for it is not directly related to , and if then
[TABLE]
where the numerical coefficient in model (5.50) and in model (5.51). The theories also show recollapses and bounces, but these solutions are always unstable.
To the best of our knowledge, a similar complete classification of KGB cosmologies has never been reported in the literature. At the same time, it was known before that the equations are completely integrable in the shift-symmetric case, and cosmologies were much studied from a more physical perspective already in the original KGB paper [12].
The specific model considered in Ref.[12] corresponds to (5.50) with 444See Eq.(5.1) in [12] where one should replace since the metric signature used in that paper is opposite to ours. but with replaced by an external matter energy density, . The case of a constant was also considered there, and it seems that what is shown in Fig.3 in Ref.[12] corresponds to our solutions with and on the left panel of our Fig.6. The curve shown there describes three different solutions: a stable cosmology, an unstable cosmology, and a bounce, the two latter showing the ghost. Now, on the right panel in Fig.6 we plotted profiles of these solutions against the dimensionless variable defined as in the region and in the region. We used the symmetry (5.19) to relate the values of the solutions for opposite signs of . The dimensionless Hubble parameter shown in Fig.6 is defined as for and for , while the dimensionless current is for and for . The vertical lines delimit the instability region where . The resulting diagram is very similar to Fig.3 in [12] and describes three different cosmologies, since zeros of correspond to the infinite universe size. The rightmost part of the diagram where describes the stable branch.
The main accent Ref.[12] was made on the analysis of the currentless solutions (called fantom attractors) in the presence of a generic external matter. Although this goes beyond the scope of our program, we can recover the same results. Replacing in (5.9) and in (5.10) with , yields
[TABLE]
where distinguishes between theories (5.50) and (5.51) by taking values [math] and , respectively. This defines , and , where is obtained by setting the current to zero (as in (5.15)):
[TABLE]
Injecting this to (5.53) and (5.54) yields algebraic relations which determine and in terms of and . Computing then and we recover the results of Ref.[12]. For example, we see that at early times the external matter dominates and while the effective equation of state of the scalar is then determined by , whereas at late times and .
Bounces in the presence of a “hot matter” were studied in Ref.[14]. Interestingly, solutions whose evolution at the turnaround point is stable were detected, although they still show instability somewhere [14]. Our bounces are also unstable, in addition we do not find solutions whose evolution at the turnaround point would be stable. The difference must be due to the fact that our bounces are “cold” and not “hot”, and also because the model considered in [14] corresponds to555After the replacement due to the signature change. , which is different from our model (5.51) with the negative coefficient in front of .
6 More general Horndeski models
We have studied up to now the Palatini version of the Horndeski models respecting the condition (1.4). These theories are described by second order equations and are therefore free of the Ostrogradsky ghost. It turns out that relaxing the condition (1.4) invariably produces higher derivatives within the Palatini approach. However, the ghost does not always arise. To illustrate this, let us consider a simple example obtained by setting in (1)
[TABLE]
with constant . The Horndeski Lagrangian reduces to
[TABLE]
where the dots denote total derivatives. The term in the second line vanishes, but it would be proportional to the non-metricity within the Palatini approach, hence dropping this term is equivalent to choosing a non-zero in (1.13)666Keeping the term in the metric-affine approach would render the torsion dynamical, as explained after Eq.(1.17).. Consider the metric-affine version of the third line in (6.2),
[TABLE]
where and . Varying this with respect to and using (2.18) yields
[TABLE]
In the metric case one has and hence the equation reduces to which contains only second derivatives. However, if then and the equation contains higher derivatives, which can be seen as follows. The Lagrangian can be represented as
[TABLE]
with
[TABLE]
where as usual . Introducing defined by the relation
[TABLE]
hence
[TABLE]
the Lagrangian becomes
[TABLE]
It is well-known that varying this Lagrangian with respect to the connection yields
[TABLE]
hence is the Levi-Civita connection for the effective metric . Since the latter contains derivatives in (6.8), it follows that contains second derivatives hence both and the equation contains third derivatives of .
At the same time, the relation (6.8) between and is an invertible disformal transformation, hence one can consider , as independent variables instead of , . Varying the Lagrangian with respect to yields
[TABLE]
which are the vacuum Einstein equation for the Ricci tensor constructed from the metric in the standard way. They imply that , hence the scalar field equation (6.4) is fulfilled as well. Therefore, the theory (6.3) is simply the vacuum General Relativity for the effective metric so that the ghost is absent.
The original metric is obtained from by inverting the relation (6.8), and since the latter contains the scalar field remaining undefined, there are infinitely many metrics for a given Ricci-flat . This ambiguity can be removed by adding to the Lagrangian to produce a non-trivial condition for . The equations will still contain higher derivatives when expressed in terms of , but they become second order equations when expressed in variables.
Summarizing, the theory (6.3) contains higher derivatives when parameterized in terms of and hence it is outside the Horndeski family. At the same time, it is ghost-free since the disformal transformation (6.8) removes the higher derivatives, hence it must belong to the DHOST family (similar examples were considered in [36]).
However, in the generic case the theory turns out to be outside the DHOST family and shows ghost. Consider, for example, the Palatini version of the entire piece of the Horndeski Lagrangian (1) generated by ,
[TABLE]
Solving the equation for the connection gives
[TABLE]
where is displayed in the Appendix. Injecting this back to yields for a generic a metric Lagrangian that belongs neither to the Horndeski nor to DHOST family. Therefore the theory contains ghost. For the particular choice the theory can be shown to be of the DHOST type, but it is unclear if the ghost can be removed in other cases, for example by adding to the Lagrangian a non-trivial as in (1.13).
The Palatini versions of the parts of the Lagrangian (1) containing remains totally unexplored since the torsion is then dynamical.
7 Concluding remarks
Summarizing the above discussion, we have studied what happens if the Horndeski theory is treated within the Palatini approach. It turns out that there are infinitely many metric-affine versions of the original Horndeski Lagrangian which differ from each other by terms proportional to the non-metricity tensor, as expressed by (1.13). Each defines a theory which is equivalent to a certain metric theory with the Lagrangian obtained by injecting the algebraic solution for the connection back to . Therefore, the metric-affine generalisations of the Horndeski theory reduce again to metric theories for a gravity-coupled scalar field.
Every such a metric theory can either belong to the original Horndeski family, or it can be of a more general DHOST type, or it can be something else, in which case it has the Ostrogradsky ghost. Therefore, the metric-affine generalisations of the Horndeski theory can be ghost-free but not all of them are ghost-free.
It is interesting to know when these theories are ghost-free. We were able to give the answer for the KGB subset of the Horndeski theory defined by the condition (1.4): it turns out that its metric-affine version defined by (1.6)–(1.9) is ghost-free because it yields a theory which is again in the metric KGB class. We have also checked that its generalisation defined by (1.13), where contains only the linear in the non-metricity terms shown in (1.14) remains ghost-free [31]. We classified all homogeneous and isotropic cosmologies in these theories.
The situation with more general Horndeski models is more complicated and will be reported separately [30]. It is possible that the metric-affine versions of the parts of the Horndeski Lagrangian containing could be made ghost-free by carefully adjusting but is unclear if this procedure works for generic . The situation is totally unexplored if the Lagrangian contains .
One should also say that the Horndeski theory is not the only one whose metric-affine versions can be ghost-free. For example, the theory described
[TABLE]
has second order equations but does not reduce to Horndeski theory when the non-metricity vanishes.
Another example is provided by the Lagrangian [35]
[TABLE]
where the terms in the first line are the same as in the Horndeski theory, whereas those in the second line do not have the Horndeski structure. Adding to this a suitably chosen made of the non-metricity and varying yields a particular member of the DHOST family [35] (we were able to confirm this [30]), hence the theory is ghost-free.
An example of a completely different type is provided by the Born-Infeld theory,
[TABLE]
which has second order equation [32]. It follows that the Horndeski Lagrangian is not the most general one that leads to second order field equations within the Palatini approach. An interesting problem would be to find the most general ghost-free metric-affine theory.
Acknowledgments.– It is a pleasure to acknowledge discussions with Evgeny Babichev, Thibault Damour, Alexander Vikman, and especially with Dr. Katsuki Aoki who explained to us his work and helped to clarify a number of important issues. M.S.V. thanks for hospitality the YITP in Kyoto, where a part of this work was completed. His work was also partly supported by the PRC CNRS/RFBR project grant, as well as by the Russian Government Program of Competitive Growth of the Kazan Federal University.
Appendix
Here is the explicit form of the non-metric part of the connection in (6.13):
[TABLE]
with and the functions defined as
[TABLE]
The function here should not be confused with those used in the main text.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Riess et al., Observational evidence from supernovae for an accelerating universe and a cosmological constant , Astron.Journ. 116 (1998) 1009.
- 2[2] S. Perlmutter et al., Measurements of ω 𝜔 \omega and λ 𝜆 \lambda from 42 high-redshift supernovae , Astrophys.Journ. 517 (1999) 565.
- 3[3] E. J. Copeland, M. Sami and S. Tsujikawa, Dynamics of dark energy , Int. J. Mod. Phys. D 15 (2006) 1753–1936 , [ hep-th/0603057 ]. · doi ↗
- 4[4] A. Joyce, B. Jain, J. Khoury and M. Trodden, Beyond the Cosmological Standard Model , Phys. Rept. 568 (2015) 1–98 , [ 1407.0059 ]. · doi ↗
- 5[5] T. P. Sotiriou and V. Faraoni, f(R) theories of gravity , Rev. Mod. Phys. 82 (2010) 451–497 , [ 0805.1726 ]. · doi ↗
- 6[6] A. De Felice and S. Tsujikawa, f(R) theories , Living Rev. Rel. 13 (2010) 3 , [ 1002.4928 ]. · doi ↗
- 7[7] A. A. Starobinsky, A new type of isotropic cosmological models without singularity , Phys. Lett. B 91 (1980) 99–102 . · doi ↗
- 8[8] G. W. Horndeski, Second-order scalar-tensor field equations in a four-dimensional space , Int.J.Theor.Phys. 10 (1974) 363–384 . · doi ↗
