Tensor $f(R)$ theory of gravity

Tomasz Stachowiak

arXiv:1703.06850·gr-qc·August 16, 2017

Tensor $f(R)$ theory of gravity

Tomasz Stachowiak

PDF

TL;DR

This paper introduces a novel $f(R)$ gravity theory applying the function to the Ricci tensor, leading to modified Einstein equations with potential cosmological implications like singularity avoidance and exponential expansion.

Contribution

It proposes a new $f(R)$ gravity framework based on the Ricci tensor, extending standard $f(R)$ theories and deriving corresponding field equations for various connection types.

Findings

01

Modified Einstein equations derived for metric and nonmetric connections.

02

Cosmological models exhibit non-singular initial states and exponential expansion.

03

Potential observational tests for the new gravity theory are provided.

Abstract

I propose an alternative $f (R)$ theory of gravity constructed by applying the function $f$ directly to the Ricci tensor instead of the Ricci scalar. The main goal of this study is to derive the resulting modified Einstein equations for the metric case with Levi-Civita connection, as well as for the general nonmetric connection with torsion. The modification is then applied to the Robertson-Walker metric so that the cosmological evolution corresponding to the standard model can be studied. An appealing feature is that even in the vacuum case, scenarios without initial singularity and exponential expansion can be recovered. Finally, formulae for possible observational tests are given.

Equations198

L_{0} = f (tr [R]) = f (R_{a c} g^{c a}),

L_{0} = f (tr [R]) = f (R_{a c} g^{c a}),

L_{g} = tr [f (R)] = [f (R)]_{a c} g^{c a},

L_{g} = tr [f (R)] = [f (R)]_{a c} g^{c a},

R^{a}_{b} v^{b} = λ v^{a},

R^{a}_{b} v^{b} = λ v^{a},

⟨ u, R (v)⟩ = u^{a} g_{ab} R^{b}_{c} v^{c} = u^{a} R_{c a} v^{c} = R^{b}_{a} u^{a} g_{b c} v^{c} = ⟨ R (u), v ⟩,

⟨ u, R (v)⟩ = u^{a} g_{ab} R^{b}_{c} v^{c} = u^{a} R_{c a} v^{c} = R^{b}_{a} u^{a} g_{b c} v^{c} = ⟨ R (u), v ⟩,

[f^{*} (R)]^{a}_{b}

[f^{*} (R)]^{a}_{b}

= f_{0} 1^{a}_{b} + f_{1} R^{a}_{b} + f_{2} R^{a}_{s} R^{s}_{b} + f_{3} R^{a}_{s} R^{s}_{t} R^{t}_{b} + \dots,

f (ξ) = n = 0 \sum \infty f_{n} ξ^{n},

f (ξ) = n = 0 \sum \infty f_{n} ξ^{n},

L_{0}

L_{0}

= 1 + R^{a}_{a} + \frac{1}{2 !} (R^{a}_{a})^{2} + \frac{1}{3 !} (R^{a}_{a})^{3} + \dots,

L_{g}

= d + R^{a}_{a} + \frac{1}{2 !} R^{a}_{b} R^{b}_{a} + \frac{1}{3 !} R^{a}_{b} R^{b}_{c} R^{c}_{a} + \dots,

L_{0} = f (i \sum λ_{i}) vs L_{g} = i \sum f (λ_{i}) .

L_{0} = f (i \sum λ_{i}) vs L_{g} = i \sum f (λ_{i}) .

f (R) := \frac{1}{2 π i} \int_{C} (ξ 1 - R)^{- 1} f (ξ) d ξ,

f (R) := \frac{1}{2 π i} \int_{C} (ξ 1 - R)^{- 1} f (ξ) d ξ,

f (R) = C_{0} \tilde{f} (\frac{C _{0}}{C _{1}} \frac{R}{C _{0}}) \to C_{0} \tilde{f} (R / C_{0}),

f (R) = C_{0} \tilde{f} (\frac{C _{0}}{C _{1}} \frac{R}{C _{0}}) \to C_{0} \tilde{f} (R / C_{0}),

S = \int (\frac{1}{16 π G} L_{g} + L_{M}) - g d^{4} x,

S = \int (\frac{1}{16 π G} L_{g} + L_{M}) - g d^{4} x,

δ tr [f (R)]

δ tr [f (R)]

= tr [\frac{1}{2 π i} \int_{C} (ξ 1 - R)^{- 1} f^{'} (ξ) d ξ δ R] = tr [f^{'} (R) δ R],

tr [X_{1} X_{2} \dots X_{k}] = tr [X_{k} X_{1} X_{2} \dots X_{k - 1}],

tr [X_{1} X_{2} \dots X_{k}] = tr [X_{k} X_{1} X_{2} \dots X_{k - 1}],

\nabla_{e_{a}} e_{b} = Γ^{c}_{ba} e_{c},

\nabla_{e_{a}} e_{b} = Γ^{c}_{ba} e_{c},

\nabla_{a} X^{b} = \partial_{a} X^{b} + Γ^{b}_{c a} X^{c} .

\nabla_{a} X^{b} = \partial_{a} X^{b} + Γ^{b}_{c a} X^{c} .

T (X, Y) := \nabla_{X} Y - \nabla_{Y} X - [X, Y] = e_{a} T^{a}_{b c} X^{b} Y^{c},

T (X, Y) := \nabla_{X} Y - \nabla_{Y} X - [X, Y] = e_{a} T^{a}_{b c} X^{b} Y^{c},

T^{a}_{b c} = Γ^{a}_{c b} - Γ^{a}_{b c} .

T^{a}_{b c} = Γ^{a}_{c b} - Γ^{a}_{b c} .

R (X, Y) Z := \nabla_{[X} \nabla_{Y]} Z - \nabla_{[X, Y]} Z = e_{d} R^{d}_{ab c} Z^{a} X^{b} Y^{c},

R (X, Y) Z := \nabla_{[X} \nabla_{Y]} Z - \nabla_{[X, Y]} Z = e_{d} R^{d}_{ab c} Z^{a} X^{b} Y^{c},

R^{d}_{ab c} = \partial_{b} Γ^{d}_{a c} - \partial_{c} Γ^{d}_{ab} + Γ^{d}_{s b} Γ^{s}_{a c} - Γ^{d}_{sc} Γ^{s}_{ab},

R^{d}_{ab c} = \partial_{b} Γ^{d}_{a c} - \partial_{c} Γ^{d}_{ab} + Γ^{d}_{s b} Γ^{s}_{a c} - Γ^{d}_{sc} Γ^{s}_{ab},

R_{ab} := R^{c}_{a c b} .

R_{ab} := R^{c}_{a c b} .

δ Γ^{c}_{ba} = \frac{1}{2} g^{c d} (\nabla_{b} δ g_{a d} + \nabla_{a} δ g_{d b} - \nabla_{d} δ g_{ba}),

δ Γ^{c}_{ba} = \frac{1}{2} g^{c d} (\nabla_{b} δ g_{a d} + \nabla_{a} δ g_{d b} - \nabla_{d} δ g_{ba}),

δ R_{ab} = \nabla_{c} (δ Γ^{c}_{ab}) - \nabla_{b} (δ Γ^{c}_{a c}),

δ R_{ab} = \nabla_{c} (δ Γ^{c}_{ab}) - \nabla_{b} (δ Γ^{c}_{a c}),

δ R_{ab}

δ R_{ab}

= - \frac{1}{2} (g^{c d} \nabla_{b} \nabla_{a} δ g_{d c} + \nabla_{b} \nabla^{d} δ g_{a d} - \nabla_{b} \nabla_{d} g^{c d} δ g_{c a})

= \frac{1}{2} (\nabla^{d} \nabla_{b} δ g_{a d} + \nabla^{d} \nabla_{a} δ g_{b d} - □ δ g_{ab} - g^{c d} \nabla_{b} \nabla_{a} δ g_{c d}) .

0 = δ (1^{a}_{c}) = g_{b c} δ g^{ab} + g^{ab} δ g_{b c},

0 = δ (1^{a}_{c}) = g_{b c} δ g^{ab} + g^{ab} δ g_{b c},

δ R^{a}_{b} = g^{a c} (δ R_{c b} - R^{s}_{b} δ g_{cs}),

δ R^{a}_{b} = g^{a c} (δ R_{c b} - R^{s}_{b} δ g_{cs}),

δ (tr [f (R)] - g)

δ (tr [f (R)] - g)

= ([f^{'} (R)]^{a c} δ R_{c a} - [R f^{'} (R)]^{c d} δ g_{d c} + \frac{1}{2} tr [f (R)] g^{b d} δ g_{b d}) - g .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Tensor $f(R)$ theory of gravity

Tomasz [email protected]

Department of Applied Mathematics and Physics,

Graduate School of Informatics, Kyoto University,

606-8501 Kyoto, Japan

Abstract

I propose an alternative $f(R)$ theory of gravity constructed by applying the function $f$ directly to the Ricci tensor instead of the Ricci scalar. The main goal of this study is to derive the resulting modified Einstein equations for the metric case with Levi-Civita connection, as well as for the general nonmetric connection with torsion. The modification is then applied to the Robertson-Walker metric so that the cosmological evolution corresponding to the standard model can be studied. An appealing feature is that even in the vacuum case, scenarios without initial singularity and exponential expansion can be recovered. Finally, formulae for possible observational tests are given.

1 Introduction

The foundation of the present work is to consider a modified Lagrangian (density), which depends functionally on the full Ricci tensor $R_{ab}$ , not just on its trace $\mathscr{R}$ as is the case in the so-called $f(\mathscr{R})$ theories of gravity. The principles of relativity require that this modification be obtained covariantly, and not component-wise, so writing $f(R_{ab})$ could be misleading. Since $f$ will be a tensor-valued function, for the sake of distinction from the usual $f(\mathscr{R})$ theory, the extension will be referred to as tensor $f(R)$ .

The motivation in both cases is the same – the inclusion of higher-order-of-curvature effects which can classically be ignored, but which lead to important modifications in other regimes. Most notably, the Starobinsky inflation model [1] induced by quadratic terms is a particularly important result in this spirit. Although initially introduced on quantum gravity grounds, with corrections built from various contractions of the Ricci tensor, it is now often considered in the language of quadratic $f(\mathscr{R})$ theories [2].

Despite the initial similarity, the tensor $f(R)$ gravity presented here differs considerably from the usual one, and the goal of this article is to focus first on the development of this new theory, with a comparative study left for future work. Accordingly, the notation and mathematical setting will be given as well as the modified Einstein equations. Not to stop at the abstract level I will also consider possible applications to cosmology, with a view to nonsingular evolution, and provide basic formulae to be used in observational cosmology.

Notable differences and similarities with the ordinary $f(\mathscr{R})$ theory will be pointed out throughout the derivations in Sections 2, 4, and 5, but for a more complete, general review of the standard approach, the reader might want to consult review articles [3], [4], or [5] and references therein.

2 Construction of the modified action

In the usual $f(\mathscr{R})$ theories one postulates the Lagrangian

[TABLE]

with the summation convention used, and the covariant metric tensor denoted by $g_{ac}$ . On purely abstract grounds, the order in which $f$ and trace appear is not fixed, so instead of the above I will consider the Lagrangian to be

[TABLE]

where the square brackets are used to indicate elements of a matrix, and the bare symbol $R$ has to refer to the tensor not the scalar, as explained below.

A similar idea has been studied before by Borowiec [6, 7], but it differed from the present work in two ways. First, it used a torsionless metric and second, the Lagrangian depended on polynomial invariants of the Ricci tensor $\text{tr}[R^{k}]$ . Such scalars formed with powers $k$ higher than the space-time dimension can be reduced to the lower ones by using the characteristic polynomial. However, this cannot, in general, be done explicitly for transcendental functions – i.e., when one needs to use an infinite series of powers of $R$ . What is more, the coefficients of the characteristic polynomial themselves depend on the components of $R$ , leading to an unwieldy expression of an original function of $R$ in terms of a function of the invariants $\text{tr}[R^{k}]$ . The present work aims at overcoming this problem, and also at including connections with the most general torsion and nonmetricity.

To proceed with the general treatment, the first thing to settle is what tensors and operators to use, and in particular how to interpret $f(R)$ . Power series immediately come to mind, so what is needed is a representation of $R$ such that it can be composed with itself by matrix multiplication consistent with relativistic index contraction. In other words, $R$ should be an endomorphism, for instance on the tangent bundle over the space-time.

To treat $R$ as such an endomorphism, mixed indices have to be used so that the result composition ${R^{a}}_{b}{R^{b}}_{c}$ is again a mixed-indices tensor of the same valence. ${R_{a}}^{b}$ would do as well, but with the former choice the eigenvalue problem can be written as

[TABLE]

i.e., for eigenvectors rather than eigenforms, which seems more natural. The two are still equivalent through the musical isomorphism, and such an $R$ is a self-adjoint operator with regard to the metric

[TABLE]

provided that $R_{ab}$ is symmetric, which is the case for the Levi-Civita connection. When one allows for the torsion to be nonzero the above requires a generalization given in Section 4.

In the bracket-component notation, $f$ should act on $R$ considered as a linear operator with matrix elements ${[R]^{a}}_{b}$ , and should also give as the result an operator, whose elements are denoted by ${[f(R)]^{a}}_{b}$ . For example, for the composition with itself it is convenient to write ${[R\cdot R]^{a}}_{b}={[R^{2}]^{a}}_{b}$ , so the superscript 2 refers to the operator power, not a component. Accordingly, $R$ will signify the $(1,1)$ valence tensor, and for the Ricci scalar the contraction ${R^{a}}_{a}$ or $\mathscr{R}$ will be used. After “bracketing,” the index notation is recovered, which allows for raising and lowering; for brevity, the brackets will be omitted in the simplest cases such as $[R]_{ab}=R_{ab}$ .

For any analytic $f:\mathbb{R}\rightarrow\mathbb{R}$ the following definition of the matrix function $f^{*}:\mathbb{R}^{n^{2}}\rightarrow\mathbb{R}^{n^{2}}$ can be used 222As with $R$ , it will be convenient to sometimes write the identity operator without indices, so in order to avoid confusion with the variation $\delta$ , I will use ${\mathbb{1}^{a}}_{c}$ instead of the Kronecker symbol ${\delta^{a}}_{c}$ .:

[TABLE]

where

[TABLE]

and the sums are written explicitly, as they are not tensor contractions ( $R^{n}$ is an operator power as explained above). The above requires that the spectral radius of $\rho(R)=\lim\limits_{n\rightarrow\infty}\|R^{n}\|^{1/n}$ be less than the radius of convergence of the series $f(\xi)$ .

For example, when $f=\exp$ , the above two Lagrangians are

[TABLE]

where $d$ is the dimension of the space-time. Thus the first essential deviation appears at the quadratic level and is proportional to $f_{2}({R^{a}}_{b}{R^{b}}_{a}-({R^{a}}_{a})^{2})$ if the same $f$ is used in both approaches. The difference is also evident when the Lagrangians are written in terms of the eigenvalues of $R$ :

[TABLE]

A degeneracy in $\lambda_{i}$ might then lead to the same theories, e.g., when the traceless Ricci tensor vanishes: $\hat{R}^{a}_{\phantom{a}b}:={R^{a}}_{b}-\frac{1}{d}\mathscr{R}{\mathbb{1}^{a}}_{b}=0$ . The Ricci tensor is then proportional to the identity matrix and ${[f(R)]^{a}}_{a}=df(\mathscr{R}/d)$ , which, up to a simple rescaling of $f$ , is the same as the Lagrangian $\mathcal{L}_{0}$ . However, one has to be careful when making such substitutions directly in the action, because $R$ is determined only after having solved the Einstein equations. If the assumption $\hat{R}=0$ is justified from the beginning, the two theories coincide. We shall see in the examples below that even in an empty universe this condition might not hold generally but just for isolated solutions.

Note also – that if $f$ is determined, there is no freedom of choice for its constant term $f_{0}$ , which naturally corresponds to the cosmological constant. In other words, in such a nonperturbative interpretation, its value is tied to the whole expansion and cannot be adjusted independently. The expansion around $R=0$ also shows that when $f$ is almost linear, then higher-order terms can be ignored in the weak field limit leading to the Einstein-Hilbert action and a small perturbation of general relativity.

Although intuitive, the above definition is not very convenient when a function is real analytic but has complex singularities like $\tanh(\xi)$ . A definition better suited for the situation at hand is an elegant generalization of Cauchy’s formula333In what follows, $f$ and $f^{*}$ can safely be treated as the same object, so the star will be dropped.

[TABLE]

for a contour $C$ which encloses the spectrum of $R$ but not the singularities of $f(\xi)$ . The two definitions agree for fairly general assumptions, and for a function that is real on the real axis, the matrix $f(R)$ will also be real [8].

The dimension of $f(R)$ affects how the function is given, because $R$ has the units of curvature, and so should the Lagrangian. At first, it seems two constants are necessary to give $f(R)=C_{0}\tilde{f}(R/C_{1})$ in terms of a function $\tilde{f}$ which only contains dimensionless parameters, but this can be rewritten as

[TABLE]

with a redefined dimensionless $\tilde{f}$ . The remaining constant $C_{0}$ can then be further rescaled using the cosmological or the Hubble constant depending on context – this is done in Section 5.

Having defined $\text{tr}[f(R)]$ , the total action, including the matter Lagrangian $\mathcal{L}_{M}$ , is taken to be

[TABLE]

where $\mathscr{G}$ is the gravitational constant, and the modified Einstein equations can then be obtained in one of the two standard ways. One is to assume the Levi-Civita connection and take the metric as the dynamical variable; the other is to consider both the metric and the connection as dynamical. The former is called the metric and the latter the Palatini formulation (or, more generally, metric-affine).

In both cases the variation of the $f(R)$ term is needed, and the second definition of a tensor function allows us to easily calculate it as

[TABLE]

where the cyclic property of trace

[TABLE]

was used in the first line, and integration by parts in the second. Reexpressing $\delta R$ with $\delta g$ and $\delta\Gamma$ to arrive at the modified Einstein equation is the subject of the next two sections.

2.1 Definitions and notation

To shortly review the conventions used, the covariant derivative and the connection coefficients in a basis $\{e_{a}\}$ are related through

[TABLE]

so that for a coordinate basis $e_{a}=\partial_{a}$ one has

[TABLE]

As $\Gamma$ will not in general be symmetric in the lower indices, care needs to be taken regarding their order. The antisymmetric part of the connection defines the torsion as

[TABLE]

and in a coordinate basis, where $[\partial_{a},\partial_{b}]=0$ , it follows that

[TABLE]

The Riemann tensor is given by 444The brackets involving vectors denote commutation not antisymmetrization – i.e., there is no prefactor of $\frac{1}{2}$ .

[TABLE]

or, in term of components in a coordinate basis,

[TABLE]

and the Ricci tensor is the contraction

[TABLE]

Note, then that although $R_{ab}$ is constructed solely with the connection (curvature), for the operator ${R^{a}}_{b}=g^{ac}R_{cb}$ the metric is necessary. Finally, the signature will be taken to be $(-,+,+,+)$ , and the speed of light equal to unity, so that coordinates have the dimension of length, and the metric itself is dimensionless.

3 The metric approach

The natural connection solely determined by the metric through $\nabla_{a}g_{bc}=0$ and ${T^{a}}_{bc}=0$ is the Levi-Civita connection. Its variation, as expressed by $\delta g$ , is

[TABLE]

and in turn for the covariant Ricci tensor one has

[TABLE]

which accordingly gives

[TABLE]

Next, by observing that

[TABLE]

the variation of the operator $R$ becomes

[TABLE]

leading to

[TABLE]

The variation $\delta R_{ab}$ of (23) can be substituted into the above, and due to $\sqrt{-g}\;\nabla_{a}X^{a}=\partial_{a}\left(\sqrt{-g}X^{a}\right)$ each term containing the covariant derivative can be integrated by parts provided that the variations vanish at the boundary or that the boundary is empty. The result is

[TABLE]

Finally, defining the stress-energy tensor $\mathcal{T}$ by

[TABLE]

the condition $\delta\mathscr{S}=0$ gives the following modified Einstein equations

[TABLE]

As can be seen, the last two terms on the left-hand side reduce to the standard Einstein tensor for $f=\mathrm{Id}$ , whereas the other terms are zero since $f^{\prime}=1$ .

4 The Palatini approach

In the more general case, the connection is independent of the metric, and there are two assumptions that can be relaxed here: vanishing torsion and metric compatibility. In general the connection can be decomposed into the sum

[TABLE]

where $\widetilde{\Gamma}$ is the Levi-Civita connection for $g$ , $K$ is called the contorsion tensor, and $C$ describes the nonmetricity

[TABLE]

Accordingly, the variation of the Ricci tensor is now

[TABLE]

and neither the connection coefficients nor the Ricci tensor are symmetric in the lower indices. The eigenvalues of $R$ might not be real any more, in which case they appear in conjugate pairs. This means that the trace of $f(R)$ will still be real, for real analytic $f$ .

There is, however, a possible natural generalization, because of the following identity555The underline denotes the sum over cyclic permutations.

[TABLE]

which leads to the introduction of a new tensor, which is the symmetric part of $R$ ,

[TABLE]

These tensors have the same trace so there is no need for $S_{ab}$ in the standard $f(\mathscr{R})$ theories – the trace cancels the imaginary parts of the conjugate pairs of the eigenvalues. Here, the situation is different, because the function $f$ is applied to the eigenvalues of $R$ before the trace is taken, so although the final result is real, it also depends on the imaginary parts. The other reasons and equations for the $f(S)$ variant are given following the $f(R)$ derivation below.

In contrast to the preceding section, only first derivatives are present in the action, and the integration by parts requires an additional term, because the torsion affects the expression for covariant divergence:

[TABLE]

The total variation of the Lagrangian then becomes

[TABLE]

where the derivative tensor is denoted by $P_{ab}:=[f^{\prime}(R)]_{ab}$ for brevity.

In addition to the stress-energy tensor $\mathcal{T}$ , a new quantity is necessary to reflect the fact that matter fields can, in general, depend on the connection – if only through the covariant derivative. The hyper-momentum tensor is defined thus:

[TABLE]

and the modified Einstein equations can now be written as

[TABLE]

where the symmetrization is necessary, because the variation $\delta g_{bd}$ is symmetric, even though $R_{bd}$ is not.

The second set of equations can be simplified if an auxiliary connection is defined to be

[TABLE]

and using the associated covariant derivative $\hat{\nabla}$ , the second set of Einstein equations reads

[TABLE]

Additionally, contraction over the pair of indices $\{cd\}$ leads to

[TABLE]

which allows us to rewrite the main equations as

[TABLE]

As in the ordinary $f(R)$ formulation, the torsion equations become algebraic for the Einstein-Hilbert case $f(R)=R$ because $P^{ab}=[f^{\prime}(R)]^{ab}=g^{ab}$ , so that derivatives of $\Gamma$ only appear in $R$ . Further, if the matter fields are such that $\mathscr{Q}_{abc}\equiv 0$ , contractions of the torsion equations give

[TABLE]

This means that if ${T^{s}}_{sa}=0$ , then $2C_{(ad)c}=T_{dac}$ , and it follows immediately from (30) that $K_{abc}=C_{abc}$ . But that, by definition, means the connection must be the Levi-Civita one.

In other words, for $f(R)=R$ , zero hyper-momentum and totally antisymmetric torsion, the theory becomes standard general relativity. Note that for this to happen it is not necessary to assume zero torsion from the beginning, just that all its traces vanish.

Since the Ricci tensor is, in general, no longer symmetric, the tensor $P$ cannot be used directly to define a new metric for which equation (42) would define a metric connection. In the standard $f(\mathscr{R})$ theories, the tensor that enters is $R$ itself, and it can be decomposed into (anti)symmetric parts at the level of the Einstein equations, as the function $f$ is applied only to its trace, and all $f({R^{a}}_{a})$ terms are just scalars.

Here, the situation is different in that even in the first set of equations the symmetrization is applied to $Rf^{\prime}(R)$ , not to $R$ , and the second set of equations contains $f(R)$ , not $f^{\prime}(R)$ . Because even for the second power one has $g^{bc}X_{c(d}X_{a)b}\neq X_{(ab)}g^{bc}X_{(cd)}$ , symmetrizing the equations would not lead to a single distinguished tensor to be used as the new metric. Moreover, even though the components of $R$ are real, it seems natural to consider a self-adjoint matrix, for which the action is directly related to the eigenvalues as in (8).

These problems could be overcome by constructing the action with the symmetric tensor $S$ , introduced before, whose variation is simply $\delta S_{ab}=\frac{1}{2}(\delta R_{ab}+\delta R_{ba})$ . The derivation is essentially the same as in (36), and the difference is that the tensor contracted with $\delta g$ is already symmetric, so the Einstein equations are

[TABLE]

where now, by a slight abuse of notation, $P_{ab}=[f^{\prime}(S)]_{ab}$ , and the auxiliary covariant derivative is the one given by equation (39).

As before, the trace can be used to rewrite the second equation as (42), and following the same reasoning as for the standard $f(R)$ derivation, the torsionless connection with no hyper-momentum yields

[TABLE]

This would indicate that $\hat{\Gamma}$ is the Levi-Civita connection for the metric $P^{da}$ , but the situation is complicated by the fact that the tensor $P=f^{\prime}(R)$ is not conformally related to the original metric $g$ , so the signature might not be the same, and the determinant of $g$ is not directly proportional to that of $P$ ; also, raising of indices in $P$ does not amount to matrix inversion. It should also be kept in mind that with the standard extension of covariant derivative to tensor densities, which uses $\sqrt{|g|}$ to cancel the weight, the above equation can be rewritten as

[TABLE]

but this is not equivalent to

[TABLE]

unless $\sqrt{\det{P}}$ is used to extend $\hat{\nabla}$ to densities. Without specifying which extension is used, the condition $\nabla_{c}(\sqrt{|g|}g_{ab})=0$ does not necessarily indicate metricity, contrary to what can sometimes be found in the literature. Because of this freedom, and given the problems with inverting $P$ , the more fundamental equation (45) is better as an indication of a metric connection in the present case.

With some effort, the Christoffel formula can be used to express $\hat{\Gamma}$ as a function of derivatives of $P$ , but the derivatives of the connection coefficients are still involved in the nonlinear term $f^{\prime}(S)$ . The question is then whether they can be eliminated with the help of the remaining equations.

In the standard approach, the first set of the Einstein equations (44) can, in principle, be used to solve for the Ricci scalar and accordingly simplify the second set by using the Ricci tensor associated with the new metric and its Levi-Civita connection [3]. Here, one would have to solve nonlinear equations for the whole tensor $S$ in order to eliminate the connection in the same manner. At present, it appears that this path of investigation is not applicable, because the equations involve full tensors $R$ or $S$ , not just their traces.

5 FRW dynamics

The standard cosmological model is the basic example that needs to be considered in order to gain insight into the applicability of the proposed modification. The model assumes spatial homogeneity and isotropy, requiring the Robertson-Walker geometry, which in spherical coordinates $\{t,r,\theta,\varphi\}$ has the metric

[TABLE]

where $\text{d}\Omega^{2}=\text{d}\theta^{2}+\sin^{2}\theta\text{d}\varphi$ is the standard metric on the unit sphere. The final assumption in this first attempt at modified cosmology will be that the RW metric provides the only dynamical variable – the scale factor $a(t)$ – the connection is that of Levi-Civita and the metric formalism can be used.

Accordingly, the matter source will be taken to be a homogeneous perfect fluid with density $\rho$ and pressure $p$ , so that the stress energy tensor is

[TABLE]

where the four-velocity in these coordinates is just $u=\partial_{t}$ .

There are then effectively only two modified Einstein equations, one of third order and one of fourth corresponding to the $\mathscr{T}_{00}$ and $\mathscr{T}_{11}$ components of (29) respectively. However, the latter follows from the derivative of the former, which is the generalization of the Friedmann equation

[TABLE]

where $\lambda$ are the eigenvalues of $R$

[TABLE]

$H$ is the Hubble “constant” $H=\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{a}/a$ , and the overdot denotes the time derivative.

The present value of the constant, $H_{0}:=H(0)$ , is customarily used to obtain dimensionless quantities and, as discussed in Section 2, there is still an unspecified constant in the function $f$ . Although $H_{0}^{2}$ has the suitable dimension, it will not do as $C_{0}$ , because the function $f$ should be a fundamental quantity valid for all gravitational actions, not just the FRW cosmology, and thus cannot be defined with such specific constants. Instead, $C_{0}$ will become a physical parameter of the new theory, and the Hubble constant $H_{0}$ will serve to provide the dimensionless counterpart $c_{0}:=C_{0}H_{0}^{-2}$ .

Of course, the roles could be reversed, with $C_{0}$ used instead of $H_{0}$ , but for initial clarity it is better to keep with the convention of rescaling densities, time, etc., with $H_{0}$ . The dimensionless eigenvalues are then

[TABLE]

which gives e.g. $f(\lambda_{0})=f\left(\alpha H_{0}^{-2}\right)$ and leads to further simplification

[TABLE]

and similarly for $\beta$ . The main equation can then be rewritten as

[TABLE]

where $h$ , the density parameter and dimensionless time are defined by

[TABLE]

The function $F$ can then be specified with any suitable number of dimensionless parameters including $c_{0}$ . It could be considered to be given a priori by some elementary function like $A\sin(B\xi)$ , or defined by infinitely many expansion coefficients as the series (6). Yet to consider such coefficients as independent parameters would be to multiply entities beyond necessity, so I will adopt the former approach here.

A quantitative reason can also be given for this, in anticipation of the observational analysis. Finding the coefficients from the data would undoubtedly lead to better and better fits as the number of coefficients increases, but such a fit would come with a huge cost as measured by the Akaike or Bayesian information criteria, which are now standard tools of observational cosmology [9, 10].

As for the nature of parameters in the present case, some more information can be gleaned from the zeroth- and first-order expansions of $F$ , as they reproduce the standard model with the cosmological constant. The general form666As before, $\xi$ is just an auxiliary independent variable used to define functions and their rescalings. is $F(\xi)=F_{0}+F_{1}\xi$ , but the overall rescaling of the Lagrangian is not important, and taking $F_{1}=1$ gives the ordinary Friedmann equation

[TABLE]

upon identifying the cosmological constant $\Lambda=-2H_{0}^{2}F_{0}$ . In terms of the original function $f$ , this means that $f_{0}=-\Lambda/2$ , and it suggests that the cosmological constant itself could be used as a fundamental dimensional quantity by

[TABLE]

with $\tilde{f}$ carrying no other free parameters. Using the respective density parameter $\Omega_{\Lambda}:=\Lambda/3$ , this means that

[TABLE]

where the expansion of $\tilde{f}$ is then necessarily restricted to

[TABLE]

Turning now to the dynamics of this model, a minimal set of variables yielding a closed system can be built from the derivatives of $a(t)$ , or rather their rescaled versions $h$ and $\alpha$ , which are identically related by $\alpha=3(\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{h}+h^{2})$ . Also, the other of the eigenvalues can be eliminated through

[TABLE]

although for shorter notation it will be better to keep the symbol $\beta$ and understand it as a function of $a$ , $h$ and $\alpha$ , which will be the replacements for $\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{a}$ , $\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{..}}{a}$ and $\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{...}}{a}$ .

Because the conservation law $\nabla^{a}\mathscr{T}_{ab}=0$ still holds, the matter-energy density $\rho$ is expressible in terms of $a$ if one assumes an equation of state

[TABLE]

Finally, introducing

[TABLE]

for the sake of brevity, a dynamical system with three degrees of freedom described by the variables $\{\alpha,h,a\}$ is obtained:

[TABLE]

where the dot now refers to the new time $\tau$ . Note that the denominator of $v_{1}$ would only be identically zero for the purely linear $F$ , which is the standard general relativity. The form of $v_{2}$ and $v_{3}$ is dictated by the definition of $h$ and the essential dynamics lies with $v_{1}$ . This is also where we find the difference in complexity between the new theory and $f(\mathscr{R})$ , for which $v_{2}$ and $v_{3}$ are the same, but the first equation would read

[TABLE]

The difference between the two equations in the simplest quadratic case $F(\xi)=-\frac{3}{2}\Omega_{\Lambda}+\xi+F_{2}\xi^{2}$ is just $(\Omega_{\Lambda}+\Omega-h^{2})/(2F_{2}h)$ , which is nonzero exactly when the evolution deviates from the Friedmann equation. As was mentioned in Section 2, if $\hat{R}$ vanishes, then a simple rescaling of $F$ also leads to the same equations but in this particular geometry the condition is very restrictive. For flat universes (as in the examples below) the only solutions with this property are the de Sitter ones, $h=\text{const}$ , which do not exhaust all possible solutions, even when $\Omega=0$ . On the other hand, the difference disappears completely if we take different functions: $F(\xi)=F_{1}+\xi+F_{2}\xi^{2}$ for $f(R)$ and $\widetilde{F}(\xi)=4F_{1}+\xi+3F_{2}\xi^{2}$ for $f(\mathscr{R})$ ; the theories are equivalent for the Robertson-Walker geometry at the quadratic level, even when $\hat{R}\neq 0$ . However, no such simple relation could be found for cubic terms.

A general feature of the main system (63) is that if the geometry is flat, i.e., $k=0$ and the density does not depend on the scale factor, like for the cosmological constant, then the first two equations decouple and give a planar system. In fact, one could simply assume that no ordinary matter enters the equations as $\Omega$ , but instead consider the higher-order terms of $F$ as some sort of field imitating matter. For example, if $F(\xi)=-\frac{3}{2}\Omega_{f}+\xi+\tfrac{1}{2}F_{2}\xi^{2}$ , the main equation (54) becomes

[TABLE]

so that $\Omega_{f}$ acts as dark energy and the $F_{2}$ term acts as effective material content.

Another general, and problematic, feature of the $\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{\alpha}$ equation is the singularity at $h=0$ , i.e., when expansion changes to contraction and vice versa. This is not a singularity of equation (54) and can lead to a valid solution provided that the numerator of $v_{1}$ vanishes as well. Thus, care has to be taken when using the dynamical system form, because the singularities might simply signify that the left-hand side of the original equation is zero, and vice versa: a zero of $v_{1}$ might in fact be a singularity of the original equation (63).

5.1 Examples of cosmological models

A very basic example illustrating these features is to take a flat, empty universe and assume the exponential function

[TABLE]

which includes the linear action, but no cosmological constant in the usual sense. The specific form of $v_{1}$ is then

[TABLE]

where the additional factors in the arguments are only introduced to shorten the formula. It is still essentially transcendental, so one has to resort to qualitative analysis first to locate the points and regions of interest. This can be done with the help of Figure 1, which shows the planar vector field $(v_{1},v_{2})$ together with the locations of singular lines and zeros of the right-hand side $v$ (left panel), and the phase portrait constructed from typical trajectories (right panel); the particular value of $\Omega_{f}=3/2$ was chosen.

The left and right saddle points $A_{1}$ and $A_{2}$ correspond to time-reversed de Sitter and standard de Sitter solutions, respectively, and their positions $(h_{0},3h_{0}^{2})$ are given by $h_{0}=\pm\sqrt{2\Omega_{f}w/3}$ , where $w$ is the positive solution of $\mathrm{e}^{-2w}+w=1$ .

The singular critical point $B_{1}$ could be considered as a static solution because it lies on the singular line $h=0$ , but also on the $W=0$ line, so in fact equation (54) is satisfied. For the vector field, on the other hand, the limit at $B_{1}$ is not well defined, as it depends on the path.

Importantly, there are no periodic orbits on either side of $B_{1}$ , as the line $h=0$ separates the neighbourhood of $B_{1}$ into two elliptic sectors of opening $\pi$ . The “closed” trajectories have $B_{1}$ as their limit point, so they are asymptotically static both in the past and in the future.

More physically realistic evolutions here seem to consist of trajectories that are attracted by $A_{2}$ and subsequently scattered along the unstable direction towards infinity. These are expanding universes with ever increasing acceleration, and also with initial singularity, which can be read from the phase portrait: going back back in time, the trajectory has increasingly negative $\alpha$ , and discarding the exponentially small terms for large $h$ and $\alpha$ the right-hand side is approximately

[TABLE]

making $\alpha$ and $h$ diverge in finite (negative) time.

There are also two mixed cases – i.e., trajectories going from a big bang becoming asymptotically static as they tend to $B_{1}$ and vice versa: asymptotically static in the past, but then getting scattered by $A_{2}$ into accelerated expansion. These exemplary behaviours of the scale factor and the Hubble constant are plotted in Figure 2. Note that the time integration constant $\tau_{0}$ such that $a(\tau_{0})=1$ cannot always be chosen to make $h(\tau_{0})=1$ , so it is adjusted for each trajectory for better visibility in this and subsequent graphs.

It is probably more instructive to consider a more intricate model, which is furnished by taking a rational function

[TABLE]

which includes the constant term, so it can be identified with the cosmological constant as in (58). Note that if the series were to be used, different expansions in different regions would be required. The reduction of the resulting powers of $R$ with the characteristic polynomial would have to be carried out separately, which would lead to cumbersome expressions – if it were possible to obtain closed ones at all.

Direct substitution of this $F$ into (63) produces a $v_{1}$ which is several lines long, so it is perhaps best to skip its specific form and, similarly to before, view the vector field and the various singular lines of the phase space; they are shown in the left panel of Figure 3. The picture is now considerably more complex, with many more singular points of type $B$ , for which both the numerator and denominator in $\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{\alpha}$ vanish. These points signify a possible crossings through the otherwise impassable barriers indicated by the red lines.

There are still only two critical points $A_{1}$ , $A_{2}$ located at $\left(\mp\sqrt{\tfrac{1}{2}\Omega_{\Lambda}},\tfrac{3}{2}\Omega_{\Lambda}\right)$ , which are asymptotic equilibria, and as before, they correspond to time-reversed de Sitter and standard de Sitter solutions, respectively. However, as the phase diagram of Figure 3 shows, there are now two heteroclinic trajectories connecting them, one through $B_{1}$ at $\left(0,\tfrac{9}{2}\Omega_{\Lambda}\right)$ and the other through $B_{2}$ at $\left(0,\tfrac{3}{2}\Omega_{\Lambda}\right)$ .

There is a complication here, not present in the previous example, though. The horizontal green lines at $\pm\tfrac{3}{2}\Omega_{\Lambda}$ are singularities of $F$ , and so also of the Friedmann equation, but they cancel out in $v_{1}$ , resulting in the straight-line trajectories. These are not singularities of curvature either, because $\alpha$ and $h$ remain finite, so if one considers the action principle as purely formal to obtain the dynamical equations, these solutions could have some physical meaning.

A similar situation is found for the pair $B_{3}$ and $B_{4}$ located at $\left(\pm\sqrt{\Omega_{\Lambda}},-\tfrac{3}{2}\Omega_{\Lambda}\right)$ , except that the whole line can be thought of as just one trajectory for which $h$ goes from $\infty$ to $-\infty$ in finite time. On both lines, the second equation $\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{h}=v_{2}$ can be integrated to give

[TABLE]

where the integration constant $\tau_{0}$ can be complex, giving in effect three types of functions: tangent for the trajectory on the lower line, hyperbolic tangent for the $A_{1}A_{2}$ segment, and hyperbolic cotangent for the trajectories on the upper line that escape to $\pm\infty$ . The dependence of the scale factor and $h$ on time for these cases is shown in Figure 4. Additionally, the trajectories coming from infinity qualitatively reflect the behaviour of the generic trajectories in the respective region in Figure 3; in particular, the past singularity is reached in finite time.

Outside the singular lines, there are the two special heteroclinic orbits: from $A_{1}$ through $B_{1}$ to $A_{2}$ and from $B_{4}$ to $A_{2}$ . The first is possible, because the equation can be regularized by considering $\alpha$ as a function of $h$ so that $\alpha^{\prime}(h)=v_{1}/v_{2}$ , which leads to a local expansion at $B_{1}$

[TABLE]

This trajectory is similar to the one through $B_{2}$ but avoids the problem of singular action. The second case, upon closer inspection, also admits continuation through $B_{4}$ , as is revealed by switching again to $h_{1}:=h-\sqrt{\Omega_{\Lambda}}$ as the independent variable. The series for $\alpha$ can then be found

[TABLE]

Both of these solutions are shown in Figure 5, the first is probably the best candidate for a “bounce” universe, and the second has a big-bang singularity.

Looking more closely at the behaviour at infinity also reveals an asymptotic relation of the form $\alpha\sim-6h^{2}$ , which, together with the two previous expansions, suggests looking for the equation of the extended separatrix involving $\alpha+6h^{2}$ . Indeed, it turns out that there is a parabola through $B_{3}$ , $A_{1}$ , $B_{1}$ , $A_{2}$ $B_{4}$ given by

[TABLE]

which is an invariant set, i.e,

[TABLE]

as can be checked by direct substitution.

Eliminating $\alpha$ from $U=0$ leaves a simple Riccati equation $2\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{h}=3\Omega_{\Lambda}-6h^{2}$ , which again gives trigonometric solutions for $h$ and $a$ akin to (70) – in particular, for the big-bang type

[TABLE]

which is the behaviour of the standard Friedmann cosmology with the so-called stiff matter characterized by $p=\rho$ . The same equation of state holds also for a minimally coupled massless scalar field $\phi$ for which the energy density is just the kinetic term $\rho=\frac{1}{2}\dot{\phi}^{2}$ , or approximately when the potential term can be neglected: $\frac{1}{2}\dot{\phi}^{2}\gg V(\phi)$ . This suggests a correspondence analogous to that of standard $R^{2}$ theories, which are conformally equivalent to scalar field cosmologies [11].

The introduction of matter through a nonzero $\Omega$ term means that the system (63) can no longer be simply visualized on a plane, but particular solutions can still easily be obtained numerically. The most important ingredient would be dust matter ( $\gamma=1$ ), and following that, radiation ( $\gamma=\tfrac{4}{3}$ ), but since the latter constitutes a tiny fraction of $\Omega$ in the standard $\Lambda$ CDM model, $\Omega=\Omega_{m}a^{-3}$ was assumed in the numerical integration. Thus, this particular model will depart from reality close to the Big Bang by ignoring the radiationr-dominated GUT era and the inflationary phase, when the value of $\Lambda$ is much larger than the $\Lambda$ CDM one used below.

A surprising property to notice is that the parabola (73) is still an invariant set, and accordingly equation (75) gives a bing-bang solution also with dust. This is due to the singular nature of the denominator in the $\Omega/F^{\prime\prime}$ term in $v_{1}$ . Although this means that the stiff matter component dominates in the earliest epochs, the “effective equation of state” $p/\rho$ changes with time as the de Sitter state is reached. By analogy with the standard Friedmann cosmology, one can eliminate $p$ from the second Einstein equation to obtain the time-dependent adiabatic index as

[TABLE]

This function can be used to compare the behaviour of the density for the present model and the corresponding Friedmann equation including the stiff matter term, i.e.,

[TABLE]

The comparison is shown in Figure 6.

In the present case, there is no constraint on the sum of all the $\Omega$ terms, and $\Omega_{\Lambda}$ and $\Omega_{m}$ need not be the same as in the $\Lambda$ CDM model, because the Einstein equations are different. The parameter values are both subject to estimation from observations, but for the present qualitative comparison one can use the asymptotic behaviour of (75): $a\sim\exp(\sqrt{\Omega_{\Lambda}/2}\tau)$ , which should correspond to the relevant asymptotics of $\Lambda$ CDM, i.e., $a\sim\exp(\sqrt{0.7}\tau)$ , so that $\Omega_{\Lambda}=1.4$ was chosen for the $f(R)$ equations.

At any rate, the comparison shows that the universe whose trajectory lies in the first quadrant (Figure 3) and tends to the de Sitter attractor $A_{2}$ has $\gamma=2/3$ during the big bang (red in Figure 6), so it corresponds to cosmic strings [13, 14]. This is peculiar, because it means that the matter term ( $a^{-3}$ ) must be cancelled close to the initial singularity, so that only the $a^{-2}$ term matters instead. It happens due to the trajectory approaching the horizontal singular line of $\alpha=3\Omega_{\Lambda}/2$ so asymptotically the solution (70) holds and $a\sim(\tau-\tau_{0})$ . The transition from cosmic strings directly to exponential expansion makes this class of trajectories unlikely as physical models.

The heteroclinic trajectory (green in Figure 6) is unchanged by dust with $\gamma\approx 2$ stationary at first, then decaying to the “dark energy” level. This decrease is faster than for the corresponding $\Lambda$ CDM with stiff matter (blue), but the agreement is much better than in the previous scenario. The shape resembles more that of the standard $\Lambda$ CDM (orange) in that there is no cusp, although different types of matter dominate initially. This in itself is not an obstacle, as it is unlikely that classical GR and dustlike matter determine the initial singularity anyway, and in the bouncing scenarios $a^{\prime\prime}(0)=0$ , so that $\gamma$ could even tend to infinity.

An interesting analogy here is that the heteroclinic trajectory is unchanged by the addition of dust, so that it can be thought of as defined purely by the geometry and the function $f(R)$ – quite as the cosmological constant can be thought of as a geometric term rather than an actual material component. In both cases, such content-independent gravity only makes sense as a model for the late homogeneous universe, not at smaller scales like black holes. Note also that this particular example (69) was deliberately chosen with a singularity so that it cannot be treated perturbatively. By itself, it may not be a replacement for $\Lambda$ CDM, but its most prominent feature, the invariant manifold $U=0$ , appears as a guidepost in further generalizations, partly because it effectively reduces the fourth-order Einstein equations to an analogue of the Friedmann equation, which is easily solvable. One goal of future investigations will thus be to find models where such invariant curves exist and are nontrivially perturbed by matter.

Coming back to the general dynamics, an undesirable global feature of dynamics with a singular $F(\xi)$ is that the phase space is cut into several regions by the red lines and the trajectories cannot be continued through them even with local analysis because the vector field’s directions are opposite on each side. Nevertheless, $A_{2}$ is a steady state attractor for almost the whole first quadrant, and there are two heteroclinic scenarios without singularities.

This behaviour is more pronounced when one considers more peculiar setups – for example, with the periodic Lagrangian

[TABLE]

Because $F$ enters the equations with the rescaled eigenvalues $\alpha$ and $\beta$ as its arguments, it is more convenient to eliminate $h$ and use the eigenvalues as the dependent variables. In order to do that, a rescaled time $\text{d}\sigma:=\text{d}\tau/h$ can be used, giving for the flat case

[TABLE]

This setup gives rise to a period cell structure of the phase space, as seen in Figure 7, and there are infinitely many critical points and heteroclinic orbits to choose from.

At present this cannot be considered to be more than a toy model, but it hints at the possibility of constructing a phase space with compartments for different epochs of evolution separated by the singular lines and transitions taking place through the critical points. The behaviours of $h$ and $a$ would need to be recovered from that of $\alpha$ and $\beta$ in order to give physical interpretation, and at first glance, it is hard to judge whether the complexity comes from the choice of dependent variables, or is an intrinsic feature of the tensor $f(R)$ theory.

The determination of the actual (real, if one can call it that) $F(\xi)$ , or $f$ , is a question in itself, and at present it is hard to imagine what other fundamental theory could provide it. At the very least it should be constrained by observations, but some new approach will be required not to merely fit subsequent polynomial approximations of a series if one wants to recover the complete function.

6 Observational formulae

In order to assess the applicability of the proposed construction one must turn to observational cosmology. The detailed numerical analysis is outside the scope of this article and will be deferred to future work. Nevertheless, some preparatory analysis is straightforward and can be given here.

The standard cosmological test relies on the supernovae Ia data and the relationship between the redshift and luminosity. In the Friedmann case, there is a direct relation between $H^{2}$ and the redshift, so the integration of time and distance is straightforward. Here, the equations involve up to the third derivative of the scale factor, so another route needs to be taken: for small redshifts, a series formula binding various expansion coefficients can be given, while in the general case, the dynamical system has to be integrated.

Recall first that the redshift is linked to the scale factor by $z+1=a^{-1}$ , for $a(0)=1$ at present, and that the luminosity distance to an object at comoving distance $r$ is $d_{L}=r(1+z)$ . Provided, then, that $r$ can be expressed by $z$ , this will allow us to calculate the apparent luminosity and relate to observations [12].

The required expression follows from the condition of the null geodesic: $\text{d}s^{2}=0$ , which for the metric (48) gives directly

[TABLE]

where a limit is understood for $k=0$ . Assuming that $a$ or $z$ are monotonic functions of $t$ that can be used for parametrization of the light path, the above can be rewritten as

[TABLE]

In the standard model, $H$ is simply given as a function of $z$ by the Friedmann equation, and the integral can even be explicitly calculated by means of elliptic functions [13]. As mentioned above, this cannot be done here, but following [13], the main equation can be used to give constraints of the higher characteristicss – the deceleration parameter $q$ and the jerk $j$ :

[TABLE]

A change of the independent variable from $t$ (or $\tau$ ) to $z$ immediately gives

[TABLE]

which then allows us to expand $h$ in the integral (81) in powers of $z$ , so that the whole expression can be expanded as

[TABLE]

For small $z$ , this provides a means to finding $H_{0}$ , $q_{0}$ and $j_{0}$ from the luminosity data, but one also has to take into account that these parameters are not independent. In the standard model, $q$ can be eliminated because $h^{\prime}(z)$ is an explicit function of $z$ and the density parameters $\Omega$ . Similarly here, the jerk is constrained by the main equation, which for this purpose becomes

[TABLE]

with

[TABLE]

So, given the function $F$ , the constraint on $j_{0}$ is

[TABLE]

Finally, to obtain the luminosity distance for larger redshifts, where a series expansion is not practicable, an augmented dynamical system is a straightforward solution. Assuming again that $z$ can be used as the independent variable, as is the case in exponential expansion, a dynamical equation for $d_{L}$ is necessary instead of the integral (81).

The null geodesic condition gives

[TABLE]

and denoting the dimensionless distance by $l=H_{0}d_{L}$ leads to

[TABLE]

while the basic system now reads

[TABLE]

Because $z$ has become the independent variable, this system is non-autonomous and only two-dimensional (regardless of $k$ and $\Omega$ ). Even in the Friedmann case, for more complex $H(z)$ , the integral (81) has to be obtained numerically. The only complication here is that three ordinary differential equations need to be integrated; their initial conditions follow from the definitions

[TABLE]

7 Conclusions

The main modification of the gravitational action proposed here is to include terms nonlinear in curvature, but going further than polynomials, so that rational functions with a finite radius of convergence or even transcendental functions can be used. Additionally, instead of considering just a function of the Ricci scalar $f(\text{tr}[R])$ , the whole tensor can be treated as an argument, and the trace taken at the very end to produce a scalar Lagrangian density $\text{tr}[f(R)]$ . In the case of transcendental functions, this considerably changes the results, when compared to the ordinary $f(\mathscr{R})$ theories.

With a view to fully general treatment, such as including spin, the presented derivation is valid for affine connections with nonvanishing torsion and without the assumption of metricity. An important consequence is that for nonsymmetric Ricci tensors one can no longer introduce an obvious metric conformal to the original $g_{ab}$ . This stems from the nonlinear functions of the Ricci tensor entering the equations, instead of just functions of the Ricci scalar multiplying $R_{ab}$ or $g_{ab}$ .

Despite the difficulties, workable equations can be derived and applied to the Robertson-Walker geometry so that the analogue of the standard cosmological model may be studied. As is generally the case, the modified Einstein equations are of higher order, and instead of one Friedmann equation, one has a three-dimensional dynamical system.

An obvious complication is that the dynamical variables enter the equations both inside and outside the transcendental functions, which leaves little hope for explicit solutions. Nevertheless, these models are within reach and if the function $f$ is determined from other fundamental principles, the dynamics and observational consequences can still be effectively analysed, as shown here.

The analysis of phase portraits for both rational and transcendental $f$ reveals critical points which are attractors and which correspond to de Sitter solutions. More importantly, there also exist non-singular “big bounce” evolutions, which are heteroclinic trajectories, and explicit solutions for them can be given. For the dynamical systems to be two-dimensional it was assumed that the curvature was zero and no ordinary matter was present. On the one hand this allows for a complete visualization of the phase diagram, but on the other, it limits the physical applicability. Still, the late or present Universe with accelerated expansion can be modelled as the de Sitter attractor, while for the big bounce solutions the scale factor does not approach zero, so that matter density never dominates and neglecting it is justifiable.

If dustlike matter is included, the separatrix of the above simplified rational model survives and the same explicit solutions hold. One still has both big bounce and big bang solutions, not unlike those of $\Lambda$ CDM with stiff matter. In general, matter changes the early evolution around the separatrix but not on it. Thus, the next possible step in constructing a viable model seems to be identifying $f(R)$ such that it also has an invariant submanifold, but which depends on $\Omega_{m}$ , not just on the geometry and $\Lambda$ .

In any case, the elegant feature here is that the cosmological constant can appear naturally because of how the theory is constructed – it is identified with the constant term of $f(R)$ . Yet, even when this term was zero ( $f=\exp{}-1$ ), the same sort of accelerated expansion appeared.

A more detailed study of the initial singularity in the presence of matter and curvature index $k$ could lead to more interesting results still. For example, seeing how one of the scenarios imitates stiff matter, it will be interesting to ask if such cosmologies can be equivalent to standard general relativity with a scalar field, similarly to the ordinary $R^{2}$ case. It is also the quadratic $f(R)$ case for the Robertson-Walker geometry, when there is an equivalence with the ordinary $f(\mathscr{R})$ , although it does not seem to extend to higher orders. Another convergence is found when the traceless Ricci tensor vanishes, so that $R_{ab}$ is proportional to $g_{ab}$ and the Einstein equations for both theories coincide. However, as the examples show, even for an empty universe this might correspond only to fixed points, not to general solutions of the full theory.

With a view to future work, some observational formulae are also given, so that the basic cosmological tests can be applied. A comparison to the standard model is in order to help guide the subsequent theoretical developments. Specifically some constraints on the function $f$ should be obtained. The crudest way would be to fit the first coefficients of its expansion, but of course there is no hope in recovering the whole series this way.

Rather, one might want to approach the problem by trying to fit a differential equation satisfied by $f$ . Already for linear differential equations with rational coefficients this would reduce the number of parameters to finite, while at the same time allowing for a the vast family of (confluent) hypergeometric functions and their generalizations.

Future investigations could also address the question of reduction of the order of the dynamical system (63). For the Einstein-Hilbert action, the third derivative of the scale factor does not enter, and only the Friedmann equation, which is a relation between $H$ and $a$ , is left. Here, the equation involving the third derivative of the scale factor, or $\overset{\raisebox{-0.43057pt}[0.0pt][0.0pt]{\scaleobj{1}{.}}}{\alpha}$ , would be reduced if $3F^{\prime\prime}(\alpha)+F^{\prime\prime}(\beta)=0$ . For independent $\alpha$ and $\beta$ this happens only if $F$ is linear, so that GR is recovered.

If, on the other hand, there is a relation $\beta=\psi(\alpha)$ , then a nontrivial solution to the functional equation $3J(\alpha)+J(\psi(\alpha))=0$ could potentially be found. Such a relation is in itself a second-order differential equation for the scale factor, so the dynamics is simplified, but it then also means that the function $F$ is determined by $F^{\prime\prime}(\xi)=J(\xi)$ .

Ideally however, the function $f$ should be mainly constrained by experiment not just the simplicity of the resulting equations. If this theory passes the basic cosmological tests, analysing it in a wider context of gravitational physics will help address this issue. Questions of instabilities will have to be answered, although as suggested by [3], the Palatini approach, applicable here, provides a setting to avoid at least the Ostrogradski instability. In general, issues such as ghost fields, semiclassical stability and post-Newtonian (Solar System) tests will be required, and hopefully undertaken, to ascertain the overall viability of the presented extension.

Bibliography14

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Alexei A. Starobinsky, “A new type of isotropic cosmological models without singularity”, Phys. Lett. B 91 , 1, 99-102 (1980).
2[2] Gianluca Allemandi, Andrzej Borowiec and Mauro Francaviglia, “Accelerated cosmological models in Ricci squared gravity”, Phys. Rev. D 70 , 103503 (2004).
3[3] Gonzalo J. Olmo, “Introduction to Palatini theories of gravity and nonsingular cosmologies”, chapter 7 in Open Questions in Cosmology , ed. G.J. Olmo, In Tech Publishing (2012).
4[4] Shin’ichi Nojiri and Sergei D. Odintsov, “Unified cosmic history in modified gravity: From F(R) theory to Lorentz non-invariant models”, Physics Reports, 505, 2–4, 59–144 (2011).
5[5] Thomas P. Satiriou and Valerio Faraoni, “ f ( R ) 𝑓 𝑅 f(R) theories of gravity”, Reviews of Modern Physics, 82(1), 451 (2010).
6[6] Andrzej Borowiec, Nonlinear Lagrangians of the Ricci Type, ar Xiv preprint gr-qc/9906043 (1999).
7[7] Andrzej Borowiec, “Metric-polynomial structures and gravitational Lagrangians”, in Institute of Physics Conference Series 173 , 241–244, (2002).
8[8] Nicholas J. Higham, Functions of Matrices , SIAM (2008).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Tensor f(R)f(R)f(R) theory of gravity

Abstract

1 Introduction

2 Construction of the modified action

2.1 Definitions and notation

3 The metric approach

4 The Palatini approach

5 FRW dynamics

5.1 Examples of cosmological models

6 Observational formulae

7 Conclusions

Tensor $f(R)$ theory of gravity