Sachs equations for light bundles in a cold plasma

Karen Schulze-Koops; Volker Perlick; Dominik J. Schwarz

arXiv:1705.04810·gr-qc·October 20, 2017

Sachs equations for light bundles in a cold plasma

Karen Schulze-Koops, Volker Perlick, Dominik J. Schwarz

PDF

TL;DR

This paper generalizes Sachs equations to describe light bundle propagation in a cold plasma-filled universe, revealing modifications to cosmological distances and redshift relations without relying on Einstein's equations.

Contribution

It introduces a new formulation of Sachs equations accounting for cold plasma effects, extending previous models to non-vacuum cosmological contexts.

Findings

01

Modified reciprocity law in plasma environments

02

Small changes in cosmological redshift predictions

03

Altered Hubble law in plasma-filled cosmology

Abstract

We study the propagation of light bundles in non-empty spacetime, as most of the Universe is filled by baryonic matter in the form of a (dilute) plasma. Here we restrict to the case of a cold (i.e., pressureless) and non-magnetised plasma. Then the influence of the medium on the light rays is encoded in the spacetime dependent plasma frequency. Our result for a general spacetime generalises the Sachs equations to the case of a cold plasma Universe. We find that the reciprocity law (Etherington theorem), the relation that connects area distance with luminosity distance, is modified. Einstein's field equation is not used, i.e., our results apply independently of whether or not the plasma is self-gravitating. As an example, our findings are applied to a homogeneous plasma in a Robertson-Walker spacetime. We find small modifications of the cosmological redshift of frequencies and of the…

Equations229

ω_{p} (x) = \frac{e ^{2} n _{e} ( x )}{m _{e}}, f_{p} (x) = \frac{ω _{p} ( x )}{2 π} = 8980 n_{e} (x) Hz cm^{3/2} .

ω_{p} (x) = \frac{e ^{2} n _{e} ( x )}{m _{e}}, f_{p} (x) = \frac{ω _{p} ( x )}{2 π} = 8980 n_{e} (x) Hz cm^{3/2} .

\frac{d x ^{μ}}{d s} = \frac{\partial H}{\partial p _{μ}}, \frac{d p _{μ}}{d s} = - \frac{\partial H}{\partial x ^{μ}}, H (x, p) = 0 .

\frac{d x ^{μ}}{d s} = \frac{\partial H}{\partial p _{μ}}, \frac{d p _{μ}}{d s} = - \frac{\partial H}{\partial x ^{μ}}, H (x, p) = 0 .

H (x, p) = \frac{1}{2} g^{μν} (x) p_{μ} p_{ν} .

H (x, p) = \frac{1}{2} g^{μν} (x) p_{μ} p_{ν} .

K^{μ} = \frac{\partial H}{\partial p _{μ}} = g^{μν} p_{ν}, g_{μν} K^{μ} = p_{ν} .

K^{μ} = \frac{\partial H}{\partial p _{μ}} = g^{μν} p_{ν}, g_{μν} K^{μ} = p_{ν} .

K^{μ} = ω U^{μ} + k^{μ}, U^{ν} k_{ν} = 0,

K^{μ} = ω U^{μ} + k^{μ}, U^{ν} k_{ν} = 0,

ω = - p_{ν} U^{ν}

ω = - p_{ν} U^{ν}

ω = k_{μ} k^{μ} .

ω = k_{μ} k^{μ} .

\displaystyle\frac{\omega_{e}}{\omega_{r}}=\frac{p_{\nu}U^{\nu}\big{|}_{e}}{p_{\sigma}U^{\sigma}\big{|}_{r}}=\frac{g_{\mu\nu}K^{\mu}U^{\nu}\big{|}_{e}}{g_{\rho\sigma}K^{\rho}U^{\sigma}\big{|}_{r}}

\displaystyle\frac{\omega_{e}}{\omega_{r}}=\frac{p_{\nu}U^{\nu}\big{|}_{e}}{p_{\sigma}U^{\sigma}\big{|}_{r}}=\frac{g_{\mu\nu}K^{\mu}U^{\nu}\big{|}_{e}}{g_{\rho\sigma}K^{\rho}U^{\sigma}\big{|}_{r}}

\frac{λ _{r}}{λ _{e}} = \frac{ω _{e}}{ω _{r}} .

\frac{λ _{r}}{λ _{e}} = \frac{ω _{e}}{ω _{r}} .

E_{1} \mapsto cos α E_{1} + sin α E_{2} + a_{1} K,

E_{1} \mapsto cos α E_{1} + sin α E_{2} + a_{1} K,

E_{2} \mapsto - sin α E_{1} + cos α E_{2} + a_{2} K,

Y_{i} = D_{i 1} E_{1} + D_{i 2} E_{2} + y_{i} K .

Y_{i} = D_{i 1} E_{1} + D_{i 2} E_{2} + y_{i} K .

\frac{d ^{2}}{d s ^{2}} D = D Z .

\frac{d ^{2}}{d s ^{2}} D = D Z .

\displaystyle Z_{hj}=g\bigl{(}E_{h},R(K,E_{j},K)\bigr{)}\,.

\displaystyle Z_{hj}=g\bigl{(}E_{h},R(K,E_{j},K)\bigr{)}\,.

\displaystyle Z_{hj}=\frac{1}{2}\mathrm{Ric}(K,K)\delta_{hj}+g\bigl{(}E_{h},C(K,E_{j},K)\bigr{)}\,,

\displaystyle Z_{hj}=\frac{1}{2}\mathrm{Ric}(K,K)\delta_{hj}+g\bigl{(}E_{h},C(K,E_{j},K)\bigr{)}\,,

\frac{d}{d s} D = D S,

\frac{d}{d s} D = D S,

\frac{d}{d s} S + S S = Z .

\frac{d}{d s} S + S S = Z .

S = (0 Ω - Ω 0) + (σ_{1} σ_{2} σ_{2} - σ_{1}) + (θ 0 0 θ),

S = (0 Ω - Ω 0) + (σ_{1} σ_{2} σ_{2} - σ_{1}) + (θ 0 0 θ),

ρ = θ + i Ω, σ = σ_{1} + i σ_{2} .

ρ = θ + i Ω, σ = σ_{1} + i σ_{2} .

\frac{d ρ}{d s} + ρ^{2} + ∣ σ ∣^{2} = \frac{1}{2} Ric (K, K)

\frac{d ρ}{d s} + ρ^{2} + ∣ σ ∣^{2} = \frac{1}{2} Ric (K, K)

\displaystyle\frac{d\sigma}{ds}+2\theta\sigma=\frac{1}{2}\,g\bigl{(}E_{1}+iE_{2},C(K,E_{1}+iE_{2},K)\bigr{)}\,.

\displaystyle\frac{d\sigma}{ds}+2\theta\sigma=\frac{1}{2}\,g\bigl{(}E_{1}+iE_{2},C(K,E_{1}+iE_{2},K)\bigr{)}\,.

D = R M

D = R M

D = (cos ψ sin ψ - sin ψ cos ψ) (D_{+} 0 0 D_{-}) (cos χ - sin χ sin χ cos χ) .

D = (cos ψ sin ψ - sin ψ cos ψ) (D_{+} 0 0 D_{-}) (cos χ - sin χ sin χ cos χ) .

\displaystyle\frac{dD_{\pm}}{ds}\,+\,i\,D_{\pm}\,\frac{d\chi}{ds}\,-\,i\,D_{\mp}\,\frac{d\psi}{ds}\,=\,D_{\pm}\big{(}\overline{\rho\,}\,\pm\,\sigma\,e^{-2i\chi}\big{)}

\displaystyle\frac{dD_{\pm}}{ds}\,+\,i\,D_{\pm}\,\frac{d\chi}{ds}\,-\,i\,D_{\mp}\,\frac{d\psi}{ds}\,=\,D_{\pm}\big{(}\overline{\rho\,}\,\pm\,\sigma\,e^{-2i\chi}\big{)}

\displaystyle H(x,p)=\frac{1}{2}\Bigl{(}g^{\mu\nu}(x)p_{\mu}p_{\nu}+\omega_{p}(x)^{2}\Bigr{)}\,,

\displaystyle H(x,p)=\frac{1}{2}\Bigl{(}g^{\mu\nu}(x)p_{\mu}p_{\nu}+\omega_{p}(x)^{2}\Bigr{)}\,,

g_{μν} \frac{d x ^{μ}}{d s} \frac{d x ^{ν}}{d s} = - ω_{p}^{2},

g_{μν} \frac{d x ^{μ}}{d s} \frac{d x ^{ν}}{d s} = - ω_{p}^{2},

ω = ω_{p}^{2} + k_{μ} k^{μ},

ω = ω_{p}^{2} + k_{μ} k^{μ},

d ℓ = k_{μ} k^{μ} d s = ω^{2} - ω_{p}^{2} d s .

d ℓ = k_{μ} k^{μ} d s = ω^{2} - ω_{p}^{2} d s .

\frac{λ _{r}}{λ _{e}} = \frac{ω _{e}^{2} - ω _{p e}^{2}}{ω _{r}^{2} - ω _{p r}^{2}}

\frac{λ _{r}}{λ _{e}} = \frac{ω _{e}^{2} - ω _{p e}^{2}}{ω _{r}^{2} - ω _{p r}^{2}}

z \equiv \frac{λ _{r} - λ _{e}}{λ _{e}}, Z \equiv \frac{ω _{e} - ω _{r}}{ω _{r}} .

z \equiv \frac{λ _{r} - λ _{e}}{λ _{e}}, Z \equiv \frac{ω _{e} - ω _{r}}{ω _{r}} .

1 + Z (z, ω_{r}) = (1 + z) 1 + \frac{1}{ω _{r}^{2}} (\frac{ω _{p e}^{2}}{( 1 + z ) ^{2}} - ω_{p r}^{2}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Sachs equations for light bundles in a cold plasma

Karen Schulze-Koops1,2, Volker Perlick2 and Dominik J. Schwarz1

1 Fakultät für Physik, Universität Bielefeld, Postfach 100131, 33501 Bielefeld, Germany

2 ZARM, University of Bremen, 28359 Bremen, Germany.

Email: [email protected], [email protected]

Abstract

We study the propagation of light bundles in non-empty spacetime, as most of the Universe is filled by baryonic matter in the form of a (dilute) plasma. Here we restrict to the case of a cold (i.e., pressureless) and non-magnetised plasma. Then the influence of the medium on the light rays is encoded in the spacetime dependent plasma frequency. Our result for a general spacetime generalises the Sachs equations to the case of a cold plasma Universe. We find that the reciprocity law (Etherington theorem), the relation that connects area distance with luminosity distance, is modified. Einstein’s field equation is not used, i.e., our results apply independently of whether or not the plasma is self-gravitating. As an example, our findings are applied to a homogeneous plasma in a Robertson-Walker spacetime. We find small modifications of the cosmological redshift of frequencies and of the Hubble law.

1 Introduction

For most applications of general relativity to astrophysics light propagation may be modelled in terms of rays and the influence of a medium may be neglected. Then light rays are lightlike geodesics of the spacetime metric. However, the Universe is filled by a dilute medium which affects the propagation of light in various ways. Examples are dust grains, gas and molecular clouds, and most commonly plasma. Here we focus on the effects of a plasma on the propagation of bundles of light rays and in the context of general relativity. While plasma effects on the propagation111We do not consider effects like Thomson scattering of visible light in a dilute cold plasma, by which only a small number of individual photons are randomly scattered out of or into a light bundle. of visible light are negligible, plasmas do effect radio waves. Here are two examples. Firstly, the travel time of a radio signal from a pulsar to the observer on the Earth is influenced by the interstellar medium. This is usually expressed in terms of the so-called dispersion measure which is one of the most important observables in pulsar astronomy, see e.g. [1, 2]. Secondly, the Solar corona has an effect on the travel time and on the bending angle of radio signals; the relevant formulas, using the linearised version of gravity, have been derived by Muhleman et al. [3, 4].

In these two examples, and in most other applications to astrophysics, the medium may be modelled as a non-magnetised pressureless (“cold”) plasma. Then the influence of the medium onto the light rays is determined by the plasma frequency $\omega_{p}$ which is related to the electron number density $n_{e}$ by

[TABLE]

Here $e$ is the electric charge of the electron, and $m_{e}$ is the mass of the electron. Plasma effects modify the vacuum results by terms of order $(\omega_{p}(x)/\omega(x))^{2}$ and derivatives of that ratio, where $\omega(x)$ is the observed frequency with respect to a chosen observer. If $\omega$ is much bigger than $\omega_{p}$ , the influence of the medium is negligible, i.e., light rays are lightlike geodesics of the spacetime metric. If $\omega$ is bigger than $\omega_{p}$ but not much bigger, the influence of the plasma is palpable. If $\omega$ is lower than $\omega_{p}$ , light cannot travel through the plasma at all. In all applications to astrophysics and cosmology, the plasma frequency is in the radio regime. E.g., in the Solar corona the electron density drops from $\sim 10^{8}$ cm*-3* at the solar limb to $\sim 10^{3}$ cm*-3* at $10\,R_{\odot}$ , which corresponds to $f_{p}$ varying between $\sim 100$ MHz and $\sim 0.1$ MHz [5, 6, 7]. In the interstellar space the plasma frequency is typically a few kHz, while in the intergalactic space it can be as low as a few Hz. This is the reason why effects from plasmas may be safely neglected in the optical frequency range.

Without a low frequency radio telescope in a space orbit or on the Moon all radio signals from the Universe must pass Earth’s ionosphere in which $n_{e}\sim 10^{6}$ cm*-3*. Thus Earth’s ionosphere is not transparent for frequencies below 10 MHz. Low frequency radio waves ( $10$ MHz to $300$ MHz) are currently observed with modern radio interferometers, such as the Giant Metrewave Radio Telescope (GMRT) [8] and the Low Frequency Array (LOFAR) [9]. Both are multi-purpose observatories. There are also several instruments dedicated to the detection of the 21 cm line of hydrogen and its intensity fluctuation at high redshifts, i.e. from the epochs of reionisation and cosmic dawn, such as the Murchison Widefield Array (MWA) [10]. The best imaging resolution at the lowest frequencies, the aspect most relevant to this work, is currently obtained by means of LOFAR. The upgrade LOFAR2.0 will lead to further increase of sensitivity. The next generation of instruments will include a low frequency instrument of the Square Kilometre Array (SKA-LOW), which will improve todays sensitivities and survey speeds at frequencies between $50$ and $300$ MHz. The space-based array Orbiting Low Frequency Antenna for Radio astronomy (OLFAR), using a swarm of nano-satellites in Moon orbits, has been suggested [11, 12]. This is supposed to operate in the frequency range of 0.1 to 15 MHz, so it may be able to detect plasma effects that are not observable until now.

In the two examples mentioned above the gravitational field is weak, so the linearised version of general relativity is sufficient. However, one may ask if there are effects of a plasma on light near black holes and other compact objects where the linearised theory is not applicable. Then the theory of general relativity has to be used without approximations. The exact formula for the bending angle in a plasma whose density depends only on the radial coordinate was calculated on the Schwarzschild spacetime and in the equatorial plane of the Kerr spacetime by Perlick [13]. For further discussions of plasma effects in strong gravitational fields see Bisnovatyi-Kogan and Tsupko [14, 15, 16], Morozova, Ahmedov and Tursunov [17], Er and Mao [18], Rogers [19, 20, 21] and Perlick, Tsupko and Bisnovatyi-Kogan [22, 23].

Here we want to discuss the influence of a plasma on the geometry of light bundles. This is of relevance for image distortion (which is coded in the shear of the bundle) and for distance measures (which are coded in the expansion of the bundle). To the best of our knowledge, this question has not been investigated before. (There is a paper by Noonan [40] who addresses the influence of a refractive medium on the geometry of light bundles in general relativity. Note, however, that his equations are not valid if the index of refraction depends on the frequency, i.e., they do not apply to a dispersive medium such as a plasma.) We restrict to ray optics in a non-magnetised pressurefree (cold) plasma. The fundamental equations, which will be briefly reviewed below, are derived and discussed in Perlick [13]. For a magnetised plasma see Breuer and Ehlers [24, 25]. For early work on light propagation in dispersive media on a general-relativistic spacetime we refer to Synge [26], Madore [27], Bičák and Hadrava [28], Anile and Pantano [29, 30].

The paper is organised as follows: In Section 2 we review the known theory of light bundles in vacuum on a general-relativistic spacetime. In Section 3 we introduce the relevant definitions for light bundles in a plasma, following the vacuum case as closely as possible; based on these definitions, we derive the propagation equations (Sachs equations) for such bundles and we prove a plasma version of the reciprocity relation (Etherington law). We illustrate the general results with a homogeneous plasma on a Robertson-Walker spacetime in Section 4 and conclude in Section 5.

We use greek indices taking values 0, 1, 2, 3 and, occasionally, latin indices taking values 1, 2. Greek indices, for which Einstein’s summation convention is in force, are lowered and raised with the spacetime metric $g_{\mu\nu}$ and its inverse $g^{\mu\nu}$ , respectively. We use units making $c$ and $\hbar$ equal to 1 so that there is no difference between the 4-momentum of a light ray (i.e., of a photon) and the corresponding wave covector. Our choice of signature is $(-,+,+,+)$ . When we use index notation we define the components of the Riemann tensor by $R_{\mu\nu\sigma\tau}=g\big{(}\partial_{\mu},\nabla_{\partial_{\nu}}\nabla_{\partial_{\sigma}}\partial_{\tau}-\nabla_{\partial_{\sigma}}\nabla_{\partial_{\nu}}\partial_{\tau}\big{)}$ and the components of the Ricci tensor by $R_{\nu\tau}=R^{\sigma}{}_{\nu\sigma\tau}$ .

2 Light propagation in vacuum

2.1 The Hamiltonian for light rays in vacuum

Light rays are the solutions to Hamilton’s equations

[TABLE]

Here $x=x(s)$ denotes a curve, parametrised by a real quantity $s$ , in spacetime, which describes the light ray and $p=p(s)$ is its canonical momentum. For light rays in vacuum on a general-relativistic spacetime, it is well known that the Hamiltonian is

[TABLE]

In this case the solutions to (2) are the lightlike geodesics of the spacetime metric. For a derivation of general-relativistic ray optics in vacuum from Maxwell’s equations see, e.g., Schneider, Ehlers and Falco [31].

For the tangent vector $K^{\mu}=dx^{\mu}/ds$ of a light ray, (3) implies

[TABLE]

With respect to an observer with 4-velocity $U^{\nu}$ , normalised as usual by the condition $g_{\mu\nu}U^{\mu}U^{\nu}=-1$ , the tangent vector to the light ray can be decomposed into a part parallel and a part perpendicular to the 4-velocity of the observer,

[TABLE]

where

[TABLE]

is to be interpreted as the frequency as measured by the observer and $k^{\mu}$ is the spatial wave vector. This decomposition puts the dispersion relation $H(x,p)=0$ into the familiar form

[TABLE]

The definition of the frequency immediately implies the redshift formula

[TABLE]

where $e$ and $r$ stand for “emitter” and “receiver”, respectively. Owing to (7), we may rewrite the redshift formula in terms of the wavelength $\lambda=2\pi/\sqrt{k_{\mu}k^{\mu}}$ rather than in terms of the frequency $\omega$ ,

[TABLE]

It is often convenient to think of light propagation in terms of classical point particles (“classical photons”) that move along the trajectories determined by Hamilton’s equations (2). In this interpretation we have to identify the frequency $\omega$ with the energy of a classical photon. In our units, making $c$ and $\hbar$ equal to unity, the unit of a frequency is the same as the unit of an energy, namely 1/length. This choice of a unit for $\omega$ fixes the unit for any other scalar quantity: The Hamiltonian $H$ , which should not be confused with the energy of a photon, has dimension 1/length2, the curve parameter $s$ has dimension length2 and $x^{\mu}p_{\mu}$ is dimensionless. The dimension of tensor components, such as e.g. $g_{\mu\nu}$ , $p_{\mu}$ or $U_{\mu}$ , depend of course on the dimensions of the chosen coordinates.

If we multiply the Hamiltonian with an arbitrary function $f(x,p)$ that has no zeros, the light rays remain unchanged up to a reparametrisation. This will be of crucial relevance for our treatment in the plasma case. In vacuum, however, we will always use the parametrisation adapted to the Hamiltonian (3). For each light ray, this parametrisation is unique up to an affine reparametrisation. As usual, we refer to it as to an affine parametrisation. A change of the affine parametrisation corresponds to a multiplication of $K^{\mu}$ , $p_{\mu}$ , $\omega$ and $k^{\mu}$ by a factor which is constant along the light ray. Note that the frequency ratio (8) is unaffected by such a reparametrisation.

2.2 Light bundles in vacuum

It is now our goal to re-derive the equations that determine the shape and the size of light bundles in vacuum. This is known material, the relevant results go back to Jordan, Ehlers and Sachs [32] and to Sachs [33]. In Section 3 below we will then investigate if and how these results can be generalised to the plasma case.

Our notation follows Perlick [34]. For most parts of the discussion of light bundles we find it convenient to use coordinate-free notation. Then the tangent vector field $K$ to an affinely parametrised light ray in vacuum has to satisfy the conditions $g(K,K)=0$ and $\nabla_{K}K=0$ , where $g$ is the spacetime metric and $\nabla$ is the Levi-Civita derivative of $g$ .

Definition 2.1 (Light bundle)

Let $\lambda$ be an affinely parametrised light ray in vacuum and $K$ its tangent vector field. An (infinitesimally thin) light bundle along $\lambda$ is a set $\mathcal{B}=\big{\{}c_{1}Y_{1}+c_{2}Y_{2}\,\big{|}\,c_{1}^{2}+c_{2}^{2}\leq 1\big{\}}$ where $Y_{1}$ and $Y_{2}$ are two vector fields along $\lambda$ such that

(a)

$\nabla_{K}\nabla_{K}Y_{i}=R(K,Y_{i},K)$ * for $\>i=1,2$ ;*

(b)

$g(K,Y_{i})=0$ * for $\>i=1,2$ ;*

(c)

$Y_{1},Y_{2}$ * and $K$ are linearly independent almost everywhere.*

In condition (a) $R$ denotes the Riemannian curvature tensor. This condition is the Jacobi equation (also known as the geodesic deviation equation) which expresses the fact that $Y_{1}$ and $Y_{2}$ are connecting vectors from the central geodesic $\lambda$ to an infinitesimally close neighbouring geodesic. Condition (b) makes sure that these neighbouring geodesics are again lightlike and that $Y_{1}$ and $Y_{2}$ span a spacelike plane at all points where they are linearly independent. By condition (c), this is true almost everywhere. This condition guarantees that the cross-section of the bundle is two-dimensional except, possibly, at isolated points where this cross-section may collapse to a point or to a line. Such points are known as the caustic points of the bundle.

As one wants to write the Jacobi equation for $Y_{1}$ and $Y_{2}$ in matrix form, it is usual to introduce an appropriate basis of vector fields along $\lambda$ .

Definition 2.2 (Sachs basis)

A pair of vector fields, $(E_{1},E_{2})$ , along a light ray $\lambda$ with tangent vector field $K$ is called a Sachs basis if

(a)

$g(E_{i},E_{j})=\delta_{ij}$ * for $\>i,j=1,2$ ,*

(b)

$g(K,E_{i})=0$ * for $\>i=1,2$ ,*

(c)

$\nabla_{K}E_{i}=0$ * for $\>i=1,2$ .*

A Sachs basis along $\lambda$ is unique up to transformations

[TABLE]

where $\alpha$ , $a_{1}$ and $a_{2}$ are constant. The two vectors of a Sachs basis may be interpreted as spanning a screen. Choosing, at one point of $\lambda$ , a timelike vector $U$ that is to be interpreted as the 4-velocity of an observer singles out all Sachs bases perpendicular to $U$ . They are unique up to spatial rotations, i.e., up to transformations (10) with $a_{1}=a_{2}=0$ . In this sense, the transformation (10) may be interpreted as the combination of a spatial rotation, determined by $\alpha$ , and a boost, determined by $a_{1}$ and $a_{2}$ .

By Definition 2.2 the vectors $K$ , $E_{1}$ and $E_{2}$ span the orthocomplement of $K$ at each point of $\lambda$ . We can thus write our bundle vectors $Y_{i}$ as linear combinations of these vectors,

[TABLE]

We call the $2\times 2$ matrix $\boldsymbol{D}=(D_{ij})$ the Jacobi matrix. It determines the shape of the cross section of the light bundle in the Sachs basis.

Plugging (12) into the Jacobi equation and applying the operator $g(E_{h},\cdot)$ yields the matrix Jacobi equation

[TABLE]

Here $s$ is the affine parameter along the light ray, i.e., $\nabla_{K}f=df/ds$ for scalar functions $f$ , and $\boldsymbol{Z}=(Z_{hj})$ is the optical tidal matrix, defined by

[TABLE]

$\boldsymbol{Z}$ is symmetric, $Z_{hj}=Z_{jh}$ , because of the symmetry properties of the Riemann tensor. With the help of the well-known decomposition of the curvature tensor into Weyl tensor $C$ and Ricci tensor $\mathrm{Ric}$ (see e.g. Wald [35], p. 40, but note that our sign convention for the Ricci tensor is different) the optical tidal matrix can be rewritten as

[TABLE]

where we have used that $g(K,K)=0$ .

2.3 Sachs equations for light bundles in vacuum

With the deformation matrix $\boldsymbol{S}$ defined by

[TABLE]

the matrix Jacobi equation (13) reads

[TABLE]

Here we have assumed that $\boldsymbol{D}$ has full rank, which is the case almost everywhere (see Definition 2.1). Decomposing $\boldsymbol{S}$ into antisymmetric, symmetric-tracefree and trace parts,

[TABLE]

defines the rotation $\Omega$ , the shear $(\sigma_{1},\sigma_{2})$ , and the expansion $\theta$ of the bundle which are usually combined into the two complex optical scalars

[TABLE]

We read from (17) and (18) that the optical scalars have the same dimension as the curve parameter $s$ , namely that of an area. Then the matrix equation (17) gives the two complex Sachs equations

[TABLE]

Note that the deformation matrix $\boldsymbol{S}$ and, thus, the optical scalars characterise the change of the shape of the light bundle. If the optical scalars are known, the shape is determined by solving the first-order differential equation (16) for the matrix $\boldsymbol{D}$ . We may use a polar decomposition of $\boldsymbol{D}$ ,

[TABLE]

where $\boldsymbol{R}$ is orthogonal and $\boldsymbol{M}$ is symmetric and, thereupon, diagonalise the matrix $\boldsymbol{M}$ with the help of another orthogonal matrix. As the product of two orthogonal matrices is again orthogonal, this gives us a parametrisation of the matrix $\boldsymbol{D}$ in terms of two angles, $\psi$ and $\chi$ , and two eigenvalues, $D_{+}$ and $D_{-}$ ,

[TABLE]

Under a transformation (10) of the Sachs basis, the four parameters of the Jacobi matrix change according to $D_{\pm}\mapsto D_{\pm}$ , $\chi\mapsto\chi-\alpha$ , $\psi\mapsto\psi$ . We see that the parameter $\alpha$ has the only effect of rotating the angle $\chi$ by a constant amount while the parameters $a_{1}$ and $a_{2}$ have no influence at all. This demonstrates the important fact that the shape and the size of the cross-section of a light bundle are independent of the Sachs basis, i.e., of the chosen screen. This result was proven by Sachs [33] and we will refer to it as to Sachs’s theorem.

In terms of $D_{+}$ , $D_{-}$ , $\psi$ and $\chi$ , the matrix equation (16) can be rewritten, after a bit of algebra, as

[TABLE]

where an overbar denotes complex conjugation.

3 Light propagation in a plasma

3.1 The Hamiltonian for light rays in a plasma

The Hamiltonian for light rays in a plasma on a general-relativistic spacetime was rigorously derived by Breuer and Ehlers [24, 25] from Maxwell’s equations coupled to two charged fluids, a positively charged fluid modelling the ions and a negatively charged fluid modelling the electrons. By linearising the equations of motion for the electrons about a background field and assuming that only the electrons could follow a rapidly oscillating wave motion they derived a coupled system of linear wave equations for the perturbations of the electromagnetic field and of the electron fluid. The transition to ray optics was performed by a two-scale method, sending to infinity simultaneously the frequency and the scale on which the background fields vary. Breuer and Ehlers allowed for an arbitrary electromagnetic background field, i.e., they considered a magnetised plasma, but they assumed the electron fluid to be pressureless. (This is often called a cold plasma.) Then the resulting equations of motion for the light rays could be put into Hamiltonian form (2), where the Hamiltonian was an eighth-order polynomial in the momentum coordinates. At a more fundamental level, the two-fluid plasma model which is the starting point for the derivation by Breuer and Ehlers can be derived from kinetic theory. In this case one would start out from the general-relativistic Boltzmann equation without a collision term (also known as Liouville equation or Vlasov equation) and derive phenomenological two-fluid, or multi-fluid, flows from that, see e.g. [37]. In this derivation the equivalence principle is used, assuming that the plasma couples minimally to the gravitational field, i. e., that there are no curvature couplings. With the simplifying assumption that the pressure can be neglected, the phenomenological plasma model of Breuer and Ehlers results. Here, in contrast to the work of Breuer and Ehlers, we want to consider the considerably simpler case of a non-magnetised pressureless plasma. For this case, the derivation of the Hamiltonian for the light rays can also be found in the book by Perlick [13]. The Hamiltonian takes the simple form

[TABLE]

where $\omega_{p}(x)$ is a scalar function on the spacetime known as the plasma frequency. In the derivation $\omega_{p}(x)^{2}$ comes about as a multiple of the (background) number density function $n_{e}$ of the electrons, as was already anticipated in Eq. (1). As usual in relativity, $n_{e}$ is defined with respect to the rest system of the electron fluid, so it is an invariant scalar (i.e., independent of which coordinates are chosen on spacetime). Therefore, also the plasma frequency is an invariant scalar..

Note that the Hamiltonian (25) depends on the density of the plasma but not on its 4-velocity, i.e., it is Lorentz invariant. Light rays in a medium whose Hamiltonian is of this form have been studied in detail in the book by Synge [26]. Apparently, Synge was not aware of the fact that a non-magnetised pressureless plasma is an example of such a medium.

For $\omega_{p}\to 0$ , the Hamiltonian (25) reproduces of course the vacuum Hamiltonian (3). The most important difference between the plasma case and the vacuum case is in the fact that (3) is homogeneous with respect to the momentum coordinates while (25) is not. This reflects the property of a plasma to be dispersive, i.e., the property that the path of a light ray depends not only on the spatial initial direction but also on the frequency. For a detailed discussion of ray optics in dispersive and non-dispersive media from a spacetime perspective we refer to Perlick [13].

As in vacuum, the tangent vector $K^{\mu}=dx^{\mu}/ds$ of a light ray satisfies (4). The equation $H(x,p)=0$ is, thus, equivalent to

[TABLE]

which demonstrates that light rays in a plasma are timelike curves.

With the frequency defined by (6), we read from (4) that the redshift formula (8) is valid in the plasma as well. If decomposed into frequency and spatial wave vector, the dispersion relation $H(x,p)=0$ in a plasma reads

[TABLE]

which demonstrates that, at a spacetime point $x$ , only frequencies $\omega>\omega_{p}(x)$ are possible. The limiting case $\omega=\omega_{p}$ corresponds to an observer whose 4-velocity $U^{\mu}$ is tangent to the light ray; for such an observer the spatial wave vector $k^{\mu}$ is zero and (momentarily) “the light ray stands still”. Note that, in terms of the decomposition (5) with respect to an observer, the length element along the light ray as measured by this observer is

[TABLE]

We may rewrite the redshift law (8) in terms of the wavelength $\lambda=2\pi/\sqrt{k_{\mu}k^{\mu}}$ , rather than in terms of the frequency. However, instead of the vacuum relation (9) we have now to use the equation

[TABLE]

in accordance with the dispersion relation (27). Therefore, in a plasma we have to distinguish between a wavelength redshift $z$ and a frequency redshift $Z$ ,

[TABLE]

They are related by the equation

[TABLE]

The easiest way of verifying this relation is to start from the right-hand side of (31) and to express z by its defining equation, given in (30). Then substituting for $\lambda_{r}/\lambda_{e}$ from (29) gives immediately $\omega_{e}/\omega_{r}$ , i.e, the left-hand side of (31).

We consider the frequency as the primary quantity and the wavelength as secondary. There are two reasons for this, one from a conceptual point of view and one from the point of view of a practitioner: Firstly, when light enters from one medium into another its wavelength changes while its frequency remains the same. Secondly, plasma effects are non-negligible only in the radio regime where common spectroscopic measuring devices determine the frequency and not the wavelength. This is in agreement with what Appenzeller [36] writes on p.54 of his text-book on astronomical spectroscopy: “In general, using the frequency is preferable as (it) is an intrinsic property of the radiation, whereas the wavelength depends on the medium in which the wave is propagating … Counting oscillations is relatively easy in the radio bands up to 100 GHz …”. On the other hand, the wavelength redshift is also of relevance. E.g., modern arrays of radio telescopes like LOFAR, MWA, SKA, etc are related to wavelength measurements. Also, we will see in Section 4 below that for a homogeneous plasma on a Robertson-Walker spacetime the wavelength redshift is related to the scale factor by the same equation as in vacuum whereas the frequency redshift is given by a rather complicated expression.

We have already mentioned that, up to parametrisation, the solutions to the system (2) are unchanged if we multiply the Hamiltonian with a nowhere vanishing function. In the plasma case it is often advantageous to switch to the dimensionless Hamiltonian

[TABLE]

This is possible on any domain where $\omega_{p}$ has no zeros. Solving Hamilton’s equations

[TABLE]

with this transformed Hamiltonian shows that the light rays in the plasma are the timelike geodesics of the conformally rescaled metric

[TABLE]

Up to a constant factor, the dimensionless new parameter $\tilde{s}$ is proper time with respect to the metric $\tilde{g}{}_{\mu\nu}$ . It is related to the old parameter $s$ by

[TABLE]

in particular

[TABLE]

With the help of (36) one may rewrite the redshift law (8) in terms of $\widetilde{K}{}^{\mu}$ ,

[TABLE]

3.2 Light bundles in a plasma

For the following considerations we restrict ourselves to a spacetime region where the plasma density $\omega_{p}(x)$ has no zeros. On this region the conformally rescaled metric (34) is well defined and we may use for light rays the parametrisation adapted to the Hamiltonian $\tilde{H}$ . Then the tangent vector field $\widetilde{K}$ to a light ray has to satisfy the equations $\tilde{g}\big{(}\widetilde{K},\widetilde{K}\big{)}=-1$ and $\widetilde{\nabla}_{\widetilde{K}}\widetilde{K}=0$ where $\widetilde{\nabla}$ is the Levi-Civita derivative of the metric $\tilde{g}$ .

We now define the notion of a light bundle in the plasma, following the construction for the vacuum case as closely as possible.

Definition 3.1 (Light bundle)

Let $\tilde{\lambda}$ be a light ray in the plasma, parametrised by proper time $\tilde{s}$ with respect to the metric $\tilde{g}$ , and $\widetilde{K}$ its tangent vector field. An (infinitesimally thin) light bundle along $\tilde{\lambda}$ is a set $\mathcal{B}=\big{\{}c_{1}\widetilde{Y}{}_{1}+c_{2}\widetilde{Y}{}_{2}\,\big{|}\,c_{1}^{2}+c_{2}^{2}\leq 1\big{\}}$ where $\widetilde{Y}{}_{1}$ and $\widetilde{Y}{}_{2}$ are two vector fields along $\tilde{\lambda}$ such that

(a)

$\widetilde{\nabla}_{\widetilde{K}}\widetilde{\nabla}_{\widetilde{K}}\widetilde{Y}_{i}=\widetilde{R}(\widetilde{K},\widetilde{Y}_{i},\widetilde{K})$ * for $\>i=1,2\,$ ;*

(b)

$\tilde{g}(\widetilde{K},\widetilde{Y}_{i})=0$ * for $\>i=1,2\,$ ;*

(c)

$\widetilde{Y}_{1}$ * and $\widetilde{Y}_{2}$ are linearly independent almost everywhere.*

In condition (a), $\tilde{R}$ denotes the Riemannian curvature tensor of the metric $\tilde{g}$ . This condition makes sure that $\widetilde{Y}{}_{1}$ and $\widetilde{Y}{}_{2}$ are connecting vectors to neighbouring $\tilde{g}-$ geodesics. Condition (b) implies that these neighbouring geodesics are again timelike and parametrised by proper time with respect to $\tilde{g}$ , and that $\widetilde{Y}{}_{1}$ and $\widetilde{Y}{}_{2}$ span a spacelike plane at each point where they are linearly independent. The third condition makes sure that this is true almost everywhere, i.e., that the bundle has a two-dimensional cross-section except, possibly, at some caustic points.

Here it is important to realise that Definition 3.1 allows the neighbouring light rays to have arbitrary frequencies. We do not attempt to define something like “a bundle of light rays with a given frequency”, and it is hard to see how this could be done. Even in cases where we have a distinguished observer field with respect to which the frequency could be defined, we would have to face the problem that the frequency is not in general conserved along a light ray.

The following definition is crucial for all that follows.

Definition 3.2 (Sachs basis)

A pair of vector fields, $(\widetilde{E}{}_{1},\widetilde{E}{}_{2})$ , along a light ray $\tilde{\lambda}$ with tangent vector field $\widetilde{K}$ , is called a Sachs basis if

(a)

$\tilde{g}(\widetilde{E}_{i},\widetilde{E}_{j})=\delta_{ij}$ * for $\>i,j=1,2$ ,*

(b)

$\tilde{g}(\widetilde{K},\widetilde{E}_{i})=0$ * for $\>i=1,2$ ,*

(c)

$\widetilde{\nabla}_{\widetilde{K}}\widetilde{E}_{i}=0$ * for $\>i=1,2$ .*

The choice of a timelike vector $U$ at one point of $\tilde{\lambda}$ that is not tangent to $\tilde{\lambda}$ singles out those Sachs bases that are perpendicular to $U$ . As in the vacuum case, they are unique up to transformations

[TABLE]

with a constant $\alpha$ . However, two arbitrary Sachs bases along a light ray in the plasma are related by a transformation that is quite different from the vacuum case (10). We will not write down such a general transformation which involves a rotation in all three spatial dimensions.

In the vacuum case, the three vectors $K$ , $E_{1}$ and $E_{2}$ spanned the orthocomplement of $K$ which was lightlike; as the vector fields $\widetilde{Y}{}_{1}$ and $\widetilde{Y}{}_{2}$ were in this orthocomplement, they could be written as a linear combination of these three vectors. By contrast, in a plasma the vectors $\widetilde{K}$ , $\widetilde{E}{}_{1}$ and $\widetilde{E}{}_{2}$ span a 3-dimensional timelike hyperplane in the tangent space at each point, so $\widetilde{Y}{}_{1}$ and $\widetilde{Y}{}_{2}$ cannot be written as a linear combination of them. However, we may write these two vector fields in the form

[TABLE]

where $\widetilde{Y}{}_{i}^{\perp}$ is orthogonal to $\widetilde{K}$ , $\widetilde{E}{}_{1}$ and $\widetilde{E}{}_{2}$ . If we interpret $\widetilde{E}_{1}$ and $\widetilde{E}_{2}$ as two vectors that span a screen, the component $\widetilde{Y}{}_{i}^{\perp}$ is perpendicular to the screen and, thus, irrelevant for the shape of the bundle on the screen. Hence, the shape of the bundle on the screen is determined by the matrix $\widetilde{\boldsymbol{D}}=\big{(}\widetilde{D}{}_{ij}\big{)}$ . The bundle cross-section on the screen is 2-dimensional if the matrix $\tilde{\boldsymbol{D}}$ is non-degenerate. By condition (c) of Definition 3.1, this is true at almost all points of the bundle for almost all choices of the Sachs basis. (However, in contrast to the vacuum case, in the plasma it is possible to make a “bad choice” of the Sachs basis such that the screen is not transverse to the bundle.)

Plugging (40) into the Jacobi equation for the $\widetilde{Y}_{i}$ and applying the operator $\tilde{g}(\tilde{E_{h}},\cdot)$ leads to the matrix Jacobi equation in the plasma,

[TABLE]

where $\tilde{s}$ is the parameter along the light ray (i.e., proper time with respect to the metric $\tilde{g}$ ) and $\widetilde{\boldsymbol{Z}}=(\widetilde{Z}{}_{jh})$ with

[TABLE]

In analogy to the vacuum case, we may use the symmetry of the Riemann tensor $\widetilde{R}$ to conclude that

[TABLE]

and we may decompose $\widetilde{R}$ into the Ricci tensor $\widetilde{\mathrm{Ric}}$ , the Ricci scalar $\tilde{\mathcal{R}}$ and the Weyl tensor $\widetilde{C}$ ,

[TABLE]

where $\tilde{g}(\widetilde{K},\widetilde{K})=-1$ .

3.3 Sachs equations for light bundles in a plasma

In analogy to the vacuum case, we may introduce the deformation matrix $\widetilde{\boldsymbol{S}}$ in the plasma by

[TABLE]

Then the matrix Jacobi equation (41) implies

[TABLE]

From decomposing $\widetilde{\boldsymbol{S}}$ into antisymmetric, symmetric-tracefree and trace parts we would get a set of Sachs equations in terms of the twiddled quantities. However, in view of applications to physics this is not the kind of Sachs equations we want to have in a plasma. The reason is that, according to item (a) of Definition 3.2, our Sachs basis vectors are normalised with respect to the metric $\tilde{g}$ . As a consequence, the matrix $\widetilde{\boldsymbol{D}}$ gives us the bundle cross-section as it is measured with the metric $\tilde{g}$ . Actually, in view of applications to physics we need the bundle cross-section as it is measured with the spacetime metric $g$ . Therefore, we introduce a rescaled Sachs basis,

[TABLE]

and a correspondingly rescaled Jacobi matrix

[TABLE]

The physically relevant optical scalars have to be defined in terms of this modified Jacobi matrix. At the same time, we may change from the parametrisation by $\tilde{s}$ to the parametrisation by $s$ which is given by (35). This has the advantage that the parameter $s$ , in contrast to the parameter $\tilde{s}$ , remains meaningful in the vacuum limit $\omega_{p}\to 0$ .

It is now straight forward, though somewhat tedious, to derive the Sachs equations in a plasma. On the left-hand side of (46), we rewrite $d/d\tilde{s}$ in terms of $d/ds$ with the help of (35), and we express the matrix $\tilde{\boldsymbol{S}}$ in terms of the matrix $\boldsymbol{S}$ defined by

[TABLE]

hence

[TABLE]

On the right-hand side of (46), we express the Ricci tensor and the Weyl tensor of the metric $\tilde{g}$ in terms of the Ricci tensor and the Weyl tensor of the conformally related metric $g$ , using the well-known transformation formulas which are given, e.g., in Appendix D of the book by Wald [35]. If we define the optical scalars as in vacuum by (18) and (19), we get the following set of Sachs equations:

[TABLE]

Note that the Weyl tensor term in (51) goes to 0 if $K$ becomes lightlike. Hence, for $\omega_{p}\to 0$ these equations reduce indeed to the Sachs equations in vacuum.

As the relation between the Jacobi matrix $\boldsymbol{D}$ and the deformation matrix $\boldsymbol{S}$ is unchanged in comparison to the vacuum case, equation (24) is still valid in the plasma if we parametrise the Jacobi matrix according to (23). However, in the plasma there is no analogue to Sachs’s theorem: The eigenvalues $D_{+}$ and $D_{-}$ of the Jacobi matrix and, thus, the shape and the size of a bundle cross-section depend on the Sachs basis. For a given Sachs basis, the corresponding values $D_{+}$ and $D_{-}$ give the size of the cross-section as measured by an observer only for those observers whose 4-velocity is orthogonal to the Sachs basis vectors.

3.4 Reciprocity theorem and related results

In vacuum it is well known that for Jacobi matrices a certain conservation law is satisfied. This gives rise to the so-called reciprocity theorem (also known as the Etherington law [38]) which is of great relevance in view of physics; among other things, it relates the area distance to the luminosity distance which is particularly important for cosmology. In this section we will prove a generalisation of this conservation law for light bundles in a plasma. For a detailed discussion of the vacuum case see, e.g., Perlick [34].

When we defined light bundles in a plasma we considered a central light ray with a parameter $\tilde{s}$ adapted to the Hamiltonian $\tilde{H}$ of (32). We then switched, by (35), to a parameter $s$ adapted to the Hamiltonian $H$ of (25). The parametrisation with respect to $s$ has the obvious advantage that it remains meaningful in the limit $\omega_{p}\to 0$ where the light rays become lightlike geodesics of the spacetime metric and $s$ becomes an affine parameter. For this reason, we use the parametrisation by $s$ in the following theorem.

Theorem 3.1

Let $\lambda$ be a light ray in a plasma, parametrised by the parameter $s$ adapted to the Hamiltonian (25). Let $\boldsymbol{D}{}_{1}$ and $\boldsymbol{D}{}_{2}$ be the Jacobi matrices (48) associated with two bundles along this light ray. Then

[TABLE]

where $(\,.\,)^{T}$ means the transpose of a matrix.

Proof: By (41),

[TABLE]

where we have used that $\widetilde{\boldsymbol{Z}}^{T}=\widetilde{\boldsymbol{Z}}$ . Hence

[TABLE]

$\Box$

The conservation law can be applied to the case that $\boldsymbol{D}{}_{1}=\boldsymbol{D}{}_{2}=\boldsymbol{D}$ . We will now prove that this implies an important property of homocentric bundles, i.e., of bundles where $\boldsymbol{D}$ vanishes at a certain point.

Theorem 3.2

A homocentric bundle is twistfree.

Proof: For a homocentric bundle with Jacobi matrix $\boldsymbol{D}=\boldsymbol{D}{}_{1}=\boldsymbol{D}{}_{2}$ , the conservation law implies that

[TABLE]

As $\boldsymbol{D}$ is generically invertible, this implies that $\boldsymbol{S}=\boldsymbol{S}{}^{T}$ . By (18), $\Omega=0$ .

$\Box$

The most important consequence of the conservation law is the following theorem.

Theorem 3.3 (Reciprocity Theorem)

Let $\lambda$ be a light ray in a plasma, parametrised by the parameter $s$ adapted to the Hamiltonian (25). Let $\boldsymbol{D}{}_{1}$ and $\boldsymbol{D}{}_{2}$ be the Jacobi matrices (48) associated with two bundles along this light ray with

[TABLE]

where $c_{1}$ and $c_{2}$ are positive constants. Then

[TABLE]

If we parametrise the Jacobi matrices according to (23),

[TABLE]

Proof: The conservation law implies that

[TABLE]

Inserting the initial conditions (57) and (58) yields (59). (60) follows by taking the determinant. $\Box$

The reciprocity theorem for light bundles in vacuum was proven by Etherington [38].

We will now discuss the notions of area distance and luminosity distance and their relation in a plasma. To that end we have to consider two bundles along the same light ray, one with a vertex at the receiver and one with a vertex at the emitter, see Fig. 1, and we have to apply the reciprocity theorem. For a discussion of area distance and luminosity distance in vacuum see e.g. Schneider, Ehlers and Falco [31] or Perlick [34].

The area distance $D_{A}$ of an emitter from a receiver is defined as

[TABLE]

We could directly observe the area distance if we had standard rulers, i.e., light sources of a known cross-sectional area. Then the apparent size of such an object in the sky would give us directly its area distance. While we do not have perfect standard rulers, we may at least estimate reasonably well the cross-sectional area of some objects (e.g. galaxies of a certain type) which allows us to determine the area distance of these objects to within a certain accuracy. (Instead of the area, one could consider all angular diameters of the cross-section. The resulting distance measures are called the angular-diameter distances. The area distance is the average of the angular-diameter distances over all transverse directions. In an isotropic situation the notions of area distance and angular-diameter distance coincide.) To work out this definition in a plasma, we consider a light ray parametrised by the parameter $s$ adapted to the Hamiltonian (25), i.e., the tangent vector $K$ of the light ray satisfies $g(K,K)=-\omega_{p}^{2}$ . The parameter runs in the future direction from the value $s_{e}$ at the emission event to the value $s_{r}$ at the reception event. We choose a Sachs basis along the light ray and 4-velocity vectors $U_{e}$ and $U_{r}$ at the emission event and at the reception event, respectively, which are both assumed to be orthogonal to the Sachs basis. We consider a bundle with a vertex at the receiver,

[TABLE]

where $\omega_{r}$ is the frequency of the light ray with respect to an observer with 4-velocity $U_{r}$ . The solid angle of this bundle at the receiver is assumed to be measured with respect to the same observer. The cross-sectional area at the emission event is assumed to be measured by an observer with 4-velocity $U_{e}$ . It is crucial to keep in mind that the following calculation applies only to the case that both $U_{e}$ and $U_{r}$ are orthogonal to the chosen Sachs basis.

If the bundle is parametrised according to (23), the definition of the area distance (62) can be rewritten as

[TABLE]

where $\ell(\varepsilon)$ is the length of the segment of the light ray from $s_{r}-\varepsilon$ to $s_{r}$ , as measured by the observer with 4-velocity $U_{r}$ . As we consider the limit $\varepsilon\to 0$ , we need $\ell(\varepsilon)$ only to within linear order which can be read from (28),

[TABLE]

where $\omega_{pr}$ is the plasma frequency at the reception event. Thus, (64) reads

[TABLE]

As the vertex condition (63) implies that

[TABLE]

we finally find that

[TABLE]

We have chosen the opening angle in (63) such that in vacuum we have just $D_{A}^{2}=\big{|}D_{1+}(s_{e})D_{1-}(s_{e})\big{|}$ .

The corrected luminosity distance $D_{L,\mathrm{corr}}$ is defined as

[TABLE]

If a light source isotropically emits photons at a known rate, the number flux arriving on the Earth would give us directly its corrected luminosity distance. We may calculate it in analogy to the area distance, but now with a bundle that has a vertex at the emitter,

[TABLE]

where $\omega_{e}$ is the frequency of the light ray at the emission event as measured by the observer with 4-velocity $U_{e}$ . The result is

[TABLE]

where $\omega_{pe}$ is the plasma frequency at the emission event.

The (uncorrected) luminosity distance $D_{L}$ differs from $D_{L,\mathrm{corr}}$ by a redshift factor,

[TABLE]

i.e., it is related to the energy flux, rather than to the number flux of photons, that arrives at the receiver. The luminosity distance could be directly observed if we had standard candles, i.e., light sources that isotropically emit photons of a certain frequency at a known rate. Then the energy flux from such an object that arrives on the Earth would give us directly its corrected luminosity distance. As there are light sources that come quite close to being standard candles, e.g. supernovae of type Ia, the luminosity distance may be viewed as a quantity that can be measured reasonably well.

Upon inserting (68) and (71) into (72), the reciprocity law (60) yields

[TABLE]

In the well-known vacuum case, $D_{L}$ and $D_{A}$ are related by a redshift factor squared. In a plasma, we see that only one of these two factors is a frequency ratio; the other one is a wavelength ratio. This reveals a fundamental difference between the vacuum and the plasma case. While in the vacuum case we can ignore the difference between a thermal and a monochromatic light source, this is no longer the case in the plasma. In the vacuum case all frequencies behave in the same way and we can transfer the results for monochromatic light sources to those of arbitrary light sources. The vacuum version of (73) is thus usually written for the bolometric luminosity distance. In the plasma case, however, we have to specify the frequency, i.e., $D_{L}$ is to be understood as based on a monochromatic luminosity.

We repeat that the given formulas for $D_{A}$ and $D_{L}$ in a plasma are valid only if both $U_{e}$ and $U_{r}$ are orthogonal to the chosen Sachs basis. For observers with other 4-velocities, frequencies, angles and cross-sectional areas have to be transformed with the corresponding formulas from special relativity.

4 Example: Spatially homogeneous plasma in a Robertson-Walker spacetime

We specify the spacetime metric to the Robertson-Walker case,

[TABLE]

where $a(t)>0$ is the scale factor and $k$ takes the value 1, 0 or $-1$ depending on whether the spatial curvature is positive, zero or negative. We assume that the plasma density is spatially homogeneous, i.e., that $\omega_{p}$ is a function only of $t$ . Then the Hamiltonian (25) for light rays in the plasma reads

[TABLE]

When discussing light bundles in this spacetime, because of the symmetry it suffices to restrict to the case that the central ray is radial, $d\vartheta/ds=d\varphi/ds=0$ . For such a ray, Hamilton’s equations (2) read

[TABLE]

The frequency of the light ray with respect to the observer field $U=\partial_{t}$ is

[TABLE]

Dividing (78) by (76) and inserting (80) yields, after some rearrangements, the result that

[TABLE]

is constant along the ray.

This gives us the redshift law in the Robertson-Walker universe with a plasma,

[TABLE]

This demonstrates that the wavelength ratio is given in the plasma by the same formula as in vacuum,

[TABLE]

but that the frequency ratio is not, recall (30). If we introduce the Hubble function $H(t)$ by the usual equation

[TABLE]

the wavelength redshift $z$ varies along the ray according to the familiar law

[TABLE]

where $t_{r}$ is kept fixed and $t=t_{e}$ parametrises the ray.

We can estimate the importance of the difference between the wavelength redshift $z$ and the frequency redshift $Z$ by evaluating the plasma frequency as a function of cosmological (wavelength) redshift $z$ . This is easily done adopting the ratio of free electron number density compared with the hydrogen nuclei number density, $x_{e}$ , for the best-fit cosmology provided by the analysis of the Planck team [39]. Then in (1) we can write $n_{e}(z)=n_{B}(0)(1+z)^{3}(1-Y_{p})x_{e}(z)$ , where $n_{B}(0)$ denotes the number density of baryons in the Universe, $Y_{p}$ is the helium mass fraction and corrects for the effects from helium. The factor $(1+z)^{3}$ accounts for the dilution of the number density due to the expansion. $x_{e}(z)$ and $f_{p}(z)$ are plotted in Fig. 2. One can clearly see that the cosmic plasma frequency is well below $10$ MHz, the typical plasma frequency of Earth’s iononsphere, for the photon transparent Universe. This implies that the observable frequency dependent difference (31),

[TABLE]

is tiny.

We will now specify the Sachs equations (51) and (52) to the case of a homogeneous plasma on a Robertson-Walker spacetime. With the help of the constant of motion $C$ we may rewrite the tangent vector to the light ray as

[TABLE]

$C$ is determined if we know the frequency at one particular event along the light ray. In the vacuum case, $\omega_{p}=0$ , choosing different values for the constant of motion has the only effect of choosing different affine parameters for the light ray.

For writing the Sachs equation in the Robertson-Walker universe with a plasma, we have to choose a Sachs basis along our radial light ray. In the case at hand, a natural choice is

[TABLE]

It is easy to see that these two vector fields are, indeed, orthonormal with respect to the metric $\tilde{g}{}_{\mu\nu}=\omega_{p}^{2}g_{\mu\nu}$ . Moreover, with a bit of algebra one can verify that they satisfy $\widetilde{\nabla}_{\widetilde{K}}\widetilde{E}{}_{i}=0$ . The rescaled Sachs basis (47) is then obviously given by

[TABLE]

With $K$ , $E_{1}$ and $E_{2}$ known we may evaluate the Sachs equations (51) and (52). We just have to calculate the Weyl tensor, the Ricci tensor and the Ricci scalar of the Robertson-Walker metric (74) which is an elementary textbook matter. The Weyl tensor vanishes, the non-zero components of the Ricci tensor are

[TABLE]

and the Ricci scalar reads

[TABLE]

The Sachs equations (51) and (52) read

[TABLE]

From (96) we read that the plasma does not produce shear, i.e., if the shear vanishes at one point, then it vanishes along the entire ray. For a bundle with $\sigma=0$ and $\Omega=0$ , the Sachs equations and (23) reduce to

[TABLE]

Differentiating (98) with respect to $s$ and inserting (97) yields

[TABLE]

where we have inserted the constant of motion $C$ from (82). This equation has to be viewed in conjunction with (76), which relates the parameter $s$ to the Robertson-Walker time coordinate $t$ , and (78), which tells how the frequency changes along the ray. If the time-dependence of the scale factor $a(t)$ and of the plasma frequency $\omega_{p}(t)$ has been specified, integration of this system of equations determines the influence of the plasma on the focussing properties and, thus, on the distance measures in the Robertson-Walker universe.

It is convenient to replace the affine parameter $s$ by the cosmological redshift $z$ , which is done by means of

[TABLE]

where we used (76), (81) and (86) in the last step. The Hubble function $H$ and the frequency $\omega$ are now viewed as functions of the redshift $z$ . In order to get rid of the second derivative with respect to $s$ we need to evaluate $d\omega/ds$ , which is done by means of (78), (81) and (80). We also replace all derivatives with respect to cosmic time by a derivative with respect to cosmological redshift and obtain after a lengthy calculation

[TABLE]

where a prime denotes a derivative with respect to redshift and $a_{0}=a(t_{r})$ denotes the value of the scale factor at the reception event (“today”).

For vanishing plasma frequency ( $\omega_{p}=0$ ) and a spatially flat Universe ( $k=0$ ), we recover from this equation the well known result $D_{\pm}(z)=1/(1+z)\int_{0}^{z}d\bar{z}/H(\bar{z})$ if we choose the initial conditions appropriately. In this paper we will not attempt to integrate (4) with a non-vanishing plasma density which, in general, will have to be done numerically. As an illustration of our general results, we will be satisfied by deriving the influence of the plasma on the distance-redshift relation for small redshift, i.e., on the Hubble law, in a Robertson-Walker universe.

Taylor expansion of the function $D_{\pm}(z)$ about the parameter value $z(s_{r})=0$ , where the light ray meets the receiver, gives

[TABLE]

Imposing the vertex condition (63) requires

[TABLE]

where $H_{0}=H(z=0)$ denotes the Hubble constant, i.e. the present day value of the Hubble expansion rate. We further use (4) at $z=0$ , which gives

[TABLE]

It is convenient to replace the derivatives of the Hubble expansion rate by the deceleration parameter $q_{0}=(H^{\prime}/H-1)(z=0)$ . We find,

[TABLE]

Here and in the following, we write $\omega_{pr}^{\prime}$ for $\omega_{p}^{\prime}(z=0)$ . According to (68), (106) gives us the area distance

[TABLE]

The leading term is the well known Hubble law modified by the square root in front of the term linear in redshift. As the Hubble law is usually formulated for standard candles and not for standard rulers, we can also write it for the luminosity distance, by applying the modified reciprocity law

[TABLE]

In order to obtain a consistent Taylor expansion of the luminosity distance up to second order in redshift, we Taylor expand

[TABLE]

up to linear order in $z$ , which gives

[TABLE]

Hence, we find

[TABLE]

Comparison of (107) and (111) shows that for the linear Hubble law it makes no difference if we use $D_{A}$ or $D_{L}$ . At this order, the plasma modifies the vacuum Hubble law by the factor $\sqrt{1-\omega_{pr}^{2}/\omega_{r}^{2}}$ . As this factor is smaller than 1, the effect of the plasma is that light sources at a given (area or luminosity) distance appear redder than light sources at the same (area or luminosity) distance on the same spacetime in vacuum. To estimate the factor $\sqrt{1-\omega_{pr}^{2}/\omega_{r}^{2}}$ numerically, we read from Fig. 2 that at present in our Universe $\omega_{pr}<10^{-5}\mathrm{MHz}$ while the ionosphere limits us to $\omega_{r}>10\,\mathrm{MHz}$ , hence $\sqrt{1-\omega_{pr}^{2}/\omega_{r}^{2}}=1-\varepsilon$ where $\varepsilon<10^{-12}$ . This demonstrates that the deviation from vacuum is unmeasurably small as long as we do not have a radio telescope, or better an array of radio telescopes, in space that could operate at frequencies considerably below 10 MHz. We have mentioned already in the introduction that there are some plans for such arrays.

Note that we have not used Einstein’s field equation. Our result is independent of whether or not the plasma is self-gravitating and no assumption was made on how the scale factor depends on time.

5 Conclusions

The purpose of this work was to discuss the geometry of light bundles in a non-magnetised cold plasma on an arbitrary spacetime. This is of relevance for calculating the effects of such a medium on image deformation and on distance measures. In fact most of the baryonic matter in the Universe is in the aggregate state of a plasma until today.

In comparison to the vacuum case, the most important difference is in the fact that the shape and the size of the cross-section of a bundle are no longer independent of the observer, i.e., in a plasma there is no analogue of the Sachs theorem. Based on a modified definition of a Sachs basis, we have derived modified Sachs equations for the shear, the expansion and the twist of a light bundle propagating in a cold plasma. These equations allowed us to derive a modified reciprocity theorem (Etherington law) which relates luminosity distance to area distance. In vacuum these two distance measures are related by a redshift factor squared. A similar law holds in a plasma; however, one of the redshift factors is a frequency ratio whereas the other one is a wavelength ratio. The fact that we have to distinguish between these two types of redshift factors is another important difference to the vacuum case.

In view of applications to astrophysics or cosmology, it seems to us that all plasma effects are negligibly small in the optical frequency range but some of them may be observable with radio signals.

As an illustration of our general results, we have considered a homogeneous plasma on a Robertson-Walker spacetime. Throughout the paper, and also in this example, we have not used Einstein’s field equation so that the results apply to a self-gravitating plasma and equally well to a “test plasma” on a given spacetime background. In the Robertson-Walker case, we have investigated the influence of the plasma on distance measures and on the distance-redshift relation. We believe that these results are of some interest from a conceptual point of view, although the difference to the vacuum case is too small for being observed with existing instruments. However, the real Universe does contain structure and the magnitude of the expected plasma effect does not only depend on the plasma frequency and its time derivatives, but also its spatial variations.

Our general results may be applied to other examples, e.g. to a spherically symmetric plasma around a Schwarzschild black hole. In this case the plasma would have an influence on image deformation, in particular on the size of Einstein rings, and on the magnification of images. We are planning to discuss this example in another paper. Other interesting applications might be weak and strong lensing of radio galaxies by galaxy clusters, whose baryonic matter content is known to be dominated by the cluster plasma. We would expect the achromatic gravitational lensing at radio frequencies to be modified by chromatic corrections from plasma effects.

Acknowledgements

We would like to thank Isabel Oldengott for providing the free electron fraction $x_{e}(z)$ and we thank Marcus Brüggen, Walter Pfeiffer and Joris Verbiest for valuable discussions. We gratefully acknowledge support from the DFG within the Research Training Group 1620 “Models of Gravity”.

References

[1] Lorimer D R and Kramer M 2005 Handbook of Pulsar Astronomy (Cambridge: Cambridge University Press)
[2]

Lyne A G and Graham-Smith F 2005 Pulsar astronomy (Cambridge: Cambridge University Press)

[3]

Muhleman D O and Johnston I D 1966 Phys. Rev. Lett. 17 455

[4]

Muhleman D O, Ekers R D and Fomalont E 1970 Phys. Rev. Lett. 24 1377

[5]

Newkirk G Jr 1967 Ann. Rev. Astron. Astrophys. 5 213

[6]

Gallagher P T, et al. 1999 Astrophys. J. 524 L133

[7]

Mercier C and Chambe G 2015 Astron. Astrophys. 583 A101; 2016 ibid. 585 C1 [Erratum]

[8] Swarup G et al. 1991, Current Science 60 95
[9] van Haarlem M P, Wise M W, Gunst A W, et al. 2013 Astron. Astrophys. 556 A2
[10] Lonsdale C J, Cappallo R J, Morales M F, et al. 2009 Proc. IEEE 97 1497
[11]

Budianu A, Meijerink A and Bentum M J 2015 Acta Astronautica 107 14

[12]

Bentum M J, Bonetti, L and Spallicci, A D A M 2017 Adv. Space Sci. 59 736

[13]

Perlick V 2000 Ray Optics, Fermat’s Principle, and Applications to General Relativity (Lecture Notes in Physics. Monographs vol m61) (Heidelberg: Springer)

[14]

Bisnovatyi-Kogan G S and Tsupko O Y 2009 Gravitation and Cosmology 15 20

[15]

Bisnovatyi-Kogan G S and Tsupko O Y 2010 Mon. Not. Roy. Astron. Soc. 404 1790

[16]

Tsupko O Y and Bisnovatyi-Kogan G S 2013 Phys. Rev. D 87 124009

[17]

Morozova V, Ahmedov B and Tursunov A 2013 Astrophys. Space Sci. 346 513

[18]

Er X and Mao S 2014 Mon. Not. Roy. Astron. Soc. 437 2180

[19]

Rogers A 2015 Mon. Not. Roy. Astron. Soc. 451 17

[20]

Rogers A 2017 Mon. Not. Roy. Astron. Soc. 465 2151

[21]

Rogers A 2017 Universe 3 3

[22]

Perlick V, Tsupko O Y and Bisnovatyi-Kogan G S 2005 Phys. Rev. D 92 104031

[23]

Perlick V and Tsupko O Y 2017 Phys. Rev. D 95 104003

[24]

Breuer R A and Ehlers J 1980 Proc. Roy. Soc. London A 370 389

[25]

Breuer R A and Ehlers J 1981 Proc. Roy. Soc. London A 374 65

[26]

Synge J L 1960 Relativity. The general theory (Amsterdam: North-Holland)

[27]

Madore J 1974 Comm. Math. Phys. 38 103

[28]

Bičák J and Hadrava P 1975 Astron. Astrophys. 44 389

[29]

Anile A M and Pantano P 1977 Phys. Lett. A 61 215

[30]

Anile A M and Pantano P 1979 J. Math. Phys. 20 177

[31]

Schneider P, Ehlers J and Falco E 1992 Gravitational lenses (Heidelberg: Springer)

[32]

Jordan P, Ehlers J and Sachs R K 1961 Akad. Wiss. Lit. Mainz, Abh. Math. Nat. Kl. 1

[33]

Sachs R K 1961 Proc. Roy. Soc. London A 264 309

[34]

Perlick V 2004 Living Rev. Relativity 7(9) http://www.livingreviews.org/lrr-2004-9

[35]

Wald R M 1984 General relativity (Chicago: Chicago University Press)

[36]

Appenzeller I 2013 Introduction to astronomical spectroscopy (Cambridge: Cambridge University Press)

[37]

Elsässer K and Popel S 1997 Phys. Plasmas 4 2348

[38]

Etherington I M H 1933 The Philos. Mag. and J. of Science (Ser. $7$ ) 15 761

[39]

Planck collaboration, Ade A A et al. 2016 Astron. Astrophys. 594 A14

[40]

Noonan T W 1983 Astrophys. J. 265 451

Bibliography40

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Lorimer D R and Kramer M 2005 Handbook of Pulsar Astronomy (Cambridge: Cambridge University Press)
2[2] Lyne A G and Graham-Smith F 2005 Pulsar astronomy (Cambridge: Cambridge University Press)
3[3] Muhleman D O and Johnston I D 1966 Phys. Rev. Lett. 17 455
4[4] Muhleman D O, Ekers R D and Fomalont E 1970 Phys. Rev. Lett. 24 1377
5[5] Newkirk G Jr 1967 Ann. Rev. Astron. Astrophys. 5 213
6[6] Gallagher P T, et al. 1999 Astrophys. J. 524 L 133
7[7] Mercier C and Chambe G 2015 Astron. Astrophys. 583 A 101; 2016 ibid. 585 C 1 [Erratum]
8[8] Swarup G et al. 1991, Current Science 60 95

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Sachs equations for light bundles in a cold plasma

Abstract

1 Introduction

2 Light propagation in vacuum

2.1 The Hamiltonian for light rays in vacuum

2.2 Light bundles in vacuum

Definition 2.1** (Light bundle)**

Definition 2.2** (Sachs basis)**

2.3 Sachs equations for light bundles in vacuum

3 Light propagation in a plasma

3.1 The Hamiltonian for light rays in a plasma

3.2 Light bundles in a plasma

Definition 3.1** (Light bundle)**

Definition 3.2** (Sachs basis)**

3.3 Sachs equations for light bundles in a plasma

3.4 Reciprocity theorem and related results

Theorem 3.1

Theorem 3.2

Theorem 3.3** (Reciprocity Theorem)**

4 Example: Spatially homogeneous plasma in a Robertson-Walker spacetime

5 Conclusions

Acknowledgements

References

Definition 2.1 (Light bundle)

Definition 2.2 (Sachs basis)

Definition 3.1 (Light bundle)

Definition 3.2 (Sachs basis)

Theorem 3.3 (Reciprocity Theorem)