Covariant kinematics of light in media and a generalized Raychaudhuri   equation

Robert T. Thompson; Mohsen Fathi

arXiv:1705.11108·gr-qc·November 15, 2017

Covariant kinematics of light in media and a generalized Raychaudhuri equation

Robert T. Thompson, Mohsen Fathi

PDF

TL;DR

This paper develops a covariant framework for analyzing light propagation in various media within curved spacetime, generalizing the Raychaudhuri equation to enhance understanding of optical systems and their relation to general relativity.

Contribution

It introduces a covariant kinematic approach for light in media and extends the Raychaudhuri equation to media with varying refractive properties and spacetime characteristics.

Findings

01

Derived covariant kinematics of light in media within curved spacetime

02

Generalized Raychaudhuri equation for diverse media and spacetime types

03

Potential applications in transformation optics and gravitational studies

Abstract

There is ongoing interest in adopting various tools and ideas from general relativity for optical applications and the study of light propagation through natural or engineered media. Here, the covariant kinematics of light propagating through arbitrary dielectric media in curved space-times are derived, allowing for analysis and tracing of congruences of light through media that may smoothly vary in character between vacuum, positively refracting, and negatively refracting; or null, timelike, and spacelike with respect to the background metric. The kinematics are then used to generalize the Raychaudhuri equation -- an important tool in general relativity that describes the focus of a congruence. These results will be useful for the analysis of optical devices, particularly those designed using transformation optics, and serve as theoretical tools to study generalized concepts in general…

Equations282

\dot{Θ} = - \frac{1}{2} Θ^{2} - \hat{B} \indices_{β}^{α} \hat{B} \indices_{α}^{β} - R \indices_{λ ρ τ}^{ρ} u^{λ} u^{τ} .

\dot{Θ} = - \frac{1}{2} Θ^{2} - \hat{B} \indices_{β}^{α} \hat{B} \indices_{α}^{β} - R \indices_{λ ρ τ}^{ρ} u^{λ} u^{τ} .

⋆ \indices_{α β}^{μν} = \frac{1}{2} ∣ g ∣ ϵ_{α β σ ρ} g^{σ μ} g^{ρ ν},

⋆ \indices_{α β}^{μν} = \frac{1}{2} ∣ g ∣ ϵ_{α β σ ρ} g^{σ μ} g^{ρ ν},

d F

d F

d ⋆ F

d ⋆ P = J_{m u l t i p o l e},

d ⋆ P = J_{m u l t i p o l e},

d F

d F

d ⋆ (F + P)

G = ⋆ χ F

G = ⋆ χ F

G_{μν} = ⋆ \indices_{μν}^{α β} χ \indices_{α β}^{σ ρ} F_{σ ρ} .

G_{μν} = ⋆ \indices_{μν}^{α β} χ \indices_{α β}^{σ ρ} F_{σ ρ} .

F = d A,

F = d A,

d F = d^{2} A = 0,

d F = d^{2} A = 0,

d ⋆ d A = J

d ⋆ d A = J

d ⋆ χ d A = 0.

d ⋆ χ d A = 0.

δ = (- 1)^{k (m + 1) - 1} ⋆ d ⋆

δ = (- 1)^{k (m + 1) - 1} ⋆ d ⋆

δ χ d A = 0.

δ χ d A = 0.

(δ χ d A)_{α} = g_{α β} \frac{1}{∣ g ∣} \partial_{ν} (∣ g ∣ g^{ν σ} g^{β ρ} χ \indices_{σ ρ}^{λκ} \partial_{[λ} A_{κ]}) = 0.

(δ χ d A)_{α} = g_{α β} \frac{1}{∣ g ∣} \partial_{ν} (∣ g ∣ g^{ν σ} g^{β ρ} χ \indices_{σ ρ}^{λκ} \partial_{[λ} A_{κ]}) = 0.

A_{μ} = \hat{A}_{μ} (x^{ρ}) e^{i S (x^{ρ})}

A_{μ} = \hat{A}_{μ} (x^{ρ}) e^{i S (x^{ρ})}

\frac{\partial _{ν} A ^ _{μ}}{A ^ _{μ}} ≪ \partial_{ν} S .

\frac{\partial _{ν} A ^ _{μ}}{A ^ _{μ}} ≪ \partial_{ν} S .

\frac{\partial _{ν} χ \indices _{μα}^{σ ρ}}{χ \indices _{μα}^{σ ρ}} ≪ \partial_{ν} S \mbox an d \frac{\partial _{ν} g ^{σ ρ}}{g ^{σ ρ}} ≪ \partial_{ν} S,

\frac{\partial _{ν} χ \indices _{μα}^{σ ρ}}{χ \indices _{μα}^{σ ρ}} ≪ \partial_{ν} S \mbox an d \frac{\partial _{ν} g ^{σ ρ}}{g ^{σ ρ}} ≪ \partial_{ν} S,

g^{ν σ} χ \indices_{σ α}^{λκ} (\partial_{ν} S) (\partial_{λ} S) \hat{A}_{κ} = g^{ν σ} χ \indices_{σ α}^{λκ} p_{ν} p_{λ} \hat{A}_{κ} = 0,

g^{ν σ} χ \indices_{σ α}^{λκ} (\partial_{ν} S) (\partial_{λ} S) \hat{A}_{κ} = g^{ν σ} χ \indices_{σ α}^{λκ} p_{ν} p_{λ} \hat{A}_{κ} = 0,

X \indices_{α}^{κ} \hat{A}_{κ} = 0

X \indices_{α}^{κ} \hat{A}_{κ} = 0

X \indices_{α}^{κ} = g^{ν σ} χ \indices_{σ α}^{λκ} p_{ν} p_{λ}

X \indices_{α}^{κ} = g^{ν σ} χ \indices_{σ α}^{λκ} p_{ν} p_{λ}

det (X) = 0.

det (X) = 0.

X adj (X) = det (X) I .

X adj (X) = det (X) I .

adj (X) = 0

adj (X) = 0

adj (X) = P (p \otimes p^{♯})

adj (X) = P (p \otimes p^{♯})

P = 0.

P = 0.

P = H_{+} H_{-} = (a^{α β} p_{α} p_{β} + F^{μν σ ρ} p_{μ} p_{ν} p_{σ} p_{ρ}) (a^{α β} p_{α} p_{β} - F^{μν σ ρ} p_{μ} p_{ν} p_{σ} p_{ρ}) = 0.

P = H_{+} H_{-} = (a^{α β} p_{α} p_{β} + F^{μν σ ρ} p_{μ} p_{ν} p_{σ} p_{ρ}) (a^{α β} p_{α} p_{β} - F^{μν σ ρ} p_{μ} p_{ν} p_{σ} p_{ρ}) = 0.

γ_{\pm}^{α β} = \frac{\partial ^{2} H _{\pm}}{\partial p _{α} \partial p _{β}} .

γ_{\pm}^{α β} = \frac{\partial ^{2} H _{\pm}}{\partial p _{α} \partial p _{β}} .

H_{+} = \frac{1}{2} γ_{+}^{α β} (x^{μ}, p_{μ}) p_{α} p_{β} = 0, \mbox or H_{-} = \frac{1}{2} γ_{-}^{α β} (x^{μ}, p_{μ}) p_{α} p_{β} = 0,

H_{+} = \frac{1}{2} γ_{+}^{α β} (x^{μ}, p_{μ}) p_{α} p_{β} = 0, \mbox or H_{-} = \frac{1}{2} γ_{-}^{α β} (x^{μ}, p_{μ}) p_{α} p_{β} = 0,

\frac{1}{2} g^{α β} p_{α} p_{β} = 0

\frac{1}{2} g^{α β} p_{α} p_{β} = 0

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Covariant kinematics of light in media and a generalized Raychaudhuri equation

Robert T. Thompson

[email protected]

Institute of Applied Physics, Karlsruhe Institute of Technology (KIT), 76128 Karlsruhe, Germany

Mohsen Fathi

[email protected]

Department of Physics, Payame Noor University (PNU), P.O. Box 19395-3697, Tehran, Iran

Abstract

There is ongoing interest in adopting various tools and ideas from general relativity for optical applications and the study of light propagation through natural or engineered media. Here, the covariant kinematics of light propagating through arbitrary dielectric media in curved space-times are derived, allowing for analysis and tracing of congruences of light through media that may smoothly vary in character between vacuum, positively refracting, and negatively refracting; or null, timelike, and spacelike with respect to the background metric. The kinematics are then used to generalize the Raychaudhuri equation – an important tool in general relativity that describes the focus of a congruence. These results will be useful for the analysis of optical devices, particularly those designed using transformation optics, and serve as theoretical tools to study generalized concepts in general relativity.

I Introduction

The last decade has seen a resurgence of interest in the covariant, tensorial description of electrodynamics in dielectric media. This has been largely spurred by developments in transformation optics Leonhardt and Philbin (2006), which uses longstanding ideas about the similarity of the refractive properties of dielectric media to those of curved space-times to design optical media with unusual features or capabilities, such as invisibility cloaks Pendry et al. (2006); Schurig et al. (2006). Other interesting applications of transformation optics, such as space-time event cloaking McCall et al. (2011), highlight the potential of the general relativity-inspired formulation of transformation optics, and there is wide interest in what other ideas and tools from general relativity may be imported for optical applications.

Transformation optics as it stands is limited in its capacity to model and control real-world phenomena, such as the dispersion, dissipation, and nonlinearities that are manifest in the metamaterials used to actually realize transformation devices. Progress incorporating these features in transformation optics has been made by formulating and manipulating tensorial versions of their descriptions Gratus et al. (2016); Paul and Rahm (2012); Bergamin et al. (2011), which further encourages the development of covariant electrodynamics in media. It is in the context of electrodynamics within dielectric media residing in a curved background space-time that we study one of the fundamental tools of general relativity: the kinematics of a congruence of light.

While the fantastic technological possibilities proffered by metamaterials and transformation optics are a significant motivator, there is also a great deal of overlap with other topics of theoretical interest. Analog models of curved space-times seek to represent aspects of extreme gravitational systems – e.g. light propagation near black holes – with some kind of laboratory-accessible system Unruh (1981, 1995); Reznik (2000); Schützhold et al. (2002); Novello et al. (2002); Barceló et al. (2011); Thompson and Frauendiener (2010). Mathematically, they are very naturally related to ideas in transformation optics and metamaterials Greenleaf et al. (2007); Smolyaninov (2013); Chen et al. (2010); Lu et al. (2010); Boston (2015); Narimanov and Kildishev (2009), which has resulted in a great deal of cross-fertilization of ideas. The tensorial, differential forms formulation of electrodynamics used here is also very closely related to developments in the premetric electrodynamics program, which seeks a deeper understanding of the structure and pliability of electrodynamics by considering the metric as a subsidiary field that does not directly enter into Maxwell’s equations Hehl et al. (2002); Hehl and Obukhov (2003). Lastly, a major focus of research in the area of quantum gravity and physics beyond the standard model is based on Lorentz violating space-times, and it has been shown that Lorentz violation in effective field theories is connected to Finsler geometries Kostelecký and Samuel (1989); Girelli et al. (2007); Kostelecký (2011). The idea that space-time is Lorentzian was inspired by Maxwell’s equations in vacuum Einstein (1905), but one of the interesting features of refractive media is that, as discussed below, the light cone is determined by what can be considered an effective Finslerian structure – the optical metric – that provides a natural and accessible context for studies in Finsler-type space-times Perlick (2000); Lämmerzahl and Hehl (2004); Skákala and Visser (2009).

What ties these topics together in this work is the formal study of electrodynamics on manifolds that possess the additional structure of a dielectric medium. Since the vacuum can itself be considered a trivial dielectric medium, allowing for a nontrivial dielectric structure is a natural generalization. The similarity between light in curved space-times and light in refractive media has been recognized since the early days of general relativity Eddington (1920); Gordon (1923), and an identification between the two by Plebanski Plebanski (1960) allowed the specification of a dielectric analog model of curved vacuum space-times Felice (1971) and was used as a foundation for transformation optics Leonhardt and Philbin (2006). The point of departure made here compared with earlier related formulations or identifications of electrodynamics in media is an explicit recognition that any real medium resides in a space-time of which the curvature should be specifically accounted for, and a desire to disentangle the refractive action of the medium from that of the background space-time in which it resides. For example, the well-known book by Post Post (1962) considers a tensorial formulation of electrodynamics in media but does not include separate information about the background metric – an implicit assumption of flat space-time – while the premetric electrodynamics program uses a differential forms formulation of electrodynamics but considers a general structural field with no specified metric. This could be interpreted as combining space-time and medium contributions, but for a rigorous formulation of transformation optics or study of light propagation in media it is desirable to distinguish the medium from the background space-time Thompson et al. (2011a, b); Thompson and Fathi (2015); Fathi and Thompson (2016).

II Goal of this paper

Consider a laser beam. Physically, one might think of the laser beam as defining a column of light occupying some physical region of space. The first question one might ask is what path the beam follows. The next question one might ask is how the shape of the beam changes along its path. To be more specific, one might want to measure the shape of the transverse cross section of the beam at one instant of time, nominally representing the shape of a phase front, and then compare this with the shape of a transverse cross section of the beam at a different position or later time. Has the beam expanded, contracted, rotated, sheared from a circle to an ellipse? We may think of the laser beam as made up of a bundle of curves, called a congruence, and we wish to characterize the transverse behavior, the kinematics, of this congruence. Thus we wish to go beyond the idea of ray trajectories to study these other physically meaningful and measurable characteristics of a beam.

The kinematics of congruences is a fundamental tool in general relativity. Not only is it important for accurate ray tracing, as displayed by the spectacular rendering of a black hole in the film Interstellar James et al. (2015), but it is also fundamental in determining whether, and what type of, a horizon exists Hawking and Ellis (1975). The associated Raychaudhuri equation gives information on the evolution of the cross sectional area of the congruence – essentially how the focus changes along the length of the beam – and is the basis of the focusing and singularity theorems Hawking and Ellis (1975).

There are differences between null, timelike, and spacelike congruences (e.g. in the dimensionality of their transverse subspaces), and while the strategy for studying their kinematics is the same in each case, they must be done separately. On the other hand, light propagating inside dielectric media typically moves slower (or sometimes faster, which is not strictly forbidden by causality for heavily dispersive media) than the vacuum speed of light. Given that metamaterials allow the possibility (at least in principle) for light rays to smoothly transition between null, timelike, and even spacelike propagation with respect to the background metric, it is natural to ask whether one must shift between the null, timelike, and spacelike pictures of kinematics as the congruence is traced, or whether there exists some unified approach that allows us to continuously follow a congruence of light through media.

The typical textbook derivation (see, e.g., Ref. Poisson (2004)) of the kinematical decomposition of a null congruence assumes that it is both geodesic and null with respect to the space-time metric. Since neither of these conditions hold within refracting media the textbook derivation is not valid for our purpose. To address this more general question of non-null, non-geodesic light propagation in media we develop the theory ab initio from Maxwell’s equations with two main goals in mind.

First, we seek to derive the kinematics of a congruence of light passing through linear dielectric media residing in a potentially curved background space-time. This will allow a more complete analysis and congruence tracing through media, which should be useful for fields like transformation optics.

Second, we find a generalized form of the Raychaudhuri equation that is valid for congruences in linear dielectric media residing in a potentially curved background space-time. For affinely parametrized null geodesics in curved vacuum space-times, the Raychaudhuri equation is

[TABLE]

This equation provides information about the change in the cross sectional area $\Theta$ of a beam as one follows the beam; i.e., it describes the focus of the beam along its length. Aside from interest in focusing from an optics point of view, a generalized Raychaudhuri equation is important to the theory of light in media because it contains the Riemann tensor and is therefore a measurable probe of curvature. We expect that, given an arbitrary refractive medium, additional medium-dependent terms should appear in this generalized Raychaudhuri equation. Given the longstanding idea that linear dielectric media is equivalent to a vacuum space-time, one might conjecture that these medium contributions will appear in the form of a tensor akin to the Riemann tensor, comprised of derivatives of the optical metric, which does not vanish even in flat space-time. Although we indeed find medium-dependent contributions to the Raychaudhuri equation, we do not find that they appear in the form of a Riemann-like tensor.

It was mentioned above that one of the interesting features of light in media is that the light cone is determined by an effective Riemann-Finsler “optical metric.” Recently there has been interest in the Raychaudhuri equation in Finsler space-times Stavrinos (2005, 2012); Minguzzi (2015), but our result turns out to be quite different. The reason is that, while the lightcone is determined by the effective pseudo-Finslerian optical metric, this is not the only structure present in the system, and a real-world observer actually makes measurements with respect to the background metric. Thus, while the optical metric is interesting in that it provides a structure that can be interpreted as Finslerian, the medium does not fundamentally behave like a vacuum pseudo-Finsler manifold since the underlying space-time structure is still Riemannian.

The paper is organized as follows. In Sec. III we review the covariant formalism of light in media that we work with and take the ray-optics limit. In Sec. IV we derive the first order kinematics of a light congruence in media. In Sec. V we derive a generalized Raychaudhuri equation for light in media within curved space-times, which reduces to the usual Raychaudhuri equation in vacuum. The main finding in this section is that the medium does not contribute a term that might be interpreted as an effective curvature tensor – even for impedance matched media where the optical metric becomes purely pseudo-Riemannian. We conclude with Sec. VI.

III Geometrical optics limit of covariant electrodynamics in media

Using the geometric optics limit of classical electrodynamics, we seek to study the kinematical behavior of a congruence of light rays traversing an arbitrary dielectric medium residing in a possibly curved background space-time. Furthermore, we are interested in working in a totally covariant, four-dimensional tensorial framework based on differential forms. In this section we review such a framework for electrodynamics in media and its geometric optics limit. A more complete introduction to tensorial electrodynamics may be found in, e.g., Refs. Post (1962); Misner et al. (1973); Baez and Muniain (1994); Thompson and Cummer (2012). To be clear, although in the Introduction we mentioned transformation optics and dielectric analog space-times as motivations for studying covariant electrodynamics in media on manifolds, we make no assumptions on the form of the medium under consideration; we simply want to study the propagation of light through an arbitrary slab of stuff residing in a curved background space-time.

III.1 Covariant electrodynamics in media

Space-time curvature affects the propagation of light. The deflection of starlight passing near the Sun provided the first observational confirmations of general relativity, and such gravitational lensing is commonly exploited in modern astronomical and cosmological studies. In Maxwell’s equations, information about the space-time is entirely contained in an operator $\star$ , called the Hodge dual. All pseudo-Riemannian manifolds endowed with a metric, e.g., space-time, have a naturally associated Hodge dual constructed from the metric, and thus the Hodge dual contains all the space-time information of the metric tensor. For $k<m$ , the Hodge dual takes a $k$ -form to an $(m-k)$ -form; in the case of a four-dimensional space-time, it maps a 2-form into another 2-form and has the index expression

[TABLE]

where $g^{\mu\nu}$ is the contravariant form of the metric and $\epsilon_{\alpha\beta\sigma\rho}$ is the completely antisymmetric Levi-Civitá symbol. The electromagnetic fields $(\vec{E},\vec{B})$ are components of an antisymmetric second rank covariant tensor (2-form), $\mathbf{F}=F_{\mu\nu}$ , and Maxwell’s homogeneous and inhomogeneous equations are Misner et al. (1973); Baez and Muniain (1994)

[TABLE]

where $\mathrm{d}$ denotes the exterior derivative and $\mathbf{J}$ is the charge-current 3-form. In the presence of polarizable media, an incident field $\mathbf{F}$ can induce multipole moments that contribute to $\mathbf{J}$ even though the total monopole or free charge contribution to $\mathbf{J}$ may be zero. The solutions to Maxwell’s equations then include the particular solution

[TABLE]

in which case the general solution satisfies

[TABLE]

We typically work with macroscopically neutral media where $\mathbf{J}_{free}=0$ , but this is not necessary. In media where the polarization response is linear in the applied field, the particular solution $\mathbf{P}$ is linear in the homogeneous solution $\mathbf{F}$ , and we may write Thompson et al. (2011a)

[TABLE]

where $\bm{\chi}$ contains all the information about the medium such as permeability and permittivity in a metric-independent way.

As an alternative and useful interpretation, one could assume the fields $\mathbf{F}$ and $\mathbf{G}$ are independent and contain the components of $(\vec{E},\vec{B})$ and $(\vec{D},\vec{H})$ , respectively. Then solutions to Maxwell’s equations $\mathrm{d}\mathbf{F}=0$ and $\mathrm{d}\mathbf{G}=\mathbf{J}_{free}$ require a constitutive relation that is $\mathbf{G}=\star\mathbf{F}$ in vacuum but which can be uniquely extended to Eq. (8) in the presence of linear media. In component form, the constitutive relation reads

[TABLE]

The dielectric tensor $\chi\indices{{}_{\mu\nu}^{\alpha\beta}}$ is independently antisymmetric under index exchange on either $\mu\nu$ or $\alpha\beta$ , allowing $\bm{\chi}$ the maximum of 36 independent parameters that can fully relate the six independent components of $\mathbf{G}$ and $\mathbf{F}$ . It can be thought of as a 2-form-valued bivector that operates on 2-forms. In this constitutive relation, the vacuum is uniquely defined as the trivial dielectric such that $\bm{\chi}_{vac}\mathbf{F}=\mathbf{F}$ . The usual permeability, permittivity, and magnetoelectric coupling matrices can be extracted from $\bm{\chi}$ and interpreted in their usual sense in a locally flat frame, but these traditional matrix quantities intrinsically mix components of $\bm{\chi}$ with components of the metric even in flat space-time, where, for example, factors of $r$ appear in spherical or cylindrical coordinates.

III.2 Geometric optics limit

The wave propagation of light is described by a second order equation, but Maxwell’s equations provide two first order equations. The most straightforward approach to geometric optics is to consider the 1-form potential $\mathbf{A}=A_{\mu}$ , which in a locally flat Minkowski space-time is equivalent to the 4-vector potential $A^{\mu}=(\phi,\vec{A})$ . The field strength tensor $\mathbf{F}$ is related to $\mathbf{A}$ by

[TABLE]

and the nilpotency of the exterior derivative automatically guarantees the satisfaction of Maxwell’s homogeneous equation

[TABLE]

while in vacuum a Green’s function approach to the solution of the inhomogeneous equation

[TABLE]

leads to Jefimenko’s equations Jackson (1998).

Since we are interested in the geometric optics of light propagation through macroscopically neutral media, we consider

[TABLE]

Operating on both sides of this equation with the Hodge dual, it may be rewritten in terms of the codifferential of a $k$ -form

[TABLE]

as

[TABLE]

While the exterior derivative increases the degree of a differential form by 1, e.g., $\mathrm{d}$ of a 1-form results in a 2-form, the codifferential decreases the degree by 1; so while $\bm{\chi}\mathrm{d}\mathbf{A}$ is a 2-form, $\delta\bm{\chi}\mathrm{d}\mathbf{A}$ is a 1-form. The codifferential of a $k$ -form can be given an indexed expression in terms of ordinary derivatives and the metric, allowing Eq. (15) to be recast as

[TABLE]

To reach the geometric optics limit we assume a solution of the form

[TABLE]

and require that the amplitude $\hat{A}_{\mu}(x^{\rho})$ is slowly varying with respect to the phase $S(x^{\rho})$ , subject to the constraint Post (1962)

[TABLE]

By furthermore requiring the space-time and the medium to also be slowly varying compared with $S$ ,

[TABLE]

one finds that the principal part of Eq. (16), called the eikonal equation, is

[TABLE]

where we write $p_{\nu}=\partial_{\nu}S$ (or the index-free version $\bm{p}=\mathrm{d}S$ ). In other terminology, this is equivalent to a WKB approximation, where one assumes a solution of the form $\mathbf{A}=\hat{\mathbf{A}}e^{i\frac{S}{\lambda}}$ and retains the leading order terms of Eq. (16) in the limit $\lambda\to 0$ Born and Wolf (1999).

Writing Eq. (20) as

[TABLE]

and thinking of

[TABLE]

as a $4\times 4$ matrix, it may be seen that the existence of a nontrivial solution to Eq. (20) requires

[TABLE]

In fact, this condition is satisfied identically. By the antisymmetry of the second set of indices on $\bm{\chi}$ , $\hat{A}_{\mu}\propto p_{\mu}$ is already a trivial solution, so any nontrivial solution resides in the three-dimensional subspace orthogonal to $p_{\mu}$ , meaning the matrix is effectively only three dimensional 111The orthogonality of $\mathbf{A}$ and $\bm{p}$ is also a consequence of the Lorenz condition, the adoption of which is required in conjunction with the assumed plane wave solution Eq. (17). There are some different methods for dealing with this (see, for example, Ref. Hehl et al. (2002)); we offer a purely algebraic argument based on the classical adjugate matrix $\mathrm{adj}(\bm{X})$ , defined such that

[TABLE]

If $\bm{X}$ is invertible, then $\mathrm{adj}(\bm{X})\propto\bm{X}^{-1}$ , but $\mathrm{adj}(\bm{X})$ is defined even if $\bm{X}^{-1}$ does not exist. Since $\det(\bm{X})=0$ identically, then it must be true that $\bm{X}\mathrm{adj}(\bm{X})=0$ . Since $\bm{X}$ is nonzero and arbitrary, the subsidiary condition

[TABLE]

must be satisfied. Although this is a matrix condition, one may show that it is actually of the form

[TABLE]

where $P$ is a polynomial of fourth order in $p_{\mu}$ and $(\bm{p}\otimes\bm{p}^{\sharp})\indices{{}_{\mu}^{\nu}}=p_{\mu}g^{\nu\alpha}p_{\alpha}$ is a nonzero matrix consisting of simple multiples of components of $p_{\mu}$ . Therefore, the condition for nontrivial solutions to Eq. (20) reduces to the scalar condition

[TABLE]

Since we are interested in characterizing the medium with an (possibly psuedo-Finslerian) optical metric, we restrict ourselves to media for which the polynomial $P$ can be factored to

[TABLE]

Such a factorization may require the imposition of some symmetry conditions on $\bm{\chi}$ that reduce the number of free parameters from 36, but determining the space of allowable $\bm{\chi}$ is beyond the scope of the present work. For our purpose it is sufficient to assume that $P$ is factorizable so.

Thus, it can be seen that there are two possible branches, $H_{+}=0$ or $H_{-}=0$ , corresponding to two different propagation states of light; i.e. the medium displays birefringence for which there exist two independent propagating eigenstates that follow distinct ray trajectories. The solutions depend not only on the location within the medium, but also on the direction of light propagation. Since $H_{+}$ and $H_{-}$ are quadratic in $p_{\mu}$ , each branch furthermore supports two future-directed solutions traveling in opposite directions, e.g. left/right or ingoing/outgoing rays.

For each propagation state, the corresponding function $H_{+}(x^{\mu},p_{\mu})$ or $H_{-}(x^{\mu},p_{\mu})$ defines a pseudo-Finslerian structure on the cotangent bundle. One may then define two associated pseudo-Finslerian optical metrics,

[TABLE]

This may be summarized by saying that in the geometric optics limit, there exist two branches of on-shell solutions of Maxwell’s equations at each point that must everywhere satisfy the corresponding conditions

[TABLE]

where we emphasize that $\gamma_{\pm}^{\mu\nu}$ depends on both the position and the wave vector. The fact that these equations have the same form as the condition for null curves on a vacuum manifold,

[TABLE]

provides the source of the terminology “optical metric,” but it is important to realize that while the optical metric defines the light cone, it is an emergent quantity that serves as an additional structure on the manifold and is not a replacement of the background space-time metric. Physical observables are still measured with respect to the background space-time metric, and the connection (covariant derivative) is still usually adopted as that which is compatible with the background metric.

From Eq. (22), it is clear that the $\gamma^{\alpha\beta}_{\pm}$ contain contributions from both the background space-time metric and the medium. The $\gamma^{\alpha\beta}_{\pm}$ fall into one of three categories depending on the properties of the medium:

If the medium parameters are all set to their vacuum values (vanishing media), then $F^{\mu\nu\sigma\rho}=0$ and $a^{\alpha\beta}=\frac{1}{2}g^{\alpha\beta}$ , and the optical metrics degenerate to the space-time metric $\gamma_{+}^{\alpha\beta}=\gamma_{-}^{\alpha\beta}=g^{\alpha\beta}$ . This shows a) that light in vacuum follows the null curves of the space-time and b) that the space-time metric also serves as the optical metric of the vacuum, which itself can be considered a trivial dielectric. The fact that the space-time metric serves as the optical metric in vacuum is not surprising since it is the only available structure in the absence of ponderable media. Ponderable dielectric media provide additional structure that contributes to the optical metric. 2. 2.

For impedance matched media, $F^{\mu\nu\sigma\rho}=0$ . In this case, the two optical metrics are everywhere degenerate (no birefringence) but distinct from the background metric, $\gamma_{+}^{\alpha\beta}=\gamma_{-}^{\alpha\beta}\neq g^{\alpha\beta}$ . Since the square root term containing $F^{\mu\nu\sigma\rho}$ vanishes, the degenerate optical metric now has the properties of a pseudo-Riemannian metric instead of a pseudo-Finslerian metric, but it should still be understood as comprised of some combination of the medium parameters and the background space-time in which the medium resides. 3. 3.

In all other media, the two optical metrics are typically distinct, $\gamma_{+}^{\alpha\beta}\neq\gamma_{-}^{\alpha\beta}\neq g^{\alpha\beta}$ , but it is possible that they degenerate for specially chosen incident waves with $p_{\mu}$ such that $F^{\mu\nu\sigma\rho}p_{\mu}p_{\nu}p_{\sigma}p_{\rho}=0$ . Such $p_{\mu}$ correspond to light propagating along an optical axis of the medium.

IV Kinematics of light in media

The fact that the optical metric is not the same as the space-time metric reflects the fact that light in media is generically nongeodesic and non-null with respect to the background space-time, and the light cone within the medium does not coincide with the light cone of the space-time. We reiterate that although ray propagation is governed by the optical metric, the structure imposed by the background space-time is still present, and an observer still makes physical measurements of length and angle with respect to $g^{\alpha\beta}$ rather than $\gamma^{\alpha\beta}$ .

In this section, we study the kinematical decomposition of a congruence of light in arbitrary dielectric media. The formalism developed here enables ray tracing through smoothly varying media and allows us to follow the congruence regardless of whether it is null, timelike, or spacelike with respect to the background metric.

We begin with Hamilton’s canonical equations for the congruence, which gives us a general prescription for ray tracing through any media residing in any space-time. Note that since these equations are derived directly from the geometric optics limit of Maxwell’s equations rather than from a Lagrangian, there is no assumption of energy conservation and this formalism may be used for ray tracing through time-dependent media.

Once the congruence has been established, we define a subspace that is transverse to the optical light cone (i.e. the light cone of the medium rather than the background light cone). Physically, for example, this corresponds to looking at the cross sectional slice of a laser beam – a surface of constant phase. We then look at the evolution of this surface as it propagates along the beam to see how it expands or contracts, rotates, or shears.

From here on, we consider only a single propagation eigenstate by choosing either of $H_{+}$ or $H_{-}$ and dropping the index so that we have a generic $H$ . We have seen that in the geometrical optics limit, on-shell solutions of Maxwell’s equations must everywhere satisfy the condition $H=0$ . This means that if light follows some curve $\gamma$ parametrized by $\tau$ , then

[TABLE]

everywhere along the curve. Since $H=H(x^{\mu},p_{\mu})$ is a function on the cotangent bundle,

[TABLE]

which implies Hamilton’s canonical equations

[TABLE]

Let

[TABLE]

denote the tangent to $\gamma$ . Since $\gamma$ is not necessarily geodesic, we have nontrivial acceleration $a^{\alpha}$ ,

[TABLE]

In a vacuum space-time it is true that $g_{\mu\nu}s^{\mu}=s_{\nu}=p_{\nu}$ , but this is no longer true in media. Instead, in the presence of the optical metric $\gamma^{\alpha\beta}$ Hamilton’s equations result in $s^{\mu}=\gamma^{\mu\nu}p_{\nu}$ , and subsequently it is no longer true that $g_{\alpha\beta}s^{\alpha}s^{\beta}=0$ . An observer making measurements relative to the space-time metric will find $s^{\mu}\neq g^{\mu\nu}p_{\mu}$ , and will be lead to the conclusion that the Poynting vector does not always coincide with the direction of wave propagation inside anisotropic media, but from Hamilton’s equations and the form of $H$ it is always true that $s^{\alpha}p_{\alpha}=0$ .

It is worth noting that there is an arbitrariness in the definition of $H$ since if $H(x^{\mu},p_{\mu})=0$ then $C(x^{\mu})H(x^{\mu},p_{\mu})=0$ with monotonic $C(x^{\mu})\neq 0$ will also be satisfied. This is equivalent to a rescaling of the parameter along $\gamma$ . The curve $\gamma$ therefore represents an equivalence class of curves through the same space-time points but visited at different parameter values.

The preceding statements would benefit from some clarification. A solution to Maxwell’s equations in the geometric optics limit requires information about both the position $x^{\mu}$ and the wave (co)vector $p_{\mu}$ . The parametrized curve $\gamma$ is a curve $\gamma:\mathbb{R}\to T^{*}M$ in the cotangent bundle given by

[TABLE]

Thus, the space of solutions to Maxwell’s equations is the eight-dimensional cotangent bundle (i.e., the phase space), and Hamilton’s Eqs. (34) provide eight equations for the eight components of the solution curve. On the other hand, the physical ray trajectory, $\tilde{\gamma}$ , is a curve on the space-time manifold $M$ found by projecting from the cotangent bundle with

[TABLE]

Since $\pi$ simply selects the $x^{\mu}$ components of a point $q\in T^{*}M$ , the projection of $\gamma(\tau)$ into $\tilde{\gamma}(\tau)$ is a relatively trivial matter, but confusion can arise if the distinction is not pointed out. For example, the tangent to a curve in $T^{*}M$ has components over $\{\partial/\partial p_{\mu}\}$ and so we see derivatives with respect to $p_{\mu}$ in Eq. (33), whereas the derivative of a function $F(x^{\mu})$ on $M$ along $\tilde{\gamma}$ would only be of the form $s^{\mu}F_{,\mu}$ . Similarly, denoting $s^{\mu}$ as the “tangent to $\gamma$ ” as mentioned just before Eq. (36) is misleading, as it is actually the projection of the tangent to $\gamma$ , or equivalently the tangent of the projected curve $\tilde{\gamma}$ . The same is true for the acceleration of Eq. (36), which is actually that of $\tilde{\gamma}$ .

IV.1 Jacobi fields

With a congruence of rays representing solutions to Eqs. (34) in hand, we look for a characterization of the subspace transverse to the congruence. To reiterate, we are now looking at a congruence of ray trajectories $\bar{\gamma}(\tau)\subset M$ that are the projection onto $M$ of a congruence of solution curves $\gamma(\tau)\subset T^{*}M$ , and we choose one particular curve $\tilde{\gamma}$ to act as a reference curve. We take the chosen reference curve to be the particular solution

[TABLE]

to Eqs. (34), and thus $\bm{u}$ is tangent to the chosen curve $\tilde{\gamma}(\tau)$ .

The first step is to find a set of vector fields that can be tracked along the congruence and which will provide a measure for changes along the congruence. It is always possible to find a set of vector fields $\{\bm{\xi}\}$ that are Lie invariant along the curve such that

[TABLE]

Such vector fields are called Jacobi fields. It may be shown that at a point $p$ on the reference curve $\tilde{\gamma}$ , the vector $\bm{\xi}(p)$ is tangent to a curve (not a member of the congruence) that connects $p$ to a point $q$ on a nearby curve in the congruence Poisson (2004). Physically, the Jacobi field appears to fill the role of characterizing nearby geodesics, but note that the tangent vector field $\bm{u}$ is itself Lie invariant along $\tilde{\gamma}$ , so not every Jacobi field fulfills the desired role of pointing to a nearby curve in the congruence, and what we want is a Jacobi field that points transversely to a nearby curve (at least initially).

For timelike congruences in vacuum, one typically enforces the choice $\xi^{\beta}u_{\beta}=0$ to ensure that $\bm{\xi}$ is transverse, but since $u^{\alpha}k_{\alpha}=0$ this condition is, by itself, insufficient to guarantee that $\bm{\xi}$ does not have a component along $\bm{u}$ . Consequently, at any point $p$ on $\gamma$ , one may wish to decompose $\bm{\xi}$ into a piece that is parallel to $\bm{u}$ and a piece that is orthogonal to $\bm{u}$ ,

[TABLE]

recognizing that only the orthogonal part of $\bm{\xi}$ provides the desired information about the transverse behavior of the congruence. But as shown below, this is only a partial decomposition of $\bm{\xi}$ that must be augmented by a missing component.

IV.2 Projection operator

In vacuum, the self-orthogonality of a null curve, $u^{\mu}u_{\mu}$ =0, makes it difficult to identify the truly transverse component of $\bm{\xi}$ . To do so requires an operator that projects a vector into the subspace transverse to $\bm{u}$ . As mentioned at the beginning of this section, the cross sectional measurement of a laser beam is made at some instant of time at a fixed position along the beam length; e.g., for a beam propagating in the $z$ -direction, one would make a measurement in the $xy$ plane at some fixed $z$ . The transverse subspace is therefore two dimensional, while the complimentary space (the light cone) is also two dimensional. The vector $\bm{u}$ does not, by itself, span the light cone, and we require a second null vector to help define the transverse subspace via a projection operator.

In the literature, dealing with null congruences in vacuum, this “auxiliary null vector” is usually introduced arbitrarily, with the arbitrariness justified post facto because it drops out of the final answer. However, this turns out to no longer be true in the more general setting pursued here, and we will be forced to seek a canonical construction of the auxiliary vector that is based on some physically meaningful quantity. The details of this construction will be deferred until Sec. V. For now, we assume the existence of two future-directed lightlike pairs $(u^{\alpha},k_{\alpha})$ , and $(v^{\alpha},\ell_{\alpha})$ at every point, that satisfy

[TABLE]

Defining the projection operator

[TABLE]

one may readily demonstrate that $u^{\beta}h^{\alpha}_{\beta}=v^{\beta}h^{\alpha}_{\beta}=k_{\alpha}h^{\alpha}_{\beta}=\ell_{\alpha}h^{\alpha}_{\beta}=0$ and that

[TABLE]

Thus, if $\bm{u}$ and $\bm{v}$ span the light cone at a point $p\in M$ , then for any any arbitrary vector $\bm{c}$ , $\bm{h}(\bm{c})$ is orthogonal to both $\bm{u}$ and $\bm{v}$ and therefore lies in the subspace transverse to the light cone.

The trace

[TABLE]

indicates that the subspace transverse to both $\bm{u}$ and $\bm{v}$ is still two dimensional, despite the fact that the trajectories followed by light may in fact be timelike or spacelike with respect to the background metric. One can already see some of the issues that will be faced in dealing with this definition of the projection operator: For null geodesics in vacuum the usual technique is to set the denominators appearing in Eq. (43) to -1, but since in general media the optical metric depends on $p_{\mu}$ , the denominators $(\ell_{\sigma}u^{\sigma})=(\gamma_{\sigma\rho}(\bm{\ell})v^{\rho}u^{\sigma})$ and $(k_{\sigma}v^{\sigma})=(\gamma_{\sigma\rho}(\bm{k})u^{\rho}v^{\sigma})$ are not even the same, and it is unclear whether, and under what conditions, they may be made constant, and whether they can be simultaneously constant.

Since scalar factors like $(\ell_{\sigma}u^{\sigma})$ or $(k_{\sigma}v^{\sigma})$ appear frequently, we suppress the indices and write $(\ell\cdot u)$ or $(k\cdot v)$ . In all such factors it should be understood that a vectorial quantity is being contracted with a covectorial quantity and does not involve the metric.

IV.3 Transverse evolution of a congruence in media

With this more detailed understanding of the null structure at a point, we can see that the most general decomposition of any vector is of the form

[TABLE]

Since $h^{\alpha}_{\beta}u^{\beta}=h^{\alpha}_{\beta}v^{\beta}=0$ and $\bar{\xi}^{\alpha}$ is defined to lie in the transverse subspace, it follows that

[TABLE]

The sought-after transverse kinematics of the congruence is a description of the evolution of $\bar{\bm{\xi}}$ along the chosen reference curve $\tilde{\gamma}$ . If $\tau$ parametrizes the curve, then near the point $\tau_{0}$

[TABLE]

In components we have

[TABLE]

Since $\bm{\xi}$ is a Jacobi field, Eq. (40) implies

[TABLE]

whence

[TABLE]

Note that since $h^{\alpha}_{\mu}u^{\mu}=0$ , then $(h^{\alpha}_{\mu}u^{\mu})_{;\beta}=0$ implies

[TABLE]

so we have

[TABLE]

The antisymmetrization immediately kills any terms dependent on $A$ , but the other terms must be examined more carefully. It follows from Eq. (43) that

[TABLE]

Calculating the $B$ term in Eq. (53), one finds

[TABLE]

By Eq. (42), $u^{\beta}k_{\beta;\tau}=-k_{\tau}u\indices{{}^{\tau}_{;\beta}}$ and $v^{\beta}\ell_{\beta;\tau}=-\ell_{\tau}v\indices{{}^{\tau}_{;\beta}}$ , allowing Eq. (55) to be recast as

[TABLE]

So far we have found that, fortunately, the evolution of $\bar{\bm{\xi}}$ does not depend on either of the undetermined parameters $A$ or $B$ in Eq. (46). Turning attention to the $\bar{\xi}^{\beta}$ term of Eq. (53), and using the convolution property of $h^{\alpha}_{\beta}$ given in Eq. (44) to write $\bar{\xi}^{\alpha}=h^{\alpha}_{\omega}\xi^{\omega}=h^{\alpha}_{\lambda}h^{\lambda}_{\omega}\xi^{\omega}=h^{\alpha}_{\lambda}\bar{\xi}^{\lambda}$ , one finds

[TABLE]

By the action of the projection operator on $\ell_{\alpha}$ and $k_{\alpha}$ , together with the fact that $u^{\sigma}k_{\sigma;\beta}=-k_{\sigma}u\indices{{}^{\sigma}_{;\beta}}$ ,

[TABLE]

It may here be seen that $\dot{\bar{\bm{\xi}}}$ has terms proportional to $\bm{u}$ and $\bm{v}$ , so even though $\bar{\bm{\xi}}$ is initially transverse it may develop components along $\bm{u}$ and $\bm{v}$ . Note that the second term may be written

[TABLE]

showing conclusively that there are no additional transverse components hidden in this term since it is annihilated by $h^{\mu}_{\alpha}$ . Finally, we express the evolution of $\bar{\xi}^{\alpha}$ by

[TABLE]

with

[TABLE]

In general, $\bm{B}$ may be decomposed into trace and traceless parts. Taking the trace, one finds it comes entirely from the purely transverse part of $\bm{B}$ ,

[TABLE]

One might be tempted to represent the trace component of $\bm{B}$ as something proportional to $\Theta\,\delta^{\alpha}_{\lambda}$ , but in general $\delta^{\alpha}_{\lambda}$ does not lie in the transverse subspace, so to ensure that the trace component is correctly represented as lying entirely in the transverse subspace we project $h^{\alpha}_{\sigma}\delta^{\sigma}_{\rho}h^{\rho}_{\lambda}=h^{\alpha}_{\lambda}$ to write the decomposition

[TABLE]

where the factor of $\tfrac{1}{2}$ is explained by recalling that $h^{\alpha}_{\alpha}=2$ .

In the literature, the transverse traceless part, $\hat{\bm{B}}$ , is usually further reduced to its symmetric and antisymmetric parts, of which the actions on $\bar{\bm{\xi}}$ through Eq. (60) are understood as rotation and shear. In our more generalized setting, discussions of symmetry beg the question “symmetric with respect to what?” For a tensor like $g_{\mu\nu}$ , there is a well-defined meaning of symmetry, which is that given two vectors $a^{\mu}$ and $b^{\mu}$ , $g_{\mu\nu}a^{\mu}b^{\nu}=g_{\mu\nu}b^{\mu}a^{\nu}$ , or written another way, that $\bm{g}(\bm{a},\bm{b})=\bm{g}(\bm{b},\bm{a})$ . But a mixed tensor like $B\indices{{}^{\alpha}_{\lambda}}$ accepts two different types of objects as its arguments which cannot simply be interchanged. Given $\bm{B}(\bm{k},\bm{u})=B\indices{{}^{\alpha}_{\lambda}}k_{\alpha}u^{\lambda}$ , there is no meaning to the interchange $\bm{B}(\bm{k},\bm{u})\to\bm{B}(\bm{u},\bm{k})$ , because the first argument of $\bm{B}(\cdot\,,\cdot)$ must be a covector rather than a vector, while the second must be a vector rather than a covector. Any measure of the symmetry of $\bm{B}$ must therefore be made relative to some other known tensor that provides a map between a vector space and its dual. The obvious choice is the metric tensor, whence the symmetry or antisymmetry of $B_{\alpha\beta}=g_{\alpha\sigma}B\indices{{}^{\sigma}_{\beta}}$ has a well-defined meaning. Although we may seem to belabor this question here, the point is that we now have at our disposal a second natural choice of measure – the optical metric.

Framed another way, since $\bm{B}$ is the infinitesimal transformation of a real 4-vector $\bar{\bm{\xi}}_{p}\in T_{p}(M)\sim\mathcal{M}$ (where $\mathcal{M}$ denotes flat Minkowski space-time), the most natural interpretation seems to be that $\bm{B}$ is an element of $\mathfrak{o}(1,3)$ , the Lie algebra of O(1,3) – the group of orthogonal transformations. After separating out the trace, the remaining traceless parts must lie in the subgroup $\mathfrak{so}(1,3)$ , the Lie algebra of SO(1,3) – the group of norm-preserving orthogonal transformations where the norm is given by the background metric $\bm{g}$ . But again, one could instead prefer to define a group of norm-preserving transformations with respect to the optical metric rather than the background metric, and since the optical metric reduces to the background metric in vacuum there is no inconsistency in this choice.

We argue that an observer continues to make measurements with respect to the background metric and that this is consistent with real-world practice. Thus, we suggest that the traceless parts of $\bm{B}$ are most naturally decomposed with respect to the generators of $\mathfrak{so}(1,3)$ . The transverse traceless part $\hat{\bm{B}}$ will be decomposed with respect to those generators that lie in the purely spatial transverse space and will therefore correspond to rotation and shear transformations in this plane. The complimentary, or longitudinal, space is not purely spatial, and therefore the decomposition of the longitudinal traceless part, $(\delta^{\alpha}_{\beta}-h^{\alpha}_{\beta})u^{\sigma}h^{\beta}_{\lambda;\sigma}$ will include generators corresponding to boosts. These boosts correspond to the fact that $\bar{\bm{\xi}}$ can acquire a component along $\bm{u}$ , meaning that different parts of a phase front can travel at different speeds within the medium, such that a single phase front intercepted at a distant detector is absorbed over some finite window of time.

V generalized Raychaudhuri equation in media

The Raychaudhuri equation tells us about the evolution of the expansion along the congruence, $\dot{\Theta}$ , which essentially tells us how the focus of a beam changes along its length, and is instrumental in the proof of focusing and singularity theorems Hawking and Ellis (1975). In curved vacuum space-times, the Raychaudhuri equation for null geodesic congruences is

[TABLE]

This version of the Raychaudhuri equation admits the possibility of nonaffine parametrization of the curve, in which case $\bm{a}=\kappa\bm{u}$ and the term $a\indices{{}^{\rho}_{;\rho}}$ contributes $\kappa\Theta$ . The presence of the Riemann tensor in the Raychaudhuri equation means that it is possible to use congruences as a probe for curvature.

In the spirit of adapting tools from general relativity for use in optics, this section will generalize the Raychaudhuri equation to light propagating through dielectric media residing in a curved background space-time. In particular, the idea of an equivalence between curved space-times and dielectric media suggests that the generalized Raychaudhuri equation should contain a tensor term constructed out of the optical metric and its derivatives, similar to the Riemann tensor term in Eq. (64). The existence of such a tensor would be quite useful for the analysis of light propagation in media. However, while we do indeed find that the generalized equation is modified through the addition of terms proportional to the optical metric and its derivatives, these do not occur in such a way that they might be identified as an effective Riemann or curvature tensor.

Begin by taking the derivative of $\Theta$ along the curve, which from Eq. (62) is

[TABLE]

Our strategy is to write this as the usual Raychaudhuri equation plus some additional terms that arise as a result of the presence of the medium. With a little bit of adding and subtracting, we can rewrite

[TABLE]

and expand the Riemann tensor term to

[TABLE]

We therefore may write

[TABLE]

Consider the final two terms. Substituting Eq. (43) for the projection operator, one finds

[TABLE]

Here we have introduced the notation

[TABLE]

Meanwhile, using Eq. (43) in $\dot{h}^{\beta}_{\alpha}u\indices{{}^{\alpha}_{;\beta}}$ results in

[TABLE]

Inserting these expressions into Eq. (68) and collecting terms, we can write

[TABLE]

Now use the identity

[TABLE]

to find

[TABLE]

At this stage, we have managed to write the Raychaudhuri equation as the vacuum equation plus some additional terms. This is still not a very illuminating form for the Raychaudhuri equation since it depends on the unknown quantity $u\indices{{}^{\mu}_{;\nu}}$ and is not written in terms of the optical metric and its derivatives, as desired. Even more importantly, it appears to depend on the arbitrarily chosen pair $(\bm{\ell},\bm{v})$ , which is problematic for assigning any physical meaning to the equation. It is instructive to look at some specific examples to examine the dependence on $(\bm{\ell},\bm{v})$ .

V.1 Nonaffine geodesics in vacuum

In vacuum it is true that $k_{\alpha}=u_{\alpha}$ . Then $u_{\alpha}u\indices{{}^{\alpha}_{;\beta}}=\frac{1}{2}(u_{\alpha}u^{\alpha})_{;\beta}=0$ , from which it follows that $(k\cdot a)=0$ and $(k\cdot a_{v})=0$ ; thus all such terms vanish from Eq. (74). To see that this expression reduces to the usual one in vacuum, consider the general case of nonaffinely parametrized geodesics such that $\bm{a}=\kappa\bm{u}$ . One readily calculates

[TABLE]

Since in this case

[TABLE]

we recover the usual expression in vacuum for nonaffinely parametrized geodesics

[TABLE]

V.2 Nongeodesic curves in media

It turned out, somewhat miraculously, that in vacuum the pair $(\bm{\ell},\bm{v})$ completely dropped out of the final expression for $\dot{\Theta}$ in Eq. (77) and so may be chosen arbitrarily. This was a consequence of the fact that $\bm{k}$ in vacuum is orthogonal to both $\bm{a}$ and $\bm{a}_{v}$ , and the geodesic nature of the curve that restricts $\bm{a}$ to the form $\bm{a}=\kappa\bm{u}$ . In general refracting media, we are faced with the fact that none of these conditions hold. In particular, since $\gamma_{\alpha\beta}u^{\alpha}u^{\beta}=0$ implies

[TABLE]

we instead have that the most general form of $a^{\mu}$ admits

[TABLE]

where

[TABLE]

is antisymmetric. For congruences of timelike particles, $f\indices{{}^{\mu}_{\nu}}$ represents an external force on the particle. It is unclear whether there exists a corresponding quantity for light in media, and in what follows we assume that this term does not contribute.

To compare with the vacuum result, consider just the parametrization artifact part of the acceleration $\kappa\bm{u}$ . First, note that now

[TABLE]

All the terms proportional to $\bm{\ell}$ or $\bm{v}$ in Eq. (74) still conspire to provide the $\kappa^{2}$ term in the expression for $\kappa\Theta$ , but we are now left with

[TABLE]

Thus, even for the relatively simple parametrization-dependent component of $\bm{a}$ we start seeing nontrivial dependence on the choice of $(\bm{\ell},\bm{v})$ . Of course, the parametrization may be chosen such that $\kappa=0$ , but the other terms in $\bm{a}$ will contribute similarly, and the point is that the final answer for the expansion appears to depend on the choice of $(\bm{\ell},\bm{v})$ . This presents quite a big conceptual problem since at this stage the pair $(\bm{\ell},\bm{v})$ was chosen arbitrarily and so should carry no physical significance; thus, a result that depends on $\bm{v}$ is not well defined.

V.3 Raychaudhuri equation for affinely parametrized congruences

It is clear that we must look for a canonical choice of $(\bm{\ell},\bm{v})$ that carries some physical significance. Ideally, the pair $(\bm{\ell},\bm{v})$ should be related by the same optical metric as the solution pair $(\bm{k},\bm{u})$ , because $\gamma_{\alpha\beta}(\bm{\ell})=\gamma_{\alpha\beta}(\bm{k})$ would significantly improve our ability to simplify both the projection operator $h^{\alpha}_{\beta}$ and ultimately the final answer. Let us seek guidance from the simple case of a light ray propagating through vacuum in the $x$ -direction. In this simplified case the vector $\bm{u}$ is tangent to the solution curve $\tilde{\gamma}$ at the point $\tilde{\gamma}(\tau_{0})=(t_{0},x_{0})$ , as depicted in Fig. 1.

If $\bm{t}$ is tangent to the coordinate line $t$ at this point, then

[TABLE]

is a future-directed null vector with spatial component along $-x$ and associated wave vector

[TABLE]

Thus, given $\bm{u}$ and the coordinate time $t$ of a local observer, we have constructed a second null vector $\bm{v}$ such that $\bm{u}$ and $\bm{v}$ span the light cone at $\tilde{\gamma}(\tau_{0})$ .

It is important to note that $(\bm{\ell},\bm{v})$ is not necessarily a solution in the sense that $\bm{v}$ is tangent to a solution curve with $\bm{\ell}=\mathrm{d}S^{\prime}$ for phase function $S^{\prime}$ ; i.e., we have not shown whether, or under what conditions, $\bm{\ell}$ is exact. To be specific, the time coordinate gives us a curve in a local chart $t:M\to\mathbb{R}$ . The exterior derivative of this $\mathbb{R}$ -valued function on $M$ is $\mathrm{d}t$ . If the space-time is static, then $(\mathrm{d}t)^{\sharp}=\frac{\partial}{\partial t}=\bm{t}$ , which is indeed the expected tangent vector to the coordinate $t$ . Therefore, if the space-time is static, we have

[TABLE]

where the last equality holds because in vacuum we can always scale the null vector $\bm{u}$ such that $\mathrm{d}(\bm{g}(\bm{u},\bm{t}))=0$ . Thus if we consider $\bm{u}$ as tangent to the “outgoing” ray, then in static space-times, the complimentary null vector $\bm{v}$ is indeed tangent to the “ingoing” ray. The condition of static space-times indicates a sense of reciprocity: if a wave front of the outgoing ray is retroreflected, it will follow the same spatial path through the manifold, passing back through the point $x_{0}$ with tangent $\bm{v}$ .

Taking this reasoning as a starting point, let $t^{\mu}$ be a vector field parallel to the world lines of a family of inertial observers who measure the medium parameters, but normalized with $\gamma_{\alpha\beta}(k)t^{\alpha}t^{\beta}=-1$ . Since the optical metric is conformally invariant, it can always be arranged that $\gamma_{\alpha\beta}t^{\alpha}t^{\beta}=g_{\alpha\beta}t^{\alpha}t^{\beta}=-1$ , so this normalization is indeed the natural one for the inertial observers. Now let the pair $(\bm{\ell},\bm{v})$ be defined similarly to Eqs. (83) and (84), but where the optical metric replaces the space-time metric

[TABLE]

Here we have written $(\bm{k}\cdot\bm{t})=\gamma_{\mu\nu}u^{\mu}t^{\nu}$ as a shorthand reminder that the product is with respect to the optical metric rather than the space-time metric, as in Eq. (83). It is straightforward to show that $(\bm{\ell}\cdot\bm{u})=(\bm{k}\cdot\bm{v})=-2(\bm{k}\cdot\bm{t})^{2}$ .

Although $(\bm{\ell},\bm{v})$ completes the null basis of the light cone as in the vacuum case, it does not necessarily correspond to an ingoing solution curve $\gamma^{\prime}$ . For this to happen, we would further require

[TABLE]

which requires not only static space-times but reciprocal media. Since all we need is a complete null basis at each point visited by the reference curve of the congruence, the construction of $(\bm{\ell},\bm{v})$ by Eqs. (86) and (87) is sufficient to trace the congruence $\tilde{\gamma}$ . In vacuum, most treatments scale $(\bm{\ell}\cdot\bm{u})=(\bm{k}\cdot\bm{v})=-1$ . In media, however, such a scaling would require further assumptions on the medium. The quantity $(\bm{k}\cdot\bm{t})$ has a physical interpretation as the frequency measured by the observer. For interesting applications, for example in transformation optics, it is desirable to allow for frequency-modulating media Cummer and Thompson (2011). For this reason we do not require $(\bm{k}\cdot\bm{t})$ to be constant.

Lastly, we assume that the congruence is affinely parametrized and that the only contribution to $\bm{a}$ from Eq. (79) is

[TABLE]

Recall that our task is to calculate

[TABLE]

With the physically meaningful construction of $(\bm{\ell},\bm{u})$ given by Eqs. (86) and (87) now at our disposal, we find it most straightforward to return to the beginning and calculate each unknown term in turn. The projection operator becomes

[TABLE]

The details of the lengthy calculation are presented in the Appendix. The result is

[TABLE]

It is easy to see that this expression reduces to the usual one in the vacuum limit $\gamma_{\alpha\beta}\to g_{\alpha\beta}$ since the covariant derivative $g_{\alpha\beta;\mu}=0$ , but there are some other interesting features to note. The first is that although we might have expected to be able to identify a Riemann-like tensor term built out of second derivatives of the optical metric we find this is not the case. A single second derivative term appears, but this is not enough to identify a Riemann-like tensor from which we can define an effective curvature of the medium. Also of note is that this expression still depends on $\bm{t}$ through the dependence on the projection operator $h^{\beta}_{\alpha}$ . We earlier found that the arbitrarily chosen null vector $\bm{v}$ appeared in $\dot{\Theta}$ , which forced us to seek a canonical choice of $\bm{v}$ that reflected some physical meaning, which we did via the world lines of a preferred set of observers – those who make measurements of the material parameters and the congruence. So, we should not be too surprised by the continued presence of $\bm{t}$ in the final expression, but one might have harbored some hope that the $\bm{t}$ -dependence would drop out by some fortuitous cancellation. Another interesting and unexpected feature is the interaction of the optical metric and the background metric through the Riemann tensor. In the vacuum limit, these terms combine to give us the usual Riemann tensor term, but here we see that this contribution is actually split between the background metric and the optical metric.

In vacuum and for nonrotating congruences, one finds that $\dot{\Theta}$ is driven by the Riemann tensor term, and the positivity of space-time curvature forces a focusing of the congruence. In flat space-time, the Riemann tensor vanishes, and $\dot{\Theta}$ is primarily driven by first and second derivatives of the optical metric. These derivative terms can contribute with either sign, leading to a corresponding focusing or defocusing of the congruence.

Further insight may be gained through more detailed knowledge of the optical metric $\gamma^{\alpha\beta}$ . A better understanding of how the individual dielectric parameters of permeability, permittivity, and magnetoelectric couplings contribute to the optical metric would show how they each contribute to the focusing of a beam. The optical metric must contain contributions from both the background metric and the medium parameters, but an expression for $\gamma^{\alpha\beta}$ in terms of $g^{\alpha\beta}$ and $\chi\indices{{}_{\alpha\beta}^{\mu\nu}}$ is not currently known. Since the medium parameters are also measured by the inertial observers $\bm{t}$ , it is possible that the unknown expression for $\gamma^{\alpha\beta}$ could be formulated in such a way that $\bm{t}$ also appears.

VI Conclusions

In the past decade, transformation optics has renewed interest in the longstanding recognition of the similarity of the refractive properties of dielectric media to those of curved space-times. This has inspired a program that seeks to apply a range of tools and techniques from general relativity to the analysis of electrodynamics in dielectric media. In particular, several authors are working on covariant formulations of electrodynamics within dielectric media. In our approach, we recognize that any real medium must reside within a background space-time, and that the structure imposed by the medium is supplemental to that of the background space-time. It is desirable, therefore, to have a framework for electrodynamics in media that explicitly and separately accounts for the medium and the background.

These considerations motivate us to study the detailed behavior of light propagation through linear dielectric media residing within a curved background space-time. We have derived the covariant kinematics for congruences of light in dielectric media and used this to formulate a generalized Raychaudhuri equation in media. The covariant kinematics allow us to go beyond ray trajectories to analyze physically meaningful and measurable characteristics of beam propagation like the expansion, shear, and rotation of the cross section of a beam.

Since the behavior of congruences, governed by the Raychaudhuri equation, serves as a probe of curvature, we supposed that the derivation of a generalized Raychaudhuri equation in media should allow the identification of a Riemann-like tensor for the medium. Our expectation was that we would be able to identify such a tensor term in the generalized Raychaudhuri equation, comprised of combinations of second derivatives of the optical metric, and which does not vanish for media within Minkowski space-time.

Our derivation showed that the Raychaudhuri equation for media residing in flat space-time is dominated by first and second derivatives of the optical metric but that these quantities do not appear to have any special corresponding physical interpretation in general relativity. We are thus unable to identify a Riemann-like tensor describing the curvature of linear dielectric media. One interesting feature of the generalized Raychaudhuri equation is a dependence on the auxiliary pair $(\bm{v},\bm{\ell})$ . In the vacuum derivation the auxiliary vector drops out of the final result and so may be chosen arbitrarily, but its continued presence in the generalized equation forced us to seek a canonical construction of $(\bm{v},\bm{\ell})$ , which was possible through the choice of a timelike vector field $\bm{t}$ . We argue that this choice is canonical and corresponds to the world lines of a family of timelike observers who are performing the measurements.

The techniques and results presented here can be used for more accurate congruence tracing through dielectric media, which should be useful for studying the behavior of optical devices, particularly those devised through transformation optics. For example, we have used these results to study the expansion of congruences for the case of a dielectric analog space-time where the analog is taken to reside in a flat background space-time Fathi and Thompson (2016).

Although this work was primarily motivated by questions in transformation optics and dielectric analog space-times, it would also be possible and interesting to apply these results to ray or congruence tracing in astrophysical settings. For example radio waves propagating through dust clouds or accretion disks near massive compact objects could experience some dielectric refraction in addition to the effects of space-time curvature. Such refractive effects could have implications for cosmological surveys based on gravitational lensing.

Lastly, we have so far only considered the first order behavior of the transverse vector $\bar{\xi}$ , as can be seen in Eq. (48). By extending this work to the second order behavior it would be possible to generalize the geodesic deviation equation to light in media. Such a generalized deviation equation would provide the starting point for even more accurate ray tracing through dielectric media. We leave this derivation as the subject of future study.

Appendix A Detailed calculation of Raychaudhuri equation in media

In this Appendix we present a fully detailed account of the calculations for the Raychaudhuri equation presented in Sec. V.3. The Raychaudhuri equation is

[TABLE]

With the assumptions made in Sec. V.3, the projection operator becomes

[TABLE]

We will require $\dot{h}^{\beta}_{\alpha}$ :

[TABLE]

We calculate each unknown term of Eq. (93) in the following subsections.

A.1 $h_{\alpha}^{\beta}R\indices{{}^{\alpha}_{\lambda\beta\tau}}u^{\lambda}u^{\tau}$

[TABLE]

A.2 $h_{\alpha}^{\beta}a\indices{{}^{\alpha}_{;\beta}}$

We assume that the only contribution to the acceleration comes from the medium term

[TABLE]

First take the derivative

[TABLE]

Next, operate with the projection operator

[TABLE]

We can rewrite this with some clever addition and subtraction:

[TABLE]

By Eq. (61), this now becomes

[TABLE]

Consider the term $(\delta_{\beta}^{\rho}-h_{\beta}^{\rho})(u\indices{{}^{\beta}_{;\lambda}}h^{\lambda}_{\alpha}-\dot{h}_{\alpha}^{\beta})$ . We have already calculated $\dot{h}^{\beta}_{\alpha}$ above, so first calculate

[TABLE]

then combine with Eq. (95) to get

[TABLE]

To get $(\delta_{\beta}^{\rho}-h_{\beta}^{\rho})(u\indices{{}^{\beta}_{;\lambda}}h^{\lambda}_{\alpha}-\dot{h}_{\alpha}^{\beta})$ , first note that $h^{\alpha}_{\rho}$ annihilates both $u^{\rho}$ and $t^{\rho}$ , so $(\delta_{\beta}^{\rho}-h_{\beta}^{\rho})u^{\beta}=u^{\rho}$ , and similarly for $t^{\beta}$ . Thus

[TABLE]

So there are only two terms left to deal with. First,

[TABLE]

where going from the first to the second line used a very useful identity that will be used several times again:

[TABLE]

Secondly,

[TABLE]

so that

[TABLE]

Returning to $h_{\alpha}^{\beta}a\indices{{}^{\alpha}_{;\beta}}$ we can now find

[TABLE]

We commute the covariant derivatives on

[TABLE]

and expand the projection operator on

[TABLE]

which brings us to the final form for the term $h_{\alpha}^{\beta}a\indices{{}^{\alpha}_{;\beta}}$

[TABLE]

A.3 $\dot{h}_{\alpha}^{\lambda}u\indices{{}^{\alpha}_{;\lambda}}$

Our task here is somewhat simplified by the fact that we already calculated $\dot{h}_{\alpha}^{\lambda}$ in the last section. We have

[TABLE]

A.4 $h^{\mu}_{\rho}u\indices{{}^{\rho}_{;\tau}}(\delta^{\tau}_{\nu}-h^{\tau}_{\nu})(\delta^{\nu}_{\sigma}-h^{\nu}_{\sigma})u\indices{{}^{\sigma}_{;\beta}}h^{\beta}_{\mu}=h^{\beta}_{\rho}u\indices{{}^{\rho}_{;\nu}}(\delta^{\nu}_{\sigma}-h^{\nu}_{\sigma})u\indices{{}^{\sigma}_{;\beta}}$

Start with

[TABLE]

Adding and subtracting allows us to write this as

[TABLE]

Next expand the projection operator on the second line

[TABLE]

where we have made use of the fact that $\gamma_{\mu\nu}t^{\mu}t^{\nu}$ is constant to write

[TABLE]

We are left with

[TABLE]

With Eqs. (96), (112), (113), and (118) and some final rearranging, we finally have the sought-after solution

[TABLE]

Bibliography51

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Leonhardt and Philbin (2006) Ulf Leonhardt and Thomas G. Philbin, “General relativity in electrical engineering,” New J. Phys. 8 , 247 (2006) . · doi ↗
2Pendry et al. (2006) J. B. Pendry, D. Schurig, and D. R. Smith, “Controlling electromagnetic fields,” Science 312 , 1780–1782 (2006) . · doi ↗
3Schurig et al. (2006) D. Schurig, J. J. Mock, B. J. Justice, S. A. Cummer, J. B. Pendry, A. F. Starr, and D. R. Smith, “Metamaterial electromagnetic cloak at microwave frequencies,” Science 314 , 977–980 (2006) . · doi ↗
4Mc Call et al. (2011) Martin W Mc Call, Alberto Favaro, Paul Kinsler, and Allan Boardman, “A spacetime cloak, or a history editor,” J. Opt. 13 , 024003 (2011) . · doi ↗
5Gratus et al. (2016) Jonathan Gratus, Paul Kinsler, Martin W. Mc Call, and Robert T. Thompson, “On spacetime transformation optics: temporal and spatial dispersion,” New J. Phys. 18 , 123010 (2016) .
6Paul and Rahm (2012) Oliver Paul and Marco Rahm, “Covariant description of transformation optics in nonlinear media,” Opt. Express 20 , 8982–8997 (2012) . · doi ↗
7Bergamin et al. (2011) L. Bergamin, P. Alitalo, and S. A. Tretyakov, “Nonlinear transformation optics and engineering of the kerr effect,” Phys. Rev. B 84 , 205103 (2011) . · doi ↗
8Unruh (1981) W. G. Unruh, “Experimental black-hole evaporation?” Phys Rev Lett 46 , 1351–1353 (1981) . · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Covariant kinematics of light in media and a generalized Raychaudhuri equation

Abstract

I Introduction

II Goal of this paper

III Geometrical optics limit of covariant electrodynamics in media

III.1 Covariant electrodynamics in media

III.2 Geometric optics limit

IV Kinematics of light in media

IV.1 Jacobi fields

IV.2 Projection operator

IV.3 Transverse evolution of a congruence in media

V generalized Raychaudhuri equation in media

V.1 Nonaffine geodesics in vacuum

V.2 Nongeodesic curves in media

V.3 Raychaudhuri equation for affinely parametrized congruences

VI Conclusions

Appendix A Detailed calculation of Raychaudhuri equation in media

A.1 hαβR\indicesλβταuλuτh_{\alpha}^{\beta}R\indices{{}^{\alpha}_{\lambda\beta\tau}}u^{\lambda}u^{\tau}hαβ​R\indicesλβτα​uλuτ

A.2 hαβa\indices;βαh_{\alpha}^{\beta}a\indices{{}^{\alpha}_{;\beta}}hαβ​a\indices;βα​

A.3 h˙αλu\indices;λα\dot{h}_{\alpha}^{\lambda}u\indices{{}^{\alpha}_{;\lambda}}h˙αλ​u\indices;λα​

A.1 $h_{\alpha}^{\beta}R\indices{{}^{\alpha}_{\lambda\beta\tau}}u^{\lambda}u^{\tau}$

A.2 $h_{\alpha}^{\beta}a\indices{{}^{\alpha}_{;\beta}}$

A.3 $\dot{h}_{\alpha}^{\lambda}u\indices{{}^{\alpha}_{;\lambda}}$