Self-interaction in classical gauge theories and gravitation

B. P. Kosyakov

arXiv:1812.03290·hep-th·April 16, 2019


TL;DR

This paper explores the manifestations of self-interaction in classical gauge theories and gravitation, focusing on topological phases, degrees of freedom, and the physical implications of these phenomena.

Contribution

It provides a systematic analysis of self-interaction manifestations, including topological phases and degrees of freedom rearrangements, in gauge theories and general relativity.

Findings

01

Rearranged Maxwell-Lorentz electrodynamics describes dressed particles and radiation.

02

Topological phases occur in pure field systems.

03

The total energy and momentum of systems with nontrivial topological content, such as black holes, are ambiguous, coordinatization-dependent quantities.


Equations

$$S=-T_{p}\int d^{p+1}u\,\sqrt{-h}\,.$$

$$u^{a}=f^{a}(\bar{u}),\quad h_{ab}(u)=\frac{\partial\bar{u}^{c}}{\partial u^{a}}\,\frac{\partial\bar{u}^{d}}{\partial u^{b}}\,\bar{h}_{cd}(\bar{u}),$$

$$\left(\Box+\mu^{2}\right)\phi=0,$$

$${\cal L}=\frac{1}{2}\left(\partial_{\mu}\phi\,\partial^{\mu}\phi\right)-\frac{\mu^{2}}{2}\,\phi^{2}\,.$$

$${\cal L}=\frac{1}{2}\left(\partial_{\alpha}\phi\,\partial^{\alpha}\phi\right)-\frac{\mu^{2}}{2}\,\phi^{2}-\frac{\lambda^{2}}{4}\,\phi^{4},$$

$$\phi=\Phi+\Phi^{2}F(\Phi)\,.$$

$$E_{i}(\phi)=\frac{\delta S}{\delta\phi_{i}}=0\,.$$

$$\frac{\delta^{2}S}{\delta\phi_{i}\,\delta\phi_{j}}<0$$

$${\cal L}=\frac{1}{2}\left(\partial_{t}\phi\right)^{2}-\frac{1}{2}\left(\partial_{x}\phi\right)^{2}+\frac{\mu^{2}}{2}\,\phi^{2}-\frac{\lambda^{2}}{4}\,\phi^{4}-U_{0}\,.$$

$$U_{0}=\frac{1}{4}\,\mu^{2}\phi_{0}^{2},$$

$$\phi_{0}=\frac{\mu}{\lambda},$$

$${\cal L}=\frac{1}{2}\left(\partial_{t}\phi\right)^{2}-\frac{1}{2}\left(\partial_{x}\phi\right)^{2}-U(\phi),$$

$$U(\phi)=\frac{\lambda^{2}}{4}\left(\phi^{2}-\phi_{0}^{2}\right)^{2}.$$

$$\left(\frac{\partial^{2}}{\partial t^{2}}-\frac{\partial^{2}}{\partial x^{2}}-\mu^{2}\right)\phi=0\,.$$

$$E=\int dx\left[\frac{1}{2}\left(\partial_{t}\phi\right)^{2}+\frac{1}{2}\left(\partial_{x}\phi\right)^{2}+U(\phi)\right],$$

$$\phi(t,x)=\pm\phi_{0}\,.$$

$$\phi=\phi_{0}+\chi\,.$$

$${\cal L}=\frac{1}{2}\left(\partial_{t}\chi\right)^{2}-\frac{1}{2}\left(\partial_{x}\chi\right)^{2}-\mu^{2}\chi^{2}-\lambda\mu\,\chi^{3}-\frac{\lambda^{2}}{4}\,\chi^{4}$$

$$m=\sqrt{2}\,\mu,$$

$$\phi(x)=\pm\phi_{0}\tanh\left[\frac{\mu\,(x-x_{0})}{\sqrt{2}}\right],$$

$$\varepsilon(x)=\mu^{2}\,\frac{\phi_{0}^{2}}{2}\,{\rm sech}^{4}\left[\frac{\mu\,(x-x_{0})}{\sqrt{2}}\right].$$

$$E_{\rm kink}=\int_{-\infty}^{\infty}dx\,\varepsilon(x)=\mu\,\frac{2\sqrt{2}}{3}\,\phi_{0}^{2}\,.$$

$$\aleph_{\rm i}=\frac{\phi(x)}{\phi_{0}}\biggl|_{x=-\infty}\,,\quad\aleph_{\rm f}=\frac{\phi(x)}{\phi_{0}}\biggl|_{x=\infty}\,.$$

$${\cal Q}=\aleph_{\rm f}-\aleph_{\rm i}\,.$$

$${\cal J}^{\mu}=\phi_{0}^{-1}\,\epsilon^{\mu\nu}\partial_{\nu}\phi,$$

$$\partial_{\mu}{\cal J}^{\mu}=0$$

$$\int_{-\infty}^{\infty}dx\,{\cal J}^{0}(x)=\phi_{0}^{-1}\int_{-\infty}^{\infty}dx\,\frac{\partial\phi}{\partial x}=\aleph_{\rm f}-\aleph_{\rm i}\,.$$

$${\cal L}=\frac{1}{2}\left(\partial_{\mu}\phi\,\partial^{\mu}\phi\right)-U(\phi),$$

$${\cal L}=\frac{1}{2}\left(\partial_{\mu}\Phi\right)^{\ast}\partial^{\mu}\Phi+\frac{\mu^{2}}{2}\left(\Phi^{\ast}\Phi\right)-\frac{\lambda^{2}}{4}\left(\Phi^{\ast}\Phi\right)^{2}-U_{0},$$

$${\cal L}=-\frac{1}{16\pi}\left(F^{\mu\nu}F_{\mu\nu}\right)+\frac{1}{2}\left(D_{\mu}\Phi\right)^{\ast}D^{\mu}\Phi+\frac{\mu^{2}}{2}\left(\Phi^{\ast}\Phi\right)-\frac{\lambda^{2}}{4}\left(\Phi^{\ast}\Phi\right)^{2}-U_{0}\,.$$


Full text

Self-interaction in classical gauge theories and gravitation

B. P. Kosyakov

Russian Federal Nuclear Center–VNIIEF, Sarov, 607188 Nizhny Novgorod Region, Russia

and Moscow Institute of Physics & Technology, Dolgoprudny, 141700 Moscow Region, Russia

Electronic address: ${\rm [email protected]}$

1 INTRODUCTION
1.1 Elusive renditions of self-interaction
1.2 Manifestations of self-interaction
1.3 Plan of the review
2 TOPOLOGICAL PHASES
3 SELF-INTERACTION IN ELECTRODYNAMICS
3.1 The Maxwell–Lorentz theory
3.1.1 Radiation
3.1.2 Local balance of energy-momentum
3.1.3 The Abraham–Lorentz–Dirac equation
3.1.4 Another way of looking at the dressed dynamics
3.1.5 Paradoxes and misconceptions
3.2 Electrodynamics in various dimensions
3.2.1 ${\mathbb{R}}_{1,2n-1}$
3.2.2 ${\mathbb{R}}_{1,1}$
3.2.3 ${\mathbb{R}}_{1,5}$
3.3 Massless charged particles
3.4 Action at a distance
3.5 Nonlinear electrodynamics
3.6 Nonlocal interactions
3.7 Particles with spin
4 SELF-INTERACTING GAUGE FIELD SYSTEMS
4.1 Self-interaction in the Yang–Mills–Higgs theory
4.2 Self-interaction in the Yang–Mills–Wong theory
5 CLASSICAL SELF-INTERACTING STRINGS
6 SELF-INTERACTION IN GENERAL RELATIVITY
ACKNOWLEDGMENTS
References

Abstract

To develop a systematic treatment of the self-interaction problem in classical gauge theories and general relativity, we study tenable manifestations of self-interaction: topological phases, and rearrangements of degrees of freedom appearing in the action. We outline the occurrence of topological phases in pure field systems. We show that the rearranged Maxwell–Lorentz electrodynamics is a mathematically consistent and physically satisfactory theory which describes new entities, dressed charged particles and radiation. We extend this analysis to cover different modifications of the Maxwell–Lorentz electrodynamics and the SU $(N)$ Yang–Mills–Wong theory. We take a brief look at a subtle mechanism of self-interaction in classical strings. Turning to general relativity, we note that the total energy and momentum of a system with nontrivial topological content, such as a black hole, are ambiguous, coordinatization-dependent quantities, which resembles the situation with paradoxical decompositions in the Banach–Tarski theorem.

Keywords: topological phases, rearrangement, dressed particle, radiation, gravitational energy-momentum

1 INTRODUCTION

The self-interaction problem was a central subject of study in fundamental physics of the 20th century. This problem was especially pressing from the late 1930s to the early 1970s, in relation to the discovery of ultraviolet divergences in quantum field theory and the subsequent effort to remove or avoid them. The next page of this story was the use of the experience gained in handling theories which suffer from the ultraviolet disease to elaborate criteria for discriminating between appropriate and inappropriate theories. The quest culminated in establishing the Standard Model of particle physics. With the advent of string theory things calmed down. The time has come when lessons from those stupendous developments can be drawn. Since there is an extensive literature covering quantum aspects of the problem, attention may be given to the less analyzed classical aspects.

Traditionally one inquires into the properties of classical self-interaction which share a number of traits with those of quantum self-interaction because the way for eliminating the ultraviolet problems from a classical system may hopefully suggest a cure for such troubles in the quantum incarnation of this system. There are, however, notable classical properties of this phenomenon that bear no relation to the corresponding quantum properties. For example, the behavior of many classical self-interacting systems is irreversible, while the associated quantum regime of evolution is reversible. Both common and distinguishing properties of classical and quantum self-interacting systems are a major preoccupation of the present work.

Eighty years ago, Dirac [52] offered a thorough study of a classical radiating electron, which later furnished the most influential paradigm of self-interaction. Technically, it is much easier to cope with this classical problem than with its quantum counterpart. The classical theory of relativistic charged particles, the Maxwell–Lorentz electrodynamics, can be exactly solved. In contrast, four-dimensional relativistic quantum field theories, specifically quantum electrodynamics, defy all attempts to solve them exactly. The only reliable analytical tool to attack such theories is perturbation series in coupling constants, which, however, fails to grasp non-perturbative effects. Meanwhile it is just these effects which are essential for the proper understanding of the self-interaction mechanisms.

1.1 Elusive renditions of self-interaction

A distinctive feature of the classical picture is the coexistence of particles and fields mediating interactions between these particles. For comparison, the fundamental notion of quantum field theory is a quantized field. Excitations of quantum fields act as particles. Our interest here is with classical systems containing both particles and fields, as well as with pure field systems, that is, systems devoid of particles. We begin with the former.

Abraham [2], [3], [4] and Lorentz [105], [106] pioneered the application of the idea of self-interaction to a nonrelativistic model of the electron as a rigid body of finite extent. They tried to find the force of the electron on itself, that is, the resultant force due to different parts of the charge distribution acting on one another. In doing so they conceived of the electron, affected by this ‘‘self-force’’, as a warp-free body, namely a sphere of diameter $d$ .

However, the very concept of continuous charged matter with a reasonably steady charge distribution is inconsistent in the classical context. Each part of a lump of charged matter exerts a repulsive force on other parts, which causes the lump to become a rarefied medium. A homogeneous mixture of two oppositely charged fluids is also unstable: part of the mixture would collapse forming a neutral cluster, while the remainder possessing an uncompensated residual charge would spread. Poincaré [121] conjectured that stable existence of continuous charged matter is ensured by nonelectromagnetic cohesive forces. A striking implication of this conjecture is that electrodynamics is fundamentally unclosed. Among other things, electromagnetic self-interaction cannot be analyzed separately from the Poincaré force contribution.

One may be inclined to think that the joint action of the cohesive and electromagnetic forces manifests itself in a rigidity condition. Alternatively, the joint action can be realized by calling into play an equation of state adapted to a deformable model of the electron.

However, the existence of rigid bodies is contrary to the theory of relativity (see, e.g., [103], ${\S\,15}$ , where the results of a long discussion of this issue at the dawn of the age of this theory are briefly summarized), and therefore the rendition of self-interaction proposed by Abraham and Lorentz must be abandoned. A more recent attempt to reinstate the rigid body model [40] was based on the belief that there is a fundamental length $\ell$ , of about the same value as $d$ , so that acausal signals are allowable in regions of size comparable with $\ell$ . There are two characteristic lengths related to the electron, the classical radius of the electron $r_{0}=e^{2}/mc^{2}=2.8\cdot 10^{-13}$ cm, and the Compton wavelength of the electron $\lambda_{e}=\hbar/mc=3.9\cdot 10^{-11}$ cm. Neither is small enough to signal the onset of new physics. The smallest length constructed from constants of nature (the velocity of light $c$ , Planck’s constant $\hbar$ , and Newton’s constant $G_{\rm N}$ ) is the Planck length $l_{\rm P}=(\hbar G_{\rm N}/c^{3})^{1/2}=1.6\cdot 10^{-33}$ cm. In regions of size $\sim l_{\rm P}$ , quantum fluctuations of the metric are expected to become significant, and the usual relations between cause and effect may not apply. However, $l_{\rm P}$ is most likely to be unrelated to the electron size. In addition, we are in the dark about the explicit form of violation of causality in the interior of the electron; any argument of this kind appears highly speculative.
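The three lengths quoted above can be reproduced from their defining formulas. The following is a minimal sketch in Gaussian (CGS) units; the rounded constant values and the function name `characteristic_lengths` are assumptions introduced here for illustration and do not come from the paper.

```python
import math

# Rounded CGS (Gaussian) values of the constants; illustrative inputs only.
E_CHARGE = 4.803e-10   # elementary charge, esu
M_E = 9.109e-28        # electron mass, g
C = 2.998e10           # speed of light, cm/s
HBAR = 1.0546e-27      # reduced Planck constant, erg s
G_N = 6.674e-8         # Newton's constant, cm^3 g^-1 s^-2


def characteristic_lengths():
    """Return (r0, lambda_e, l_P) in centimetres."""
    r0 = E_CHARGE**2 / (M_E * C**2)          # classical electron radius e^2/mc^2
    lambda_e = HBAR / (M_E * C)              # Compton wavelength hbar/mc
    l_planck = math.sqrt(HBAR * G_N / C**3)  # Planck length (hbar G_N/c^3)^(1/2)
    return r0, lambda_e, l_planck


r0, lambda_e, l_planck = characteristic_lengths()
print(f"r0       = {r0:.2e} cm")
print(f"lambda_e = {lambda_e:.2e} cm")
print(f"l_P      = {l_planck:.2e} cm")
```

With these inputs the Planck length comes out some twenty orders of magnitude below both electron scales, which is the separation the argument relies on.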

As for the deformable electron models, one has to resort to thermodynamical variables (pressure, temperature, etc.) which form the equation of state, but are foreign to the laws of microscopic dynamics. Hence, this approach is almost unfailingly accompanied by an ad hoc phenomenology, and furthermore, a tolerable equation of state is uncertain.

It remains to see whether $p$ -branes, flexible extended objects with $p$ spatial dimensions, can be adapted for use as a pertinent classical model. The habitat of $p$ -branes is said to be sub-Planckian regions, alien to classical physics. However, this type of extended objects is of great theoretical interest, and must be mentioned if only for completeness of our discussion. The points of a $p$ -brane in ambient spacetime are given by $X^{\mu}(u^{0},u^{1},\ldots,u^{p})$ , and the action of a free $p$ -brane is proportional to the $(p+1)$ -dimensional world volume swept out by this $p$ -brane,

$$S=-T_{p}\int d^{p+1}u\,\sqrt{-h}\,. \tag{1}$$

Here, $T_{p}$ is a constant necessary for rendering the action dimensionless, $h_{ab}=\partial_{a}X^{\mu}\,\partial_{b}X_{\mu}$ , $a,b=0,1,\ldots,p$ , is a metric on the world volume induced by the Lorentz metric of the ambient spacetime, and $h=\det\left(h_{ab}\right)$ . The dynamics of $p$ -branes dispenses with the need for arbitrary, unjustifiable phenomenological assumptions; it is uniquely determined by the requirement that the action be the simplest action invariant under reparametrizations

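$$u^{a}=f^{a}(\bar{u}),\quad h_{ab}(u)=\frac{\partial\bar{u}^{c}}{\partial u^{a}}\,\frac{\partial\bar{u}^{d}}{\partial u^{b}}\,\bar{h}_{cd}(\bar{u}), \tag{2}$$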

where $f^{a}$ are arbitrary smooth functions. The action (1) meets this requirement.

In four-dimensional spacetime, there are two species of $p$ -branes: 1-branes (strings) and 2-branes (membranes). Both are systems with infinitely many degrees of freedom. Their dynamics share many features of field theory. The study of these objects may be combined with that of pure field systems defined on curved $(p+1)$ -dimensional manifolds.

We will see in Sec. 5 that classical strings exhibit a specific form of self-interaction: a free charged closed string is capable of spontaneous splitting into two such strings. This phenomenon might be naturally interpreted as a manifestation of self-interaction of these extended classical objects.

Returning to the history, Frenkel [61] was the first to argue that electromagnetism may be accounted for by itself, without resort to Poincaré cohesive forces. He regarded the electron as a point in the precise geometric sense. A point particle can be envisioned as a sphere of radius $r$ in the limit $r\to 0$ . Electrostatic repulsive forces are applied to distinct points of the sphere, and, therefore, each part of the sphere tends to move away from other parts. However, all the repulsive forces are brought into a single point and cancel as $r\to 0$ . Therefore, a point charged particle is free from the explosion tendency, and the stable existence of such objects has no need of the cohesive force conjecture.

Inspired by Frenkel’s idea, Dirac [52] gave it an adequate mathematical formulation through the delta-function. The idea of a point source of a field was a useful guide in the development of quantum field theory, and led to the present paradigm of local interactions of quantized fields. However, the problem of infinite self-energy was the price to pay for the conceptual simplicity and mathematical elegance.

Another impact of this idea is the necessity to sacrifice the rendition of self-interaction. Taking the view of the electron as a structureless point particle, we have to abandon all attempts to conceive of the putative ‘‘self-force’’ which would combine infinitesimal repulsive forces contributed by different elements of the electron, and exclude this term from the pedagogical usage and physics folklore. Since we do not have at our disposal a pictorial rendition of self-interaction, we are forced to content ourselves with the study of noticeable manifestations of self-interaction in the system ‘‘a charged point particle plus electromagnetic field’’.

Turning to pure field systems, we find that the notion of self-interaction is far from clear. The line of demarcation between interacting and free systems is often fuzzy. The behavior of any free field is believed to be governed by a linear equation with constant coefficients. A simple example is given by a real scalar field $\phi$ obeying the Klein–Gordon equation

$$\left(\Box+\mu^{2}\right)\phi=0, \tag{3}$$

which is derived from the Lagrangian quadratic in $\phi$ ,

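$${\cal L}=\frac{1}{2}\left(\partial_{\mu}\phi\,\partial^{\mu}\phi\right)-\frac{\mu^{2}}{2}\,\phi^{2}\,. \tag{4}$$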

However, the linearity is sometimes a matter of convention, which can be eliminated as the need arises. To illustrate, we refer to the equation of motion for points of a string that becomes either linear or nonlinear according to which gauge condition is adopted.

A generic solution to Eq. (3) tells us that $\phi$ executes simple harmonic oscillations at every point of space. On the other hand, if the Lagrangian involves powers of $\phi$ higher than quadratic, such as

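$${\cal L}=\frac{1}{2}\left(\partial_{\alpha}\phi\,\partial^{\alpha}\phi\right)-\frac{\mu^{2}}{2}\,\phi^{2}-\frac{\lambda^{2}}{4}\,\phi^{4}, \tag{5}$$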

then the system is generally taken to be self-interacting because the Euler–Lagrange equations are nonlinear. The system governed by the Lagrangian (5) executes anharmonic oscillations. This behavior is qualitatively the same as that of the free system. The only difference is that the period of harmonic oscillations is independent of their amplitudes, while the period of anharmonic oscillations is amplitude-dependent.
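The amplitude dependence of the anharmonic period can be illustrated numerically. The sketch below makes a simplifying assumption that is not in the text: it keeps only the spatially homogeneous mode $\phi(t)$ of a field with mass term $\frac{1}{2}\mu^{2}\phi^{2}$ and quartic term $\frac{1}{4}\lambda^{2}\phi^{4}$, so that $\ddot{\phi}+\mu^{2}\phi+\lambda^{2}\phi^{3}=0$. Energy conservation and the substitution $\phi=A\sin\theta$ then give the period as $T=4\int_{0}^{\pi/2}d\theta\,[\mu^{2}+\frac{1}{2}\lambda^{2}A^{2}(1+\sin^{2}\theta)]^{-1/2}$. The function name `period` and the parameter values are hypothetical.

```python
import math


def period(amplitude, mu=1.0, lam=1.0, n=2000):
    """Period of the homogeneous mode phi(t) with potential
    V = mu^2 phi^2 / 2 + lam^2 phi^4 / 4 and turning point phi = amplitude.

    Uses T = 4 * integral_0^{pi/2} dtheta /
             sqrt(mu^2 + (lam^2 A^2 / 2) * (1 + sin^2 theta)),
    evaluated with the midpoint rule (the integrand is smooth).
    """
    h = (math.pi / 2.0) / n
    total = 0.0
    for k in range(n):
        theta = (k + 0.5) * h
        total += 1.0 / math.sqrt(
            mu**2 + 0.5 * lam**2 * amplitude**2 * (1.0 + math.sin(theta) ** 2)
        )
    return 4.0 * h * total


# Harmonic case (lam = 0): the period is 2*pi/mu whatever the amplitude.
print(period(0.1, lam=0.0), period(2.0, lam=0.0))
# Anharmonic case: the period shrinks as the amplitude grows.
print(period(0.1), period(1.0), period(2.0))
```

For $\lambda=0$ the integrand is constant and the harmonic value $2\pi/\mu$ is recovered for any amplitude.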

The next complication concerning this notion follows from the fact that a system with a quadratic Lagrangian can be converted to an ostensibly self-interacting system by a nonlinear field transformation,

$$\phi=\Phi+\Phi^{2}F(\Phi)\,. \tag{6}$$

Such transformations are very important in quantum field theory because they provide us with a subclass of nonrenormalizable field theories physically equivalent to renormalizable ones. The main point, unveiled in [42], [56], and [23], is that two quantum field theories related by transformation (6) have the same $S$ matrix. This statement is known in the literature as the ‘‘equivalence theorem’’. A simple proof of this theorem, proposed in [30], shows that the supposedly nonrenormalizable part of the resulting theory is actually a kind of gauge fixing, attributable to the cohomologically trivial sector of the theory.

Note also the absence of a clear-cut distinction between the notions of ‘‘self-interaction’’ and ‘‘interaction between two fields’’, as exemplified by the quartic term of a complex field $\phi=A+iB$ whose role can be understood as a single self-interaction term $\frac{1}{4}\,\lambda^{2}\left({\phi}^{\ast}{\phi}\right)^{2}$ , or, alternatively, as the sum of two self-interaction terms $\frac{1}{4}\,\lambda^{2}\left(A^{4}+B^{4}\right)$ and the term $\frac{1}{2}\,\lambda^{2}A^{2}B^{2}$ which contains a mixed contribution of two real fields $A$ and $B$ and corresponds to their coupling. Until the early 1970s the subnuclear zoo was divided into two classes: ‘‘matter’’, represented by fermions, and ‘‘fields’’, represented by bosons, carriers of the fundamental forces of nature. This classification might be substantiated by the statement that, in the classical limit, bosons are susceptible to the Bose–Einstein condensation, and hence the behavior of their collection bears a general resemblance to that of classical fields, while fermions, which follow the Pauli blocking principle, share many traits with classical particles. The advent of supersymmetry produced a dramatic change in that order. A supersymmetric system accommodates field degrees of freedom of different kinds, as well as forces between them, in a single self-interacting entity.

1.2 Manifestations of self-interaction

A central idea of this paper is that self-interaction of a classical system shows itself in two significant manifestations. Those will be treated under the names ‘‘topological phases’’ and ‘‘rearrangements of initial degrees of freedom appearing in the action’’, or, for short, ‘‘rearrangement’’. To explicate these notions, consider a classical system whose states are described by generalized variables $\phi_{i}$ , and whose dynamics is encoded by the action $S[\phi]$ . The behavior of the system is governed by the Euler–Lagrange equations resulting from the principle of least action,

$$E_{i}(\phi)=\frac{\delta S}{\delta\phi_{i}}=0\,. \tag{7}$$

Suppose we are aware of joint solutions to the entire set of these equations, and among them there are physically relevant solutions, namely such that the energy of the system is finite. If two or more solutions describe configurations of distinct topological structures, then we are entitled to claim that the space of states has different phases, and relate the existence of the topological phases to self-interaction of the system. Section 2 outlines some simple pure field systems exhibiting topological phases.

While on the subject of systems which involve point particles and fields mediating interactions between these particles, we should recognize that an attempt to find a joint solution to the set of the Euler–Lagrange equations (7) will in most cases end in a fiasco. Section 3.1 demonstrates that the Maxwell–Lorentz electrodynamics, formulated in terms of mechanical variables $z^{\mu}(s)$ describing world lines of bare charged point particles and the electromagnetic vector potentials $A^{\mu}(x)$ , experiences a blowup, which can be construed, in the spirit of quantum field theory, as a kind of ultraviolet divergence. It will transpire that an interplay between degrees of freedom of bare particles and electromagnetic field rearranges the system giving rise to new entities, dressed particles and radiation, and that the rearranged dynamics is mathematically well-defined and physically reasonable.

Both manifestations of self-interaction can be combined in systems which contain point particles and non-Abelian gauge fields. For example, in the Yang–Mills–Wong theory, which describes $K$ particles carrying non-Abelian charges and interacting with the SU $(N)$ Yang–Mills field, $N\geq K+1$ , we will observe two phases. We will learn from Sec. 4.2 that these phases are invariant under different gauge groups, SU $(N)$ and SL $(N,{\mathbb{R}})$ , which are respectively the compact and a noncompact real forms of the complex group SL $(N,{\mathbb{C}})$ , and that the system is rearranged differently in different phases. In contrast, self-interaction of some systems may reveal itself by only one of the two manifestations. Classical gravitating systems are a good case in point. They will be shown in Sec. 6 to be capable of developing an infinitely large number of phases but are unaffected by the rearrangement.

Where do these manifestations of self-interaction come from? The general reason for their occurrence is that the system is unstable. To be more specific, once $\phi_{i}$ is a joint solution to the set of the Euler–Lagrange equations (7), the condition

$$\frac{\delta^{2}S}{\delta\phi_{i}\,\delta\phi_{j}}<0 \tag{8}$$

holds for some $i$ and $j$ . Unstable modes tend to assemble into new stable modes. Of course, for systems suffering from the ultraviolet disease, the fact of their instability cannot be established directly through the use of (8), and circumstantial evidence is required.

It is interesting to compare this mechanism for displaying self-interaction of classical systems with what happens in the quantum picture. The behavior of a quantum system can be described by the Feynman path integral. Every path with appropriate end points contributes to the path integral. The principle of least action, Eq. (7), implying that the contribution of an extremal path dominates the path integral, is irrelevant to the quantum regime of evolution. Therefore, it is beyond reason to take the condition of instability (8) as a prerequisite for rendering quantum self-interaction manifest. In general, it would be wrong to place quantum systems into one of two categories, stable and unstable, because the notions of stability and instability make no sense outside the scope of the principle of least action. Note also that the quantum and classical dressings are unrelated, even though they bear similar names. We remind the reader that vacuum polarization is of decisive importance for the quantum dressing, and that a cloud of virtual particle-antiparticle pairs is dragged along by a dressed quantum particle. These phenomena are absent from the classical picture, where the creation and annihilation of particle-antiparticle pairs are strictly forbidden. Perhaps the most outstanding distinction between the classical and quantum dynamics is that the latter is reversible, as exemplified by an electron which emits and absorbs photons with comparable probability amplitudes, whereas the former becomes irreversible after the rearrangement.

And yet the quantum picture shares a common trait with its classical relative, that of having vacuum expectation values of quantum variables $\langle 0|\phi|0\rangle$ governed by the action principle [143]. Furthermore, the condition of instability for quantum systems described in terms of $\langle 0|\phi|0\rangle$ , Eq. (8), falls into the classical pattern [69].

The term ‘‘rearrangement’’ was coined by Umezawa [150] who looked at spontaneous symmetry breaking in the quantum context. The mechanism for rearranging classical gauge field systems was then studied in a series of papers [92], [94], [95], [96], [98], [99], [100], [101], and [43], which rely heavily on findings by Teitelboim [147].

1.3 Plan of the review

This paper is written in a pedagogical manner. We restrict our attention to the simplest examples of classical self-interacting systems. For those who wish to learn more about particular issues we provide links to original articles and other useful sources. No attempt has been made to prepare a complete bibliography because this is a formidable task.

Our prime interest here is with conceptual aspects of the subject matter, rather than mathematical rigor, generality, and phenomenological utility.

The structure of the review is clear from the table of contents. We briefly run through pure field systems exhibiting topological phases in Secs. 2 and 4.1. A thorough analysis of such systems, in the light of spontaneous breakdown of symmetry, can be found in the existing literature.

We do not discuss self-interacting charged particles in curved manifolds [50], [78], [19]. Such a discussion would require rather sophisticated techniques, even though it has met with only limited success in gaining new insight into the self-interaction problem.

We do not mention many current studies, in particular, those related to gravitational self-interaction, which fall outside the purpose of this review for nonexperts.

We use units in which the speed of light and Planck’s constant are set to unity throughout. In Secs. 2–5, in which our concern is with the picture in Minkowski spacetime ${\mathbb{R}}_{1,3}$ , we adopt the metric $\eta_{\mu\nu}={\rm diag}\left(1,-1,-1,-1\right)$ . When turning to pseudo-Riemannian manifolds in Sec. 6, we use the metric $g_{\mu\nu}(x)$ with the same signature. To conform with the presentations of original research papers, we use Gaussian and Heaviside units interchangeably.

2 TOPOLOGICAL PHASES

To trace the advent of topological phases, we can conveniently discuss a simple prototype of the Goldstone model. Consider a single real scalar field $\phi(t,x)$ in two dimensions whose dynamics is encoded by

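$${\cal L}=\frac{1}{2}\left(\partial_{t}\phi\right)^{2}-\frac{1}{2}\left(\partial_{x}\phi\right)^{2}+\frac{\mu^{2}}{2}\,\phi^{2}-\frac{\lambda^{2}}{4}\,\phi^{4}-U_{0}\,. \tag{9}$$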

The constant

$$U_{0}=\frac{1}{4}\,\mu^{2}\phi_{0}^{2}, \tag{10}$$

in which

$$\phi_{0}=\frac{\mu}{\lambda}, \tag{11}$$

is suitable for writing the Lagrangian in a succinct form:

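$${\cal L}=\frac{1}{2}\left(\partial_{t}\phi\right)^{2}-\frac{1}{2}\left(\partial_{x}\phi\right)^{2}-U(\phi), \tag{12}$$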

where

$$U(\phi)=\frac{\lambda^{2}}{4}\left(\phi^{2}-\phi_{0}^{2}\right)^{2}. \tag{13}$$

The Lagrangian (12) is invariant under reflection $\phi\to-\phi$ . However, the state $\phi=0$ realizing this symmetry is found to be unstable as soon as the principle of least action comes into effect. Every minuscule perturbation removes the system from this state. To put it differently, this state personifies a tachyon which, in the weak coupling limit $\lambda\to 0$ , is governed by

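$$\left(\frac{\partial^{2}}{\partial t^{2}}-\frac{\partial^{2}}{\partial x^{2}}-\mu^{2}\right)\phi=0\,. \tag{14}$$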

Since $\phi=0$ corresponds to unstable equilibrium, the ground state of the system, associated with the absolute minimum of the energy functional

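$$E=\int dx\left[\frac{1}{2}\left(\partial_{t}\phi\right)^{2}+\frac{1}{2}\left(\partial_{x}\phi\right)^{2}+U(\phi)\right], \tag{15}$$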

is the state afforded by either of two solutions

$$\phi(t,x)=\pm\phi_{0}\,. \tag{16}$$

To see this, we note that the derivative terms in (15) are minimized when $\phi$ is a constant. This constant is specified by the minimum of ${U}(\phi)$ . For both of the solutions (16), the energy (15) is vanishing. Assume that $\phi=+\phi_{0}$ furnishes the ground state. Let $\chi$ be a small perturbation about $\phi_{0}$ ,

$$\phi=\phi_{0}+\chi\,. \tag{17}$$

Substituting (17) in (12), we observe that the resulting Lagrangian

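$${\cal L}=\frac{1}{2}\left(\partial_{t}\chi\right)^{2}-\frac{1}{2}\left(\partial_{x}\chi\right)^{2}-\mu^{2}\chi^{2}-\lambda\mu\,\chi^{3}-\frac{\lambda^{2}}{4}\,\chi^{4} \tag{18}$$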

exhibits an oscillatory mode with mass

$$m=\sqrt{2}\,\mu, \tag{19}$$

instead of the tachyon mode. A similar mode appears in the phase associated with the solution $\phi=-\phi_{0}$ .
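The shift about the ground state can be checked numerically. The sketch below assumes the double-well potential $U(\phi)=\frac{1}{4}\lambda^{2}(\phi^{2}-\phi_{0}^{2})^{2}$ with $\phi_{0}=\mu/\lambda$, as given in the paper's equation list. It compares $U(\phi_{0}+\chi)$ with the polynomial $\mu^{2}\chi^{2}+\lambda\mu\chi^{3}+\frac{1}{4}\lambda^{2}\chi^{4}$, and estimates the mass of the oscillatory mode from the curvature $U''(\phi_{0})=m^{2}$. The function names and parameter values are hypothetical.

```python
import math


def potential(phi, mu, lam):
    """Double-well potential U(phi) = lam^2/4 * (phi^2 - phi0^2)^2, phi0 = mu/lam."""
    phi0 = mu / lam
    return 0.25 * lam**2 * (phi**2 - phi0**2) ** 2


def potential_shifted(chi, mu, lam):
    """The same potential written in terms of chi = phi - phi0."""
    return mu**2 * chi**2 + lam * mu * chi**3 + 0.25 * lam**2 * chi**4


def mass_from_curvature(mu, lam, step=1e-3):
    """Estimate m from U''(phi0) = m^2 with a central second difference."""
    phi0 = mu / lam
    curvature = (
        potential(phi0 + step, mu, lam)
        - 2.0 * potential(phi0, mu, lam)
        + potential(phi0 - step, mu, lam)
    ) / step**2
    return math.sqrt(curvature)


mu, lam = 1.3, 0.7  # arbitrary illustrative values
for chi in (-0.5, 0.1, 0.8):
    print(chi, potential(mu / lam + chi, mu, lam), potential_shifted(chi, mu, lam))
print(mass_from_curvature(mu, lam), math.sqrt(2.0) * mu)
```

The curvature at $\phi_{0}$ is $2\mu^{2}$, so the mode has $m=\sqrt{2}\,\mu$, whereas the curvature at $\phi=0$ is $-\mu^{2}$, which is the tachyonic sign.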

Therefore, the system executes almost periodic motions about either of two stable equilibrium points (16). The price for the stability is that the Lagrangian (18) is not invariant under reflection $\chi\to-\chi$ . This phenomenon, known as spontaneous symmetry breaking, is inherently classical because the criterion for discriminating between stable and unstable states stipulates that the principle of least action holds.

There are two further topological phases corresponding to the so-called ‘‘kink’’ and ‘‘antikink’’ static solutions

$$\phi(x)=\pm\phi_{0}\tanh\left[\frac{\mu\,(x-x_{0})}{\sqrt{2}}\right], \tag{20}$$

which asymptotically approach either $\phi_{0}$ or $-\phi_{0}$ as $x\to\pm\infty$ . The energy density of these configurations is localized near $x_{0}$ :

[TABLE]

Accordingly, the total kink energy is finite,

[TABLE]

The solutions (20) realize a local minimum of the energy functional (15) in the sense that small perturbations about the kink (or antikink) are oscillatory modes associated with a bound state and scattering states, and a translation mode. For a detailed discussion of the derivation and properties of these solutions see [44], [125], [126], [107].
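The finiteness of the kink energy is easy to verify numerically. The following sketch assumes the same illustrative potential $U(\phi)=\frac{\lambda}{4}(\phi^{2}-\phi_{0}^{2})^{2}$, $\phi_{0}=\mu/\sqrt{\lambda}$, for which the static kink is $\phi_{0}\tanh[\mu(x-x_{0})/\sqrt{2}]$ and the total energy is $\frac{2\sqrt{2}}{3}\,\mu^{3}/\lambda$; these closed forms hold for the assumed normalization and need not coincide literally with (20)–(22).

```python
import math

MU, LAM = 1.0, 0.5                  # illustrative parameter values
PHI0 = MU / math.sqrt(LAM)
K = MU / math.sqrt(2.0)             # inverse width of the assumed kink profile

def kink(x, x0=0.0):
    """Assumed static kink profile centered at x0."""
    return PHI0 * math.tanh(K * (x - x0))

def d_kink(x, x0=0.0):
    """Spatial derivative of the kink profile."""
    return PHI0 * K / math.cosh(K * (x - x0)) ** 2

def potential(phi):
    return 0.25 * LAM * (phi**2 - PHI0**2) ** 2

def energy_density(x):
    """Gradient energy plus potential energy of the static configuration."""
    return 0.5 * d_kink(x) ** 2 + potential(kink(x))

def kink_energy(half_width=30.0, n=60000):
    """Trapezoidal integral of the energy density over [-half_width, half_width]."""
    dx = 2.0 * half_width / n
    total = 0.5 * (energy_density(-half_width) + energy_density(half_width))
    for i in range(1, n):
        total += energy_density(-half_width + i * dx)
    return total * dx

energy = kink_energy()
expected = 2.0 * math.sqrt(2.0) / 3.0 * MU**3 / LAM
print(energy, expected)
```

The energy density is concentrated within a few widths $1/K$ of the center, so enlarging the integration box does not change the result.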

The field configuration space of all finite-energy solutions can be divided into four sectors, labelled by two indices

[TABLE]

These sectors are topologically unconnected. The trivial solutions ${\pm\phi_{0}}$ are characterized by $\aleph_{\rm i}=\aleph_{\rm f}=1$ and $\aleph_{\rm i}=\aleph_{\rm f}=-1$ , respectively, the kink is marked by $\aleph_{\rm i}=-1$ , $\aleph_{\rm f}=1$ , and the antikink by $\aleph_{\rm i}=1$ , $\aleph_{\rm f}=-1$ . Fields of one sector cannot be distorted continuously into another. To switch between such sectors, it is necessary to leap over the potential barrier of height $\sim U_{0}L$ , where $U_{0}$ is given by (10), and the system is assumed to be in a large box whose length $L$ tends to $\infty$ . Since time evolution is an example of continuous distortion, a field configuration from any one sector stays within that sector as time passes.

Another way of identifying the topological phases is to use the topological charge

[TABLE]

The corresponding topological current

[TABLE]

where $\epsilon^{\mu\nu}$ is the two-dimensional Levi-Civita symbol, obeys the local conservation law

[TABLE]

for any $\phi$ . Therefore, the spatial integral of ${\cal J}^{0}$ is a conserved quantity identical to ${\cal Q}$ ,

[TABLE]

The kink and antikink phases are endowed with ${\cal Q}=2$ and ${\cal Q}=-2$ , respectively, and both trivial solutions (16), realizing the degenerate ground state, have ${\cal Q}=0$ .
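These values can be reproduced with a short sketch. It assumes the normalization ${\cal Q}=[\phi(+\infty)-\phi(-\infty)]/\phi_{0}$, which is consistent with the values quoted above but is an assumption about the precise form of (24); the $\tanh$ profiles and the box size are illustrative.

```python
import math

PHI0 = 1.0   # vacuum value of the field in arbitrary units

# Representative finite-energy configurations, one per sector.
PROFILES = {
    "vacuum_plus": lambda x: PHI0,
    "vacuum_minus": lambda x: -PHI0,
    "kink": lambda x: PHI0 * math.tanh(x),
    "antikink": lambda x: -PHI0 * math.tanh(x),
}

def topological_charge(profile, half_width=40.0):
    """Assumed normalization: Q = [phi(+inf) - phi(-inf)] / phi0, taken in a large box."""
    return (profile(half_width) - profile(-half_width)) / PHI0

charges = {name: round(topological_charge(f)) for name, f in PROFILES.items()}
print(charges)
```

Because the charge depends only on the boundary values, any continuous deformation that keeps the energy finite leaves it unchanged.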

Surprisingly, the four-dimensional analog of the system (9) does not display nontrivial topological phases associated with kink-like solutions, even though the system is unstable, and hence experiences spontaneous breakdown of symmetry. A general statement [48] is that localized static solutions of the system governed by

[TABLE]

are unstable for nonnegative smooth functions $U(\phi)$ with $U^{\prime}(0)=0$ . This argument is extendable to higher dimensions. Furthermore, exponential instability of localized static solutions of these systems was established in [86].

In going from the above systems with a real scalar field $\phi$ to systems with a complex scalar field $\Phi$ , the discrete symmetry $\phi\to-\phi$ is replaced by a continuous symmetry $\Phi\to e^{i\theta}\Phi$ . Accordingly, the topological charge ${\cal Q}$ is replaced by the so-called winding number $n$ , which distinguishes different homotopy classes of maps from circles into circles.

Kink-like solutions are lacking in some of these systems. To illustrate, we refer to the Goldstone model,

[TABLE]

where $\Phi$ is a complex scalar field. Although the sign of the term $\frac{1}{2}\,\mu^{2}\left({\Phi}^{\ast}\Phi\right)$ is such that $\Phi$ behaves as a tachyon, the model is devoid of kink-like solutions.

In contrast, solutions of this kind are peculiar to the Higgs model [77]

[TABLE]

Here $F_{\mu\nu}=\partial_{\mu}A_{\nu}-\partial_{\nu}A_{\mu}$ , $D_{\mu}\Phi=(\partial_{\mu}-ieA_{\mu})\,\Phi$ , that is, (30) describes the interaction of a complex scalar field $\Phi$ with the electromagnetic field. Vortex-line solutions of this model, similar to the vortex line in a superconductor, were shown [113] to form topologically nontrivial phases labelled by winding number $n$ . The reader interested in this topic would do best to consult the books [126], [141], [107].

3 SELF-INTERACTION IN ELECTRODYNAMICS

3.1 The Maxwell–Lorentz theory

The system ‘‘a charged particle plus electromagnetic field’’ is described by the action

[TABLE]

where the Poincaré–Planck term

[TABLE]

governs the particle; the Schwarzschild term

[TABLE]

is responsible for the interaction of the particle and the field; and the Larmor term

[TABLE]

encodes the field dynamics. The parameter $\tau$ is associated with evolution of the particle. Derivatives with respect to $\tau$ are denoted by dots. $m_{0}$ stands for mechanical mass of the particle.

Does the extremalization of this action make the system unstable? A direct way of tackling the question would be to obtain a joint solution to the Euler–Lagrange equations

[TABLE]

and

[TABLE]

(where $s$ denotes the proper time, $v^{\mu}={dz^{\mu}}/{ds}$ is the four-velocity, and $a^{\mu}={dv^{\mu}}/{ds}$ is the four-acceleration), supplemented with the Bianchi identity

[TABLE]

and examine the variation of Eqs. (35) and (36) about this solution. To accomplish this plan, one should first find the joint solution, that is, taking the retarded condition and assuming that $z^{\mu}(s)$ is an arbitrary smooth timelike curve, solve Eqs. (37) and (35); substitute the retarded solution $F^{\lambda\mu}$ into Eq. (36); and solve the resulting equation. But applying the solution $F^{\lambda\mu}$ to (36), we obtain a divergent expression. The occurrence of this divergency can be thought of as if the field degrees of freedom are induced by the extremality of the action to attain a singularity on the world line, and the particle tends to blow up due to infinite concentration of the energy of its own field. But if it is granted that the divergency is eliminated, through the renormalization procedure, the blowup appears to be suppressed.

Therefore, care is required in analyzing the self-interaction problem in field theories with delta-function sources. An appropriate starting point for this analysis is a Noether identity [114], which, as applied to the action (31)–(34), takes the form:

[TABLE]

Here, $T^{\mu\nu}$ is the symmetric stress-energy tensor of this system,

[TABLE]

and ${\cal E}^{\lambda\mu\nu}$ , ${\cal E}_{\mu}$ , $\varepsilon^{\lambda}$ are, respectively, the left-hand sides of equations (37), (35), (36). The derivation of Eqs. (38)–(41) has been detailed, e. g., in [99].

Note that the Noether identity (38) does not stipulate that the action is extremal. The responsibility for the fulfilment of Eq. (38) only rests with translation invariance.

Were it not for the divergency, the equation

[TABLE]

would imply ${\cal E}^{\lambda\mu\nu}=0$ , ${\cal E}_{\mu}=0$ , and $\varepsilon^{\lambda}=0$ , that is, the local conservation law for the stress-energy tensor is formally equivalent to the equation of motion for a bare particle (36) in which a solution to the field equations (37) and (35) is used. However, Eq. (42) provides a more penetrating insight into the rearrangement of the Maxwell–Lorentz theory because $\Theta^{\mu\nu}$ can be segregated into terms with integrable and nonintegrable singularities.

3.1.1 Radiation

Let a point charge be moving along a smooth timelike world line $z^{\mu}(s)$ . The retarded field $F^{\mu\nu}$ generated by this charge can be written [52] as

[TABLE]

Here,

[TABLE]

is a null vector drawn from a point ${z}^{\mu}(s_{\rm ret})$ on the world line, where the electromagnetic signal was emitted, to the point $x^{\mu}$ , in which the signal was received; the four-velocity $v^{\mu}$ and four-acceleration $a^{\mu}$ refer to the retarded instant $s_{\rm ret}$ . Further retarded covariant variables: invariant distance $\rho$ between $x^{\mu}$ and ${z}^{\mu}(s_{\rm ret})$ , which is actually the distance measured in the instantaneously comoving Lorentz frame at $s=s_{\rm ret}$ ,

[TABLE]

a retarded scalar

[TABLE]

and a null vector $c^{\mu}$ aligned with $R^{\mu}$ ,

[TABLE]

are convenient to use for the present discussion. $c^{\mu}$ can be represented as the sum of two mutually orthogonal normalized vectors,

[TABLE]

where $u^{\mu}$ is an imaginary-unit vector directed from ${z}^{\mu}(s_{\rm ret})$ to $x^{\mu}$ ,

[TABLE]
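The algebra of these retarded variables can be illustrated numerically. The sketch below assumes the standard definitions $\rho=R\cdot v$, $c^{\mu}=R^{\mu}/\rho$ and $u^{\mu}=c^{\mu}-v^{\mu}$ with metric signature $(+,-,-,-)$; the velocity and the observation point are arbitrary illustrative choices.

```python
import math

def minkowski_dot(a, b):
    """Scalar product of four-vectors with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

beta = 0.6                                   # illustrative speed of the charge
gamma = 1.0 / math.sqrt(1.0 - beta**2)
v = (gamma, gamma * beta, 0.0, 0.0)          # four-velocity at the retarded point

# Observation point on the future light cone of the retarded point,
# which is placed at the origin, so that R = x - z(s_ret) is null.
r = 2.0
n = (0.6, 0.0, 0.8)                          # unit three-vector
R = (r, r * n[0], r * n[1], r * n[2])

rho = minkowski_dot(R, v)                    # invariant retarded distance (assumed form)
c = tuple(comp / rho for comp in R)          # null vector aligned with R
u = tuple(ci - vi for ci, vi in zip(c, v))   # spacelike vector with u^2 = -1

print(rho, minkowski_dot(c, c), minkowski_dot(u, v), minkowski_dot(u, u))
```

In the rest frame of the charge, $\beta=0$, the same code gives $\rho=r$, the ordinary spatial distance.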

With these definitions, Eqs. (43) and (44) become

[TABLE]

It is common to decompose this field into two parts, $F=F_{\rm\hskip 0.85358ptI}+F_{\rm\hskip 0.85358ptII}$ , where

[TABLE]

and regard $F_{\rm\hskip 0.85358ptI}$ as a ‘‘generalized Coulomb field’’, and $F_{\rm\hskip 0.85358ptII}$ as the ‘‘radiation field’’. However, this separation is of no utility: whatever the motion of the charge, there is a Lorentz frame, special for each point $x^{\mu}$ , in which $F_{\rm\hskip 0.85358ptII}$ is completely eliminated, and only $F_{\rm\hskip 0.85358ptI}$ persists. This is clear from the mere fact that ${\cal P}=\frac{1}{2}F_{\mu\nu}{}^{\ast}\!F^{\mu\nu}=0$ , ${\cal S}=\frac{1}{2}F_{\mu\nu}F^{\mu\nu}=-e^{2}/\rho^{4}$ for the field $F^{\mu\nu}$ defined by Eqs. (51) and (52).
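The values of the two invariants can be confirmed by direct computation. The sketch below assumes the familiar form of the Liénard–Wiechert field, $F^{\mu\nu}=(e/\rho^{2})\,(c^{\mu}U^{\nu}-c^{\nu}U^{\mu})$ with $U^{\mu}=-\lambda v^{\mu}+\rho\,a^{\mu}$ and $\lambda=R\cdot a-1$, which we take to be what Eqs. (51) and (52) express; the kinematical data are arbitrary and the helper names are hypothetical.

```python
import math
from itertools import permutations

METRIC = (1.0, -1.0, -1.0, -1.0)   # diagonal of the metric, signature (+, -, -, -)

def dot(a, b):
    return sum(METRIC[m] * a[m] * b[m] for m in range(4))

def permutation_sign(perm):
    """Sign of a permutation of (0, 1, 2, 3), used as the Levi-Civita symbol."""
    sign = 1
    for i in range(4):
        for j in range(i + 1, 4):
            if perm[i] > perm[j]:
                sign = -sign
    return sign

def field_invariants(F_up):
    """Return S = (1/2) F_{mn} F^{mn} and P = (1/2) F_{mn} *F^{mn}."""
    F_low = [[METRIC[m] * METRIC[n] * F_up[m][n] for n in range(4)] for m in range(4)]
    S = 0.5 * sum(F_low[m][n] * F_up[m][n] for m in range(4) for n in range(4))
    dual_up = [[0.0] * 4 for _ in range(4)]
    for perm in permutations(range(4)):
        m, n, r, s = perm
        dual_up[m][n] += 0.5 * permutation_sign(perm) * F_low[r][s]
    P = 0.5 * sum(F_low[m][n] * dual_up[m][n] for m in range(4) for n in range(4))
    return S, P

e = 1.0
beta = 0.6
gamma = 1.0 / math.sqrt(1.0 - beta**2)
v = (gamma, gamma * beta, 0.0, 0.0)                  # four-velocity
a = (0.4 * gamma * beta, 0.4 * gamma, 0.3, -0.2)     # four-acceleration, a.v = 0
R = (2.0, 1.2, 0.0, 1.6)                             # null vector to the observation point

rho = dot(R, v)
c = [comp / rho for comp in R]
lam = dot(R, a) - 1.0                                # assumed retarded scalar
U = [-lam * v[m] + rho * a[m] for m in range(4)]
F_up = [[(e / rho**2) * (c[m] * U[n] - c[n] * U[m]) for n in range(4)] for m in range(4)]

S, P = field_invariants(F_up)
print(S, -e**2 / rho**4, P)
```

The result ${\cal S}=-e^{2}/\rho^{4}$ does not depend on the acceleration, which enters only through $U^{\mu}$; this is the numerical counterpart of the statement that the $\rho^{-1}$ part can be removed by a suitable choice of frame.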

To indicate explicitly the frame of reference in which $F_{\rm\hskip 0.85358ptII}=0$ , rewrite (51) as

[TABLE]

A rendition of the bivector $\varpi$ is the parallelogram of the vectors $c^{\mu}$ and $U^{\mu}$ , with the area of the parallelogram being equal to 1. The bivector $\varpi$ is invariant under the special linear group of real unimodular transformations SL $(2,{\mathbb{R}})$ which rotate and deform the initial parallelogram in the plane spanned by $c^{\mu}$ and $U^{\mu}$ , converting it to parallelograms of unit area. Therefore, $\varpi$ is independent of the directions and magnitudes of the constituent vectors; it depends only on the parallelogram’s orientation. The parallelogram can always be built from a timelike unit vector $e_{0}^{\mu}$ and a spacelike imaginary-unit vector $e_{1}^{\mu}$ perpendicular to $e_{0}^{\mu}$ , $\varpi={e}_{0}\wedge{e}_{1}$ . There are three different cases:

(a) $U^{2}>0$ ,

[TABLE]

(b) $U^{2}<0$ ,

[TABLE]

(c) $U^{2}=0$ ,

[TABLE]

In the Lorentz frame with the time axis parallel to $e^{\mu}_{0}$ , all components of $F^{\mu\nu}$ vanish, except for $F^{01}$ which behaves as $\rho^{-2}$ . Equations (56)–(58) explicitly specify a frame in which the retarded electromagnetic field generated by a single arbitrarily moving charge appears as a pure Coulomb field at each observation point. With a curved world line, this frame is noninertial.

The SL $(2,{\mathbb{R}})$ transformations can be carried out independently at any spacetime point. We are thus dealing with local transformations. The invariance of $F^{\mu\nu}$ is not pertinent to electrodynamics as a whole, and hence gives rise to no Noether identities. Rather, this is a property of the retarded solution to Maxwell’s equations $F_{\rm ret}$ . The advanced solution $F_{\rm adv}$ can also be put in the form similar to (51)–(52), that is, $F_{\rm adv}$ is decomposable, whereas combinations $\alpha F_{\rm ret}+\beta F_{\rm adv}$ are not.
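The invariance of a bivector under unimodular mixing of its constituent vectors is elementary and can be checked at a single point. In the sketch below the two four-vectors and the matrix entries are arbitrary illustrative numbers, and `sl2_mix` is a hypothetical helper name.

```python
def wedge(a, b):
    """Components of the bivector a ^ b as an antisymmetric 4x4 array."""
    return [[a[m] * b[n] - a[n] * b[m] for n in range(4)] for m in range(4)]

def sl2_mix(c, U, alpha, beta, gamma, delta):
    """Apply c -> alpha c + beta U, U -> gamma c + delta U with unit determinant."""
    if abs(alpha * delta - beta * gamma - 1.0) > 1e-12:
        raise ValueError("the transformation must be unimodular")
    c_new = [alpha * ci + beta * Ui for ci, Ui in zip(c, U)]
    U_new = [gamma * ci + delta * Ui for ci, Ui in zip(c, U)]
    return c_new, U_new

c = [1.0, 0.6, 0.0, 0.8]            # illustrative null vector
U = [1.3, -0.2, 0.5, 0.1]           # illustrative second vector
c_new, U_new = sl2_mix(c, U, alpha=2.0, beta=0.5, gamma=3.0, delta=1.25)

before = wedge(c, U)
after = wedge(c_new, U_new)
max_change = max(abs(before[m][n] - after[m][n]) for m in range(4) for n in range(4))
print(max_change)
```

Since $c^{\prime}\wedge U^{\prime}=(\alpha\delta-\beta\gamma)\,c\wedge U$, a different matrix may be chosen at every spacetime point, which is the sense in which the invariance is local.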

It is thus seen that the notion of radiation field is problematic: the segregation between parts of the retarded field scaling as $\rho^{-2}$ and $\rho^{-1}$ is disavowed by the local SL $(2,{\mathbb{R}})$ invariance of the Liénard–Wiechert solution (51)–(52). Under these circumstances one may look at the stress-energy tensor $\Theta^{\mu\nu}$ for clues. A motivation for this is that $\Theta^{\mu\nu}$ is frame-dependent. It is natural to accommodate $\Theta^{\mu\nu}$ to Lorentz frames with rectangular coordinates. Substituting (51)–(52) into (40) gives

[TABLE]

which is split into nonintegrable and integrable parts, $\Theta^{\mu\nu}=\Theta^{\mu\nu}_{\rm\hskip 0.85358ptI}+\Theta^{\mu\nu}_{\rm\hskip 0.85358ptII}$ ,

[TABLE]

One can show [147] that two local conservation laws hold outside the world line:

[TABLE]

which suggest that $\Theta^{\mu\nu}_{\rm\hskip 0.85358ptI}$ and $\Theta^{\mu\nu}_{\rm\hskip 0.85358ptII}$ are dynamically independent off the world line. Let us compare the properties of $\Theta^{\mu\nu}_{\rm\hskip 0.85358ptI}$ and $\Theta^{\mu\nu}_{\rm\hskip 0.85358ptII}$ . We begin with the latter.

$\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}$ leaves the source at the speed of light. Indeed, the surface element of the future light cone $C_{+}$ drawn from ${z}^{\mu}(s_{\rm ret})$ is $d\sigma^{\mu}=c^{\mu}\rho^{2}d\rho\,d\Omega$ . Since $c^{\mu}$ is a null vector, the flux of $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}$ through $C_{+}$ vanishes, $d\sigma_{\mu}\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}=0$ , implying that $\Theta^{\mu\nu}_{\rm\hskip 0.85358ptII}$ propagates along rays of $C_{+}$ . The energy-momentum flux associated with $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}$ varies as $\rho^{-2}$ , which means that the same amount of energy-momentum flows through spheres of different radii. It is also significant that if the motion is uniform, $a^{\mu}=0$ , then $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}=0$ .

None of these features is shared by $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ . Let $d\sigma^{\mu}$ be the surface element of the future light cone ${C}_{+}$ , then

[TABLE]

The flux of $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ through $C_{+}$ is nonzero, and hence $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ moves slower than light. One may conclude that $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}$ detaches from the source, while $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ remains bound to it. It is clear from (60) and (52) that $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ falls with distance at least as $\rho^{-3}$ . Therefore, $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ yields the flux of energy-momentum which dies out with distance. Furthermore, $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ is nonvanishing for any motion of the source. In other words, $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ represents a part of the electromagnetic energy-momentum that is dragged by the charge.

The integration of $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ over a three-dimensional surface intersecting the world line results in a divergent expression. In the language of quantum field theory, such expressions are known as ‘‘ultraviolet divergent’’. The mathematical reason for ultraviolet divergences is that the product of tempered distributions with coincident supports is ill-defined [32].

We will thereafter refer to a symmetric tensor as radiation, and denote it by $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}$ , if

[TABLE]

It is conceivable that the energy flux produced by $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}$ is directed inward towards the field source resulting in energy gain rather than energy loss. One may regard this as the absorption of radiation rather than its emission. An alternate view is that the emitted energy is negative: $\Theta^{00}_{{\rm\hskip 0.85358ptII}}=v_{\mu}\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptII}}v_{\nu}<0$ . An example can be drawn from Sect. 4.2 where the self-interaction problem in the Yang–Mills–Wong theory is analyzed. There is no universally adopted terminology that distinguishes between $\Theta^{00}_{{\rm\hskip 0.85358ptII}}>0$ and $\Theta^{00}_{{\rm\hskip 0.85358ptII}}<0$ . We normally reserve the term ‘‘radiation’’ for the case that the emitted energy is positive.

Switching from four-dimensional electrodynamics to that in $d$ dimensions, we can apply this analysis if we replace a sphere enclosing the source by a $(d-2)$ -dimensional sphere. Then condition (iii) becomes

[TABLE]

In addition, $\Theta^{\mu\nu}_{\rm\hskip 0.85358ptI}$ should fall more rapidly than $\Theta^{\mu\nu}_{\rm\hskip 0.85358ptII}$ to ensure that $\Theta^{\mu\nu}_{\rm II}$ be distinguished asymptotically from $\Theta^{\mu\nu}_{\rm I}$ ,

[TABLE]

The radiated energy-momentum is defined by

[TABLE]

where $\Sigma$ is a three-dimensional spacelike surface intersecting the world line. Since $\Theta^{\mu\nu}_{\rm II}$ involves only integrable singularities, and $\partial_{\nu}\Theta^{\mu\nu}_{\rm II}=0$ , the surface of integration $\Sigma$ in (69) may be chosen arbitrarily. It is convenient to deform $\Sigma$ to a tubular surface ${T}_{\epsilon}$ of small invariant radius $\rho=\epsilon$ enclosing the world line. The surface element on this tube is $d\sigma^{\mu}=\partial^{\mu}\!\rho\,\rho^{2}\,d\Omega\,ds=(v^{\mu}+\lambda c^{\mu})\,\epsilon^{2}\,d\Omega\,ds$ . Inserting (61) into (69) gives

[TABLE]

The solid angle integration is simple. One need only apply the evident formulas

[TABLE]

where $\stackrel{{\scriptstyle\scriptstyle v}}{{\bot}}$ is the projection operator on a hyperplane with normal $v^{\mu}$ ,

[TABLE]
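The angular averages can be verified by quadrature in the rest frame of the charge, where $u^{\mu}=(0,{\bf n})$. The sketch below assumes that the formulas in question are $\int d\Omega\,u^{\mu}=0$ and $\int d\Omega\,u^{\mu}u^{\nu}=-\frac{4\pi}{3}\stackrel{v}{\bot}{}^{\mu\nu}$ with $\stackrel{v}{\bot}{}^{\mu\nu}=\eta^{\mu\nu}-v^{\mu}v^{\nu}$, which in the rest frame reduce to $\int d\Omega\,n_{i}=0$ and $\int d\Omega\,n_{i}n_{j}=\frac{4\pi}{3}\,\delta_{ij}$; the grid sizes are arbitrary.

```python
import math

def sphere_moments(n_theta=200, n_phi=400):
    """Midpoint-rule estimates of the first and second moments of the unit
    vector n over the full solid angle."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    first = [0.0, 0.0, 0.0]
    second = [[0.0] * 3 for _ in range(3)]
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        weight = math.sin(theta) * d_theta * d_phi      # solid-angle element
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            n = (math.sin(theta) * math.cos(phi),
                 math.sin(theta) * math.sin(phi),
                 math.cos(theta))
            for a in range(3):
                first[a] += n[a] * weight
                for b in range(3):
                    second[a][b] += n[a] * n[b] * weight
    return first, second

first, second = sphere_moments()
print(first, second[0][0], second[2][2], 4.0 * math.pi / 3.0)
```

The covariant statement follows from the rest-frame result because both sides are tensors built from $v^{\mu}$ and $\eta^{\mu\nu}$ alone.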

The result of integration is

[TABLE]

For this expression to be convergent, the integrand must fall off sufficiently rapidly as $s\to-\infty$ . The pertinent asymptotic condition, formulated by Haag [73], states: the motion of every charged particle must asymptotically approach a uniform regime in the remote past,

[TABLE]

With this asymptotic condition, Eq. (73) represents the four-momentum emitted by the source over the period from the remote past to the instant $s$ . Differentiating (73) with respect to $s$ we obtain the four-momentum emitted by an accelerated charge per unit proper time:

[TABLE]

This is the relativistic generalization [75] of the famous Larmor formula [104],

[TABLE]

describing the rate of radiated energy in an instantaneously comoving Lorentz frame.

Equation (76) shows that ${d{\cal E}}/{dt}>0$ , which is evidence that the emission of radiation is a dissipative, and hence unidirectional, process.
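As a consistency check, the covariant emission rate can be compared with the familiar Liénard expression written in terms of the three-velocity and three-acceleration in an arbitrary Lorentz frame. The sketch assumes Gaussian units with the speed of light set to 1, signature $(+,-,-,-)$, and the rate $-\frac{2}{3}e^{2}a^{2}$ with $a^{2}=a_{\mu}a^{\mu}\leq 0$; the numerical inputs are arbitrary.

```python
import math

def dot3(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def four_acceleration(vel, acc):
    """Four-acceleration built from the three-velocity and three-acceleration."""
    gamma = 1.0 / math.sqrt(1.0 - dot3(vel, vel))
    va = dot3(vel, acc)
    time_part = gamma**4 * va
    space_part = [gamma**2 * acc[i] + gamma**4 * va * vel[i] for i in range(3)]
    return [time_part] + space_part

def power_covariant(vel, acc, e=1.0):
    """Emission rate -(2/3) e^2 a_mu a^mu."""
    a4 = four_acceleration(vel, acc)
    a_sq = a4[0] ** 2 - sum(x * x for x in a4[1:])
    return -2.0 / 3.0 * e**2 * a_sq

def power_lienard(vel, acc, e=1.0):
    """Lienard form (2/3) e^2 gamma^6 [acc^2 - (vel x acc)^2]."""
    gamma = 1.0 / math.sqrt(1.0 - dot3(vel, vel))
    vxa = cross(vel, acc)
    return 2.0 / 3.0 * e**2 * gamma**6 * (dot3(acc, acc) - dot3(vxa, vxa))

vel = (0.3, 0.4, 0.0)
acc = (0.1, -0.2, 0.5)
print(power_covariant(vel, acc), power_lienard(vel, acc))
```

For a charge momentarily at rest both functions reduce to $\frac{2}{3}e^{2}{\bf a}^{2}$, and for any timelike motion the result is nonnegative because the four-acceleration is spacelike.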

The concept of electromagnetic radiation grew up over a long period. Our interest here is with the definition of radiation developed by Teitelboim [147]. It was argued in [92], [95], [99] that only this definition can be correctly applied to the Yang–Mills–Wong theory.

3.1.2 Local balance of energy-momentum

Since $\Theta^{\mu\nu}_{{\rm\hskip 0.85358ptI}}$ contains $\rho^{-3}$ and $\rho^{-4}$ , this part of the electromagnetic stress-energy tensor is nonintegrable over three-dimensional surfaces intersecting the world line. An appropriate regularization is called for. With a Lorentz-invariant cutoff prescription, the result of integration is given by

[TABLE]

where $\epsilon$ is the cutoff parameter which is to go to zero at the end of the calculation. For a thorough derivation of Eq. (77) see, e. g., [99]. Observing that a bare particle possesses the four-momentum

[TABLE]

one may render $m_{0}$ a singular function of $\epsilon$ , $m_{0}=m_{0}(\epsilon)$ , add Eqs. (77) and (78) up, and carry out the renormalization of mass, that is, assume that

[TABLE]

is finite and positive. This completes the definition of the measure ${\rm Reg}_{\epsilon}\,d\sigma_{\lambda}\left(\Theta_{\rm I}^{\lambda\mu}+t^{\lambda\mu}\right)$ in the limit $\epsilon\to 0$ , and the regularization-renormalization procedure culminates in the well-defined quantity

[TABLE]

originally deduced in [147].

The regularization-renormalization procedure is a means for completing the definition of the product of singular distributions, like ${\Box}^{-1}\delta^{4}(x)$ , as linear continuous functionals on a suitable test function space, say, on Schwartz space.

The four-momentum $p^{\mu}$ defined in Eq. (80) is attributed to a new entity synthesized from mechanical and electromagnetic degrees of freedom. It is reasonable to call this entity a dressed charged particle.

Turning back to the general solution $F$ of Maxwell’s equations (35) and (37), we recall that $F$ is the sum of the retarded solution $F_{\rm ret}$ describing the self field of the delta-function source plus the general solution $F_{\rm ext}$ of the homogeneous wave equation describing free (‘‘external’’) electromagnetic field, $F=F_{\rm ret}+F_{\rm ext}$ . Accordingly, $\Theta^{\mu\nu}$ is split into $\Theta=\Theta_{\rm ret}+\Theta_{\rm mix}+\Theta_{\rm ext}$ . Our concern here is with $\Theta_{\rm mix}$ containing mixed contributions of $F_{\rm ret}$ and $F_{\rm ext}$ , while $\Theta_{\rm ext}$ is immaterial for the present discussion. Because the leading singularity of $F_{\rm ret}$ is of the type $\rho^{-2}$ , and $F_{\rm ext}$ is regular on the world line, the term $\Theta_{\rm mix}$ is integrable. Besides, taking into account the readily verifiable relationship

[TABLE]

the four-momentum ${\wp}^{\mu}$ associated with $\Theta^{\mu\nu}_{\rm mix}$ is conveniently evaluated by the use of a tube ${T}_{\epsilon}$ of infinitesimal radius $\epsilon$ , enclosing the world line $z^{\mu}(s)$ , as the integration surface,

[TABLE]

Equation (82) represents the four-momentum extracted from an external field $F_{\rm ext}$ during the whole past history prior to the instant $s$ . The derivative of ${\wp}^{\mu}$ with respect to $s$ equals an external Lorentz force exerted on the dressed particle at the point $z^{\mu}(s)$ .

Let us integrate (42) over a domain of spacetime bounded by two spacelike surfaces ${\Sigma}^{\prime}$ and ${\Sigma}^{\prime\prime}$ , separated by a short timelike interval, with both normals directed towards the future, and a tube ${T}_{R}$ of large radius ${R}$ . With the Gauss–Ostrogradsky theorem, this gives

[TABLE]

We then assume that $F_{\rm ext}$ disappears at spatial infinity. The only term contributing to the integral over $T_{R}$ is $\Theta^{\mu\nu}_{\rm II}$ . Taking into account the second equation of (62), the integral of $\Theta^{\mu\nu}_{\rm II}$ over $T_{R}$ can be converted into the integral over $T_{\epsilon}$ . The upshot is

[TABLE]

or, in a concise form,

[TABLE]

This is the desired local energy-momentum balance on the world line: the four-momentum $\Delta{\wp}^{\lambda}=-eF_{\rm ext}^{\lambda\mu}\,{v}_{\mu}\Delta s$ , extracted from the external field $F_{\rm ext}$ during the short period of time $\Delta s$ , is expended on the increment of four-momentum of the dressed particle, $\Delta p^{\lambda}$ , and the four-momentum carried away by radiation, ${\Delta{{\cal P}}}^{\lambda}$ .

Intuitively, this local balance is associated with an energy-momentum equilibration of the initially unstable system ‘‘a bare particle $+$ electromagnetic field’’. The rearrangement of degrees of freedom in this system may be said to terminate with the formation of the dressed particle and radiation. If an external field is incorporated in the system along with the self field, the external Lorentz force $eF_{\rm ext}^{\mu\nu}\,{v}_{\nu}$ comes into play in this equilibration.

3.1.3 The Abraham–Lorentz–Dirac equation

In an expanded form, Eq. (85) is an ordinary third-order differential equation for $z^{\mu}(s)$ ,

[TABLE]

originally discovered by Abraham [2], [3], [4], Lorentz [105], [106], and Dirac [52]. This equation is thus referred to by their names.

With the identities

[TABLE]

(86) can be rewritten as

[TABLE]

where $\stackrel{{\scriptstyle\scriptstyle v}}{{\bot}}$ stands for the projection operator on a hyperplane with normal $v^{\mu}$ , Eq. (72), $p^{\mu}$ is the four-momentum defined in Eq. (80), and $f^{\mu}$ is an external four-force.

Equation (88) is nothing but Newton’s second law smoothly embedded in Minkowski spacetime. Dressed particles are therefore dynamical objects governed by Newton’s second law. The dissimilarity of a dressed particle from its ancestor, a bare particle, is that the former has the four-momentum

[TABLE]

where $\tau_{0}$ is the characteristic time interval

[TABLE]

while the four-momentum of the latter depends on kinematical variables as

[TABLE]

It follows from (89) that

[TABLE]

Suppose that the acceleration of a dressed particle exceeds the critical value,

[TABLE]

then the dressed particle becomes a tachyon, that is, an object whose four-momentum is spacelike, $p^{2}<0$ . This does not imply superluminal motion. The potentially tachyonic nature of a dressed particle results from the fact that the curvature of its world line can be excessively high.
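A numerical illustration may help here. The sketch assumes the dressed four-momentum in the form $p^{\mu}=m\,(v^{\mu}-\tau_{0}a^{\mu})$ with $\tau_{0}=2e^{2}/3m$, so that $p^{2}=m^{2}(1+\tau_{0}^{2}a^{2})$, and uses hyperbolic motion with proper acceleration $g$ (for which $a^{2}=-g^{2}$) as a test world line; the electron value of $\tau_{0}$ is evaluated in SI units from the classical electron radius, $\tau_{0}=2r_{e}/3c$.

```python
import math

def minkowski_dot(a, b):
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def dressed_momentum_squared(g, mass=1.0, tau0=1.0, s=0.0):
    """p^2 for the assumed dressed momentum p = m (v - tau0 a) on a hyperbolic
    world line with proper acceleration g (units with c = 1)."""
    ch, sh = math.cosh(g * s), math.sinh(g * s)
    v = (ch, sh, 0.0, 0.0)
    a = (g * sh, g * ch, 0.0, 0.0)
    p = tuple(mass * (vi - tau0 * ai) for vi, ai in zip(v, a))
    return minkowski_dot(p, p)

p2_subcritical = dressed_momentum_squared(g=0.5)    # timelike four-momentum
p2_supercritical = dressed_momentum_squared(g=2.0)  # spacelike four-momentum

# Characteristic time for the electron, tau0 = 2 r_e / (3 c), in seconds.
R_ELECTRON = 2.8179403e-15      # classical electron radius in metres
C_LIGHT = 299792458.0           # speed of light in metres per second
tau0_electron = 2.0 * R_ELECTRON / (3.0 * C_LIGHT)
critical_acceleration = C_LIGHT / tau0_electron     # in metres per second squared

print(p2_subcritical, p2_supercritical, tau0_electron, critical_acceleration)
```

The critical acceleration obtained for the electron is of order $10^{31}$ m/s$^{2}$, far beyond anything attainable with macroscopic fields, so the tachyonic regime lies outside the domain where the classical description is expected to apply.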

A central feature of the Abraham–Lorentz–Dirac equation (86) is the lack of invariance under time reversal $s\to-s$ because this equation involves both $a^{\mu}$ whose transformation law is $a^{\mu}\to a^{\mu}$ , and ${\dot{a}}^{\mu}$ which transforms according to ${\dot{a}}^{\mu}\to-{\dot{a}}^{\mu}$ . The rearrangement denudes the Maxwell–Lorentz electrodynamics of the time reversal symmetry properties: the emission of radiation is a unidirectional process, and the equation of motion for a dressed particle, Eq. (86), is irreversible.

3.1.4 Another way of looking at the dressed dynamics

There are other methods of deriving the Abraham–Lorentz–Dirac equation without resort to the energy-momentum conservation law, Eq. (42). Our interest here is with one of them (see, e. g., [15]) which is claimed to be based on an alternative definition of radiation.

Let us proceed directly from the equation of motion for a bare charged particle (36),

[TABLE]

in which $F^{\mu\nu}=F^{\mu\nu}_{\rm ret}+F^{\mu\nu}_{\rm ext}$ , $F^{\mu\nu}_{\rm ret}$ is the retarded field due to the charge in question, and $F^{\mu\nu}_{\rm ext}$ is an external field. Following Dirac’s original approach [52], the retarded field $F_{\rm ret}^{\mu\nu}$ is separated into regular and singular parts through introducing the corresponding advanced field $F_{\rm adv}^{\mu\nu}$ :

[TABLE]

Expressions for the retarded and advanced Green’s function,

[TABLE]

indicate that the retarded field generated by a delta-function source behaves similarly to the advanced field in the vicinity of the source. Therefore, ${\bar{F}}$ is less singular than $F_{\rm ret}$ and $F_{\rm adv}$ , while $F_{P}$ shares the singular behavior of $F_{\rm ret}$ and $F_{\rm adv}$ .

The resulting regular part of the vector potential is

[TABLE]

where

[TABLE]

We use (99) in (97) to give

[TABLE]

Denoting $R^{\mu}=x^{\mu}-z^{\mu}(s)$ , we evaluate the regular part of the field strength:

[TABLE]

Since

[TABLE]

we have

[TABLE]

Let the observation point $x^{\mu}$ be on the world line, $x^{\mu}=z^{\mu}(\tau)$ . All other points on the world line are separated from $x^{\mu}$ by timelike intervals. Accordingly, the delta-function in (98) should be understood as the limit

[TABLE]

Besides, we can represent the argument of the signum function in (98) as $R_{0}=R\cdot v$ .

We now write $s=\tau+\sigma$ , and consider the integrand for a small interval $\sigma$ . Using the expansions

[TABLE]

where the vectors on the right-hand side refer to the instant $\tau$ , we find

[TABLE]

It follows that

[TABLE]

In view of identities (87),

[TABLE]

Substituting (108) and (109) into (103) and taking into account that

[TABLE]

we obtain

[TABLE]

and

[TABLE]

The term

[TABLE]

is called the Abraham term. In the literature, $\Gamma^{\mu}$ is often interpreted as radiation reaction, that is, the finite effect of the retarded Liénard–Wiechert field upon its own source. This interpretation goes back to Dirac [52] who considered $F_{\rm ret}^{\mu\nu}-F_{\rm adv}^{\mu\nu}$ as the radiation field and $F_{\rm ret}^{\mu\nu}+F_{\rm adv}^{\mu\nu}$ as the bound field. But this treatment is wrong. With reference to Eq. (95), we remark that $\Gamma^{\mu}$ is derived from ${\bar{F}}^{\mu\nu}$ , rather than from $F_{\rm ret}^{\mu\nu}-F_{\rm adv}^{\mu\nu}$ which is double the ${\bar{F}}^{\mu\nu}$ . We already mentioned that a dressed particle is acted upon by only an external force. It will transpire in the next section that the concept of radiation reaction causes much confusion in understanding the rearranged Maxwell–Lorentz theory. Furthermore, linear combinations of retarded and advanced fields seem to be of no use for nonlinear theories such as the Yang–Mills theory. Therefore, it is best to think of Eq. (95) as a mere formal trick for discriminating between integrable and nonintegrable singularities of the retarded field.
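The kinematical identities behind the Abraham term can be checked on an explicit world line. The sketch below takes uniform circular motion as a test case and assumes $\Gamma^{\mu}=\frac{2}{3}e^{2}\,(\dot{a}^{\mu}+a^{2}v^{\mu})$ together with the identities $v\cdot a=0$ and $v\cdot\dot{a}=-a^{2}$; the radius and angular velocity are arbitrary.

```python
import math

def minkowski_dot(a, b):
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def circular_kinematics(s, radius=1.0, omega=0.5):
    """Four-velocity, four-acceleration and its proper-time derivative for
    uniform circular motion (units with c = 1, radius * omega < 1)."""
    gamma = 1.0 / math.sqrt(1.0 - (radius * omega) ** 2)
    w = omega * gamma                    # angular frequency per unit proper time
    cs, sn = math.cos(w * s), math.sin(w * s)
    v = (gamma, -radius * w * sn, radius * w * cs, 0.0)
    a = (0.0, -radius * w**2 * cs, -radius * w**2 * sn, 0.0)
    a_dot = (0.0, radius * w**3 * sn, -radius * w**3 * cs, 0.0)
    return v, a, a_dot

def abraham_term(v, a, a_dot, e=1.0):
    """Assumed form Gamma = (2/3) e^2 (da/ds + a^2 v)."""
    a_sq = minkowski_dot(a, a)
    return tuple(2.0 / 3.0 * e**2 * (ad + a_sq * vi) for ad, vi in zip(a_dot, v))

v, a, a_dot = circular_kinematics(s=0.8)
gamma_term = abraham_term(v, a, a_dot)
print(minkowski_dot(v, v), minkowski_dot(v, a),
      minkowski_dot(v, a_dot) + minkowski_dot(a, a),
      minkowski_dot(gamma_term, v))
```

The orthogonality $\Gamma\cdot v=0$ holds for any world line as a consequence of the two identities; the circular orbit merely provides a case in which every quantity is available in closed form.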

Consider the symmetric part $F_{P}$ of the decomposition (95). The corresponding Green’s function is

[TABLE]

We regularize this expression as follows:

[TABLE]

Applying this procedure to

[TABLE]

we get

[TABLE]

Therefore,

[TABLE]

Substituting (112) and (118) in (94) and performing the renormalization of mass, Eq. (79), we come again to the Abraham–Lorentz–Dirac equation (86).

For completeness, it is useful to refer to a simple method of regularization proposed in [16], which obviates the need for advanced fields. This method can readily be extended to field theories involving scalar and tensor fields, and linearized gravitation [18], as well as electrodynamics in curved spacetime [19]. The key idea of this method is that the retarded field $F_{\rm ret}$ can be regularized in the vicinity of the source using a kind of analytic continuation. To be more precise, the field is regarded as a function of two variables $F^{\mu\nu}[x;z(s)]$ and is continued analytically from null intervals between the observation point $x^{\mu}$ and the retarded point $z^{\mu}(s)$ to timelike intervals which result from assigning $x^{\mu}=z^{\mu}(s+\epsilon)$ and keeping the second variable $z^{\mu}(s)$ fixed.

To summarize, the retarded Liénard–Wiechert field can be regularized in different ways. The regularization scheme is allowed to be arbitrary; the only requirement is that it respects the symmetries of the action (32)–(34). Given a regularization characterized by the regularization parameter $\epsilon$ , the field becomes finite but $\epsilon$ -dependent at distances shorter than $\epsilon$ . This suggests that the mechanical mass should also be a function of regularization, $m_{0}=m_{0}(\epsilon)$ . A remarkable fact is that the renormalization of mass (79), absorbing the self-energy divergence, makes the rearranged Maxwell–Lorentz electrodynamics a finite and unambiguous theory.

3.1.5 Paradoxes and misconceptions

The physical validity of the Abraham–Lorentz–Dirac equation

[TABLE]

has been the subject of much controversy. At present the following view of this equation is in wide use [10], [15], [82], [103], [116], [130]: Eq. (119) governs a charged radiating particle endowed with the four-momentum

[TABLE]

The particle is assumed to experience both an external force $f^{\mu}$ and the radiation reaction

[TABLE]

which is also known as the radiation damping four-force.

This view leads to many paradoxes and puzzles. To gain greater insight into why this view is so persistent, let us take a closer look at the notion of radiation field proposed by Dirac [52]. The general solution to Maxwell’s equations can be cast in two alternative ways:

[TABLE]

where $F_{\rm in}$ and $F_{\rm out}$ are respectively incoming and outgoing fields described by solutions to the homogeneous wave equation. Dirac defined the radiation field as

[TABLE]

In view of (122) and (123), this can also be written as the difference between the retarded and advanced solutions:

[TABLE]

$F_{\rm rad}$ measures the dissimilarity between the field which is going to happen in the far future and the field which was formed in the remote past, and hence may reasonably be called the radiation field. We will see in Sec. 3.4 that $F_{\rm rad}$ plays a crucial role in the Wheeler and Feynman absorber theory of radiation.

$F_{\rm rad}$ shares a number of traits with $F_{\rm II}$ , the long-range part of the retarded field defined by Eq. (54). Indeed, consider a world line composed of two timelike rays connected by a curved fragment, and draw two future light cones from the connection points. In the region enclosed by these cones, $F_{\rm rad}$ approaches $F_{\rm ret}$ , and, furthermore, $F_{\rm II}$ dominates $F_{\rm ret}$ with distance away from the source. Meanwhile $F_{\rm II}$ was shown in Sec. 3.1.1 to be completely removable by an appropriate local SL $(2,{\mathbb{R}})$ transformation, so that $F_{\rm rad}$ is removable along with $F_{\rm II}$ from the asymptotic region in which $F_{\rm rad}\to F_{\rm II}$ .

It is worthy of note that all manifestations of $F_{\rm rad}$ as a free field are due to linearity of Maxwell’s equations. In non-Abelian gauge theories, linear combinations of retarded and advanced solutions are no longer free fields. The construction (125) may only be of utility in Maxwell’s electrodynamics or other theories with linear field equations.

Since $\frac{1}{2}F_{\rm rad}$ is nonsingular on the world line, one can use it in the Lorentz force law, much as was done in Sec. 3.1.4, to yield the Abraham term $\Gamma^{\mu}$ , Eq. (121), appearing in the Abraham–Lorentz–Dirac equation (86). Ignoring the excess factor $2$ in the definition of $F_{\rm rad}$ , one is committed to recognizing $\Gamma^{\mu}$ as the radiation reaction.

What are the consequences of this recognition? The radiating particle feels a recoil,

[TABLE]

the negative of the Larmor emission rate (75). However, $-{{\dot{\cal P}}^{\mu}}$ is not a four-force because it is not orthogonal to $v^{\mu}$ . On the other hand, $\Gamma^{\mu}$ is orthogonal to $v^{\mu}$ , but differs from the expected recoil by the so-called Schott term ${2\over 3}\,e^{2}{\dot{a}}^{\mu}$ [140]. This term makes the energy-momentum balance problematic. To see this, write the temporal component of (119) as

[TABLE]

Following Dirac [52], one may reason: the rate at which the external force ${\bf F}$ does work on the particle is equal to the increase in the particle’s kinetic energy, plus the energy radiated, plus the energy stored in the Schott term. Although the energy stored in the Schott term can in principle be attributed to a ‘‘reversible form of emission and absorption of field energy’’, its actual role appears mysterious.

In an effort to remedy the situation, we impose the asymptotic condition

[TABLE]

generalizing the Haag condition (74). Observing that ${2\over 3}\,e^{2}{\dot{a}}^{\mu}$ is a perfect differential, and integrating (119) over $s$ in infinite limits, we eliminate the effect of this term and come to a global energy-momentum balance

[TABLE]

It may appear that (129) is a satisfactory solution to the problem [131]. But actually this result is puzzling. Whatever the history of the particle, $z^{\mu}(s)$, obeying the asymptotic condition (128), the totality of alternating local emissions and absorptions, controlled by the Schott term, is zero, so that the global energy-momentum balance (129) holds true. It is as if the particle takes care that this totality vanishes in the end. The natural question can then be raised: Why is the energy-momentum conserved globally rather than locally? There is nothing in the laws of the Maxwell–Lorentz theory which suggests that the electromagnetic interaction is nonlocal to make local balance impossible.

The paradigm of rearrangement offers an alternative view of Eq. (119) as the equation of motion for a dressed particle. It was established in Sec. 3.1.3 that (119) is equivalent to Newton’s second law embedded in Minkowski space, Eq. (88). The key point is that a dressed particle possesses the four-momentum $p^{\mu}$ given by (89) rather than by (120). The structure of Eq. (88) makes it clear that the only force exerted on the dressed particle is an external force $f^{\mu}$. There is no term in this equation through which the dressed particle interacts with itself. Notions such as ‘‘radiation damping force’’ are thus to be abandoned as misconceptions. The local energy-momentum balance on the world line, Eq. (85),

[TABLE]

is a mere rewriting of the Abraham–Lorentz–Dirac equation, which tells us that the energy-momentum of an external field is converted into the increment of energy-momentum of the dressed particle and the energy-momentum radiated.

Closely related to the energy-momentum balance problem is the paradox of uniform acceleration. A covariant condition for uniform acceleration (see, e. g., [130]) is

[TABLE]

This condition implies $\Gamma^{\mu}=0$ , and the Abraham–Lorentz–Dirac equation (119) becomes

[TABLE]

which is identical to the equation of motion for a nonradiating particle, say a neutral particle of mass $m$ . One may see the paradox in the fact that a uniformly accelerated charged particle, while emitting electromagnetic radiation, experiences no back reaction. Besides, it is strange that the case of uniform acceleration is physically distinguished.

No paradox arises for a dressed particle. As pointed out above, the dressed particle experiences only an external force $f^{\mu}$ . The Abraham term $\Gamma^{\mu}$ has nothing to do with radiation reaction. $\Gamma^{\mu}$ says nothing about the emission rate ${{\dot{\cal P}}^{\mu}}$ , and $\Gamma^{\mu}=0$ does not imply that ${{\dot{\cal P}}^{\mu}}=0$ .
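The independence of the Abraham term and the emission rate can be checked numerically for a hyperbolic world line. The sketch below assumes the standard forms $\Gamma^{\mu}={2\over 3}e^{2}({\dot a}^{\mu}+a^{2}v^{\mu})$ and ${\dot{\cal P}}^{\mu}=-{2\over 3}e^{2}a^{2}v^{\mu}$ in the metric ${\rm diag}(1,-1)$, which are taken here to correspond to Eqs. (121) and (75); the function names and the units with $e^{2}=1$ are illustrative choices, not part of the original text.

```python
import math

E2 = 1.0  # e^2 in arbitrary units (illustrative assumption)

def hyperbolic_kinematics(g, s):
    """Velocity, acceleration and its derivative for rapidity nu(s) = g*s."""
    ch, sh = math.cosh(g * s), math.sinh(g * s)
    v = (ch, sh)                      # v^mu
    a = (g * sh, g * ch)              # a^mu = dv^mu/ds
    adot = (g * g * ch, g * g * sh)   # da^mu/ds
    return v, a, adot

def minkowski_sq(x):
    """Square of a two-vector in the metric diag(1, -1)."""
    return x[0] ** 2 - x[1] ** 2

def abraham_term(g, s):
    """Assumed form Gamma^mu = (2/3) e^2 (adot^mu + a^2 v^mu)."""
    v, a, adot = hyperbolic_kinematics(g, s)
    a2 = minkowski_sq(a)
    return tuple(2.0 / 3.0 * E2 * (adot[i] + a2 * v[i]) for i in range(2))

def emission_rate(g, s):
    """Assumed Larmor form dP^mu/ds = -(2/3) e^2 a^2 v^mu."""
    v, a, _ = hyperbolic_kinematics(g, s)
    a2 = minkowski_sq(a)
    return tuple(-2.0 / 3.0 * E2 * a2 * v[i] for i in range(2))
```

Under these assumptions `abraham_term` vanishes to rounding accuracy for any `g` and `s`, while the time component of `emission_rate` stays strictly positive.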

There is another formulation of this paradox. Let us compare the behavior of neutral and charged particles, which have identical masses $m$, and move along a straight line under a constant force $f^{\mu}$, say, fall toward the surface of the Earth. Both are attracted toward the Earth by an approximately constant force ${\bf f}=-mg{\bf n}$, where $g$ is the acceleration of gravity, and ${\bf n}$ the normal vector of the surface of the Earth. With the ansatz

[TABLE]

where $\nu=\nu(s)$ is an unknown function, the equation of motion for the charged particle reduces to

[TABLE]

and that for the neutral particle reduces to

[TABLE]

Both equations (135) and (136) are satisfied by ${\nu}=-gs$. Therefore, a given constant force causes both particles to move along the same hyperbolic world line

[TABLE]

even if the accelerated charged particle radiates. Since this radiation carries off energy, the charged particle may be expected to accelerate less than the neutral one.

Note, however, that the energy of a neutral particle is positive definite, while the energy of a dressed charged particle is indefinite. Despite the fact that both particles execute identical motions, the energy associated with these motions is different. Indeed, the energy of a dressed charged particle is

[TABLE]

Accordingly, the increment of $p^{0}$ during a period from $s_{1}$ to $s_{2}$ is

[TABLE]

The energy radiated during this period is

[TABLE]

The sum of (139) and (140) equals the work $W$ of the force $f^{\mu}$ defined in (134),

[TABLE]

as might be expected from the balance equation (130).

For the neutral particle, $p^{0}=mv^{0}$ , and so

[TABLE]

which is equal to $W$ .

Keeping in mind Eq. (92), one may state that, when executing an accelerated motion, the dressed particle appears as an object less heavy than the neutral particle. That is why the increment of energy of the dressed particle caused by a constant force, Eq. (139), is less than that of the neutral particle, Eq. (142), by the energy radiated, Eq. (140).
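The energy bookkeeping for the two particles can be reproduced numerically. The sketch below is a minimal check under stated assumptions: the dressed-particle four-momentum is taken in the form $p^{\mu}=mv^{\mu}-{2\over 3}e^{2}a^{\mu}$, the emission rate in the Larmor form ${2\over 3}e^{2}{\dot\nu}^{2}v^{0}$, and the force as $f^{\mu}=f(\sinh\nu,\cosh\nu)$ with $f=m{\dot\nu}$ and constant rapidity rate $w$; the helper names are hypothetical.

```python
import math

def simpson(func, lo, hi, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = func(lo) + func(hi)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * func(lo + k * h)
    return total * h / 3

def energy_budget(m, e2, w, s1, s2):
    """Energy increments for rectilinear motion with rapidity nu(s) = w*s."""
    k = 2.0 * e2 / 3.0
    p0_dressed = lambda s: m * math.cosh(w * s) - k * w * math.sinh(w * s)
    p0_neutral = lambda s: m * math.cosh(w * s)
    larmor_rate = lambda s: k * w * w * math.cosh(w * s)   # dP^0/ds
    power = lambda s: m * w * math.sinh(w * s)              # f^0 for f = m*w
    return {
        "dressed": p0_dressed(s2) - p0_dressed(s1),
        "neutral": p0_neutral(s2) - p0_neutral(s1),
        "radiated": simpson(larmor_rate, s1, s2),
        "work": simpson(power, s1, s2),
    }
```

With these assumptions, `dressed + radiated` and `neutral` both reproduce `work`, so the dressed particle gains less energy than the neutral one by exactly the radiated amount.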

A further concern is with the so-called counter-acceleration. One normally expects that the smallness of $\Gamma^{\mu}$ implies small corrections to the essentially Newtonian behavior of a charged particle. But these expectations are not always realized.

Let a charged particle be moving along a straight line. Then ${v}^{\mu}=(\cosh\nu,\sinh\nu,0,0)$ , $f^{\mu}=f(\sinh\nu,\cosh\nu,0,0)$ , and the Abraham–Lorentz–Dirac equation (86) reduces to

[TABLE]

This ordinary differential equation can be readily integrated to give

[TABLE]

where $C$ is an arbitrary initial value of ${\dot{\nu}}$ at $s=0$ . The comparison of (86) and (143) shows that ${\dot{\nu}}$ can be identified with the one-dimensional acceleration in the instantaneously comoving frame of reference. Setting $C=0$ , one finds that ${\dot{\nu}}$ and $f$ are oppositely directed.
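A short numerical check of the counter-acceleration can be sketched as follows. It assumes that the rectilinear equation has the form $m({\dot\nu}-\tau_{0}{\ddot\nu})=f$ with $\tau_{0}=2e^{2}/3m$ and a constant force $f$, for which the solution with ${\dot\nu}(0)=C$ is ${\dot\nu}=Ce^{s/\tau_{0}}+(f/m)(1-e^{s/\tau_{0}})$; the function names and default parameter values are illustrative.

```python
import math

def nu_dot(s, f, m=1.0, tau0=1.0, c=0.0):
    """Solution of m*(nu' - tau0*nu'') = f for constant f, with nu'(0) = c."""
    growth = math.exp(s / tau0)
    return c * growth + (f / m) * (1.0 - growth)

def residual(s, f, m=1.0, tau0=1.0, c=0.0, h=1e-5):
    """Finite-difference value of m*(nu' - tau0*nu'') - f; should be near zero."""
    second = (nu_dot(s + h, f, m, tau0, c) - nu_dot(s - h, f, m, tau0, c)) / (2 * h)
    return m * (nu_dot(s, f, m, tau0, c) - tau0 * second) - f
```

For $C=0$ and $s>0$ the returned ${\dot\nu}$ has the sign opposite to that of $f$, in line with the statement above.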

For a dressed particle, this result presents no difficulty. Indeed, Eq. (88) shows that the dynamics of a dressed particle is Newtonian. This, however, in no way implies that acceleration must be aligned with force; $a^{\mu}$ and $f^{\mu}$ would have the same direction only if one makes the ad hoc assumption that $p^{\mu}=mv^{\mu}$ .

We next turn to the problem of runaways. On putting $f^{\mu}=0$ in (119), one can easily check that the general solution to this equation is

[TABLE]

where ${\rm V}^{\mu}$ and ${\rm U}^{\mu}$ are constant four-vectors such that ${\rm V}\cdot{\rm U}=0$ , ${\rm V}^{2}=-{\rm U}^{2}=1$ , and $\nu_{0}$ and $w_{0}$ are arbitrary parameters. This solution of the Abraham–Lorentz–Dirac equation is the most embarrassing feature of the theory: a free charged particle moving along the world line defined in (145) continually accelerates,

[TABLE]

and, furthermore, continually radiates. This may seem contrary to energy conservation. Such particles are said to be self-accelerated, or else executing a runaway motion.

Based on the dynamics of a dressed charged particle, the issue of energy-momentum conservation can be solved immediately. Taking $f^{\mu}=0$ in (130), we have

[TABLE]

which suggests that the rate of change of the energy-momentum of a dressed particle equals the negative of the emission rate. Of crucial importance is the observation that the energy of a dressed charged particle is indefinite,

[TABLE]

and hence, increasing $|{\bf v}|$ need not be accompanied by increasing $p^{0}$. In fact, the energy of a dressed particle executing a runaway motion (145) decreases steadily, which exactly compensates the increase in energy of the electromagnetic field emitted,

[TABLE]
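The compensation between the decreasing energy of the dressed particle and the emitted energy can be verified numerically. The sketch below assumes a rectilinear runaway with rapidity $\nu(s)=\nu_{0}+w_{0}\tau_{0}e^{s/\tau_{0}}$, the dressed-particle energy $p^{0}=m(\cosh\nu-\tau_{0}{\dot\nu}\sinh\nu)$, and the Larmor rate $m\tau_{0}{\dot\nu}^{2}\cosh\nu$; these forms and all names are assumptions made for illustration.

```python
import math

def rapidity(s, nu0, w0, tau0):
    """Assumed runaway rapidity nu(s) = nu0 + w0*tau0*exp(s/tau0)."""
    return nu0 + w0 * tau0 * math.exp(s / tau0)

def rapidity_rate(s, w0, tau0):
    return w0 * math.exp(s / tau0)

def dressed_energy(s, m, nu0, w0, tau0):
    """p^0 = m*(cosh(nu) - tau0*nu'*sinh(nu)) for the dressed particle."""
    nu = rapidity(s, nu0, w0, tau0)
    return m * (math.cosh(nu) - tau0 * rapidity_rate(s, w0, tau0) * math.sinh(nu))

def emitted_power(s, m, nu0, w0, tau0):
    """Larmor rate dP^0/ds = m*tau0*nu'^2*cosh(nu)."""
    nu = rapidity(s, nu0, w0, tau0)
    return m * tau0 * rapidity_rate(s, w0, tau0) ** 2 * math.cosh(nu)

def balance_defect(s, m=1.0, nu0=0.0, w0=0.2, tau0=1.0, h=1e-5):
    """dp^0/ds + dP^0/ds, estimated by a central difference; should be near zero."""
    dp0 = (dressed_energy(s + h, m, nu0, w0, tau0)
           - dressed_energy(s - h, m, nu0, w0, tau0)) / (2 * h)
    return dp0 + emitted_power(s, m, nu0, w0, tau0)
```

In this sketch `dressed_energy` decreases monotonically along the runaway, and `balance_defect` stays at the level of the finite-difference error.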

The runaways have long been believed to be unphysical solutions. The reason for this belief is that a free electron with exponentially increasing acceleration has never been observed experimentally. However, by the turn of the 20th century, the idea that free objects can move with acceleration had gained some appeal (for a review of some objects exemplifying such non-Galilean motions see [97]). The observational evidence for an accelerating Universe [128], [118], expressed in terms of the scale factor $a$ of the line element for homogeneous and isotropic spacetime,

[TABLE]

where $H_{0}$ is the current Hubble expansion parameter ( $H_{0}^{-1}\sim 10^{10}$ years), is usually attributed to the presence of a positive cosmological constant. An alternative approach to account for this exponentially accelerated motion [98] asserts that the Universe may be regarded as a free massive object, say a brane, which emits gravitational radiation and moves in a runaway regime similar to that shown in Eq. (145), with the characteristic time $\tau_{0}$ being $\sim H_{0}^{-1}\sim G_{\rm N}m$ , where $G_{\rm N}$ is Newton’s constant, and $m$ the total visible mass of the Universe.

Turning to the lack of observational evidence for self-accelerated motions of electrons, we note that the critical acceleration $|{\bf a}|=\tau_{0}^{-1}$ is attained after a lapse of time

[TABLE]

and then the four-momentum of a self-accelerated electron becomes spacelike, $p^{2}<0$ . The period of time over which a self-accelerated subnuclear particle, if any, possesses timelike four-momenta is quite tiny. From (90) and (151), the period $\Delta s$ is estimated at $\tau_{0}\sim 10^{-23}$ s for electrons, and still shorter for more massive charged elementary particles. All primordial self-accelerated particles with such $\tau_{0}$ ’s have long been in the tachyonic state. But we do not have the slightest notion of how tachyons can be experimentally recorded.
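The orders of magnitude involved can be illustrated with a small calculation. The sketch assumes $\tau_{0}=2e^{2}/3mc^{3}=2r_{e}/3c$ (Gaussian units, with rounded CODATA values for the classical electron radius $r_{e}$ and the speed of light), the runaway rate ${\dot\nu}=w_{0}e^{s/\tau_{0}}$, and $p^{2}=m^{2}(1-\tau_{0}^{2}{\dot\nu}^{2})$, which follows from $p^{\mu}=m(v^{\mu}-\tau_{0}a^{\mu})$; the function names are hypothetical.

```python
import math

CLASSICAL_ELECTRON_RADIUS = 2.8179403e-15  # m (CODATA value, rounded)
SPEED_OF_LIGHT = 299792458.0               # m/s (exact by definition)

def tau0_electron():
    """tau0 = 2 e^2 / (3 m c^3) = 2 r_e / (3 c), in seconds."""
    return 2.0 * CLASSICAL_ELECTRON_RADIUS / (3.0 * SPEED_OF_LIGHT)

def p_squared(s, m, w0, tau0):
    """p^2 = m^2 * (1 - (tau0*nu')^2) along the assumed runaway nu' = w0*exp(s/tau0)."""
    return m * m * (1.0 - (tau0 * w0 * math.exp(s / tau0)) ** 2)

def time_to_critical(w0, tau0):
    """Proper time at which tau0*|nu'| reaches 1 (requires 0 < tau0*|w0| < 1)."""
    return -tau0 * math.log(tau0 * abs(w0))
```

For the electron this gives $\tau_{0}$ of roughly $6\times 10^{-24}$ s. Since the time to reach the critical acceleration depends only logarithmically on the initial value $w_{0}$, it is of the order of $\tau_{0}$ unless $w_{0}$ is extremely small.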

We finally address the problem of pre-accelerations. For simplicity, we consider a charged particle moving along a straight line. To get rid of runaways, we assume that ${\dot{\nu}}(s)\to 0$ as $s\to\infty$ . Then the differential equation (143) can be readily converted into an integro-differential equation [81], [82],

[TABLE]

It follows from this equation that the particle accelerates before the force is applied. For example, if a step pulse, $f(t)=f_{0}\,\theta(t)$, were applied, then the particle would begin its acceleration at a time $\tau_{0}$ before the pulse arrived. This phenomenon is regarded as unphysical because the behavior of the particle apparently violates causality.
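The pre-acceleration for a step pulse can be made explicit with a short numerical sketch. It assumes that the integro-differential equation has the familiar form $m{\dot\nu}(s)=\int_{0}^{\infty}d\xi\,e^{-\xi}f(s+\tau_{0}\xi)$, which for $f=f_{0}\,\theta(s)$ gives ${\dot\nu}=(f_{0}/m)\,e^{s/\tau_{0}}$ for $s<0$ and ${\dot\nu}=f_{0}/m$ for $s\geq 0$; the function names, the truncation `xi_max`, and the number of quadrature points are illustrative.

```python
import math

def smeared_acceleration(force, s, tau0, m=1.0, xi_max=40.0, n=200000):
    """Midpoint-rule estimate of (1/m) * int_0^inf exp(-xi) f(s + tau0*xi) d(xi)."""
    h = xi_max / n
    total = 0.0
    for k in range(n):
        xi = (k + 0.5) * h
        total += math.exp(-xi) * force(s + tau0 * xi)
    return total * h / m

def step_force(f0):
    """Step pulse f(s) = f0 * theta(s)."""
    return lambda s: f0 if s >= 0.0 else 0.0

def step_response(s, f0, tau0, m=1.0):
    """Closed form of the same integral for the step pulse."""
    return (f0 / m) * (math.exp(s / tau0) if s < 0.0 else 1.0)
```

Under this assumption the acceleration is already nonzero at negative $s$ and grows exponentially over the characteristic time $\tau_{0}$ before the pulse arrives.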

We should not forget, however, that a dressed particle is an object synthesized from mechanical and field degrees of freedom, and hence the third-order differential equation, Eq. (119), governs its behavior. If an effort is made to express the Abraham–Lorentz–Dirac equation in terms of the second-order equation of motion for a bare particle, then the distinction between these two dynamics shows itself as an effective smearing over a small spacetime region of size $\tau_{0}$ . This imprecise argument can be formulated more neatly: the effective smearing is expressed by Eq. (152). Of course, the actual interaction of a dressed particle with an external field is local, as exemplified by the local energy-momentum balance (130).

Most paradoxes in the Maxwell–Lorentz electrodynamics stem from the view that a charged radiating particle carries four-momentum $p^{\mu}=mv^{\mu}$, and that the Abraham term $\Gamma^{\mu}$ acts on this particle as the radiation damping force. The formula $p^{\mu}=mv^{\mu}$ has no justification except for keeping an analogy with mechanics. If a stress-energy tensor ${\mathfrak{T}}^{\mu\nu}$ associated with the integral quantity $p^{\mu}=mv^{\mu}$ is explored, one can readily verify that the local conservation law $\partial_{\mu}{\mathfrak{T}}^{\mu\nu}=0$ does not hold outside the world line. Considering ${{T}}^{\mu\nu}-{\mathfrak{T}}^{\mu\nu}$ as the radiation part of the stress-energy tensor we come into conflict with the characteristic properties of the radiation (64) and (65).

Note also that $p^{\mu}=mv^{\mu}$ implies that $p^{0}$ is positive definite. But the four-momentum of a dressed charged particle is defined by $p^{\mu}=P_{\rm I}^{\mu}+m_{0}v^{\mu}$ . The bound four-momentum $P_{\rm I}^{\mu}$ is a timelike future directed vector, while the four-momentum of a bare particle $m_{0}v^{\mu}$ is a timelike past directed vector because $m_{0}(\epsilon)<0$ for small $\epsilon$ , as Eq. (79) suggests. Assuming that $P_{\rm I}^{\mu}+m_{0}v^{\mu}$ is a timelike vector, one should recognize that the time component of this vector can have any sign. This conclusion is certified by Eq. (148).

Lack of understanding of the fact that the dynamics of a dressed charged particle attributed to the Abraham–Lorentz–Dirac equation (119) is physically satisfactory has led to numerous attempts at developing ad hoc modifications and approximate versions of this equation adapted to applications (see, e. g., [146] and references therein). We omit these developments because they add little to our discussion of the conceptual aspects of the Maxwell–Lorentz theory.

While the concept of dressed particles is a tenable physical means of disambiguation, the proper mathematical treatment of the Abraham–Lorentz–Dirac equation governing a dressed particle is as yet imperfectly understood. When extreme care is not exercised, surprising results may arise [57], [120], [116], which tempt one to assign the blame to the Abraham–Lorentz–Dirac equation itself.

3.2 Electrodynamics in various dimensions

It is instructive to see whether the rearrangement of the Maxwell–Lorentz theory remains unchanged in other dimensions if the field sector of the action is assumed to remain valid,

[TABLE]

Here, $\Omega_{d-2}={2{\pi}^{\frac{d-1}{2}}}/{\Gamma\left(\frac{d-1}{2}\right)}$ is the area of the unit $(d-2)$ -sphere, the field strength $F_{\mu\nu}$ is expressed in terms of the vector potentials, $F_{\mu\nu}=\partial_{\mu}A_{\nu}-\partial_{\nu}A_{\mu}$ , and

[TABLE]

represents the current density of a single point charge $e$. For simplicity, the charge $e$ is set to unity here and in the next subsubsection. Greek letters standing for spacetime indices take the values $0,1,\ldots,d-1$. We adopt the metric $\eta_{\mu\nu}={\rm diag}\,(1,-1,\ldots,-1)$.
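The normalization factor $\Omega_{d-2}$ quoted above is easy to evaluate for the dimensions of interest. The following sketch simply codes the formula $\Omega_{d-2}=2\pi^{(d-1)/2}/\Gamma\left(\frac{d-1}{2}\right)$; the function name is an illustrative choice.

```python
import math

def unit_sphere_area(d):
    """Area of the unit (d-2)-sphere for spacetime dimension d."""
    half = (d - 1) / 2.0
    return 2.0 * math.pi ** half / math.gamma(half)
```

For $d=4$ this returns $4\pi$, the familiar area of the unit two-sphere, while $d=2$ gives $2$ (the two points of the zero-sphere) and $d=6$ gives $8\pi^{2}/3$.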

The field equation resulting from (153), on imposing the Lorenz gauge condition, reads

[TABLE]

This equation can be solved with the aid of the Green’s function technique [81]. Solutions to the equation of the retarded Green’s function

[TABLE]

are given by

[TABLE]

where $\delta^{(n-2)}(x^{2})$ is the Dirac delta-function differentiated $n-2$ times with respect to its argument.

A sharp distinction between wave propagation in space of even and odd dimensions can be understood from Huygens’ principle, whereby any retarded signal carries information on the state of a point source at the instant of its emission. This idea is exemplified in the first line of (157): the retarded Green’s function for ${\mathbb{R}}_{1,2n-1}$ is concentrated on the forward sheet of the light cone $x^{2}=0,\,x_{0}>0$ . By contrast, in ${\mathbb{R}}_{1,2n}$ the retarded signal measured at a point $x^{\mu}$ derives from the entire history of the source which lies on or within the past light-cone of $x^{\mu}$ . If we think of the retarded signal as travelling with the speed of light then it ought to be emitted at the instant the source intersects the past light cone of $x^{\mu}$ . Hence, Huygens’ principle fails in odd spacetime dimensions: the second line of (157) shows that the support of the retarded Green’s function in ${\mathbb{R}}_{1,2n}$ is the interior of the future light cone $x^{2}\geq 0$ , rather than its surface.

Even-dimensional electromagnetic worlds have little in common with their relatives in odd dimensions. To illustrate this, in ${\mathbb{R}}_{1,2}$ , the action (153) can be augmented by the addition of the Chern–Simons term

[TABLE]

which is gauge invariant despite the presence of the parameter $\mu$ interpreted as the mass of the field $A^{\alpha}$ [49]. Odd-dimensional versions of the Maxwell–Lorentz electrodynamics are more intricate, both technically and conceptually, than the even-dimensional versions; in particular, the self-interaction problem in ${\mathbb{R}}_{1,2n}$ is less well understood. Attempts at formally evaluating the effects of self-interaction have yielded little, if any, insight into this problem. This forces us to focus on even dimensions.

3.2.1 ${\mathbb{R}}_{1,2n-1}$

The $2n$ -dimensional retarded vector potential is given by

[TABLE]

where $R^{\mu}=x^{\mu}-z^{\mu}(s_{\rm ret})$ is the null $2n$ -vector drawn from the retarded point $z^{\mu}(s_{\rm ret})$ on the world line, where the signal is emitted, to the point $x^{\mu}$ , where the signal is received.

To simplify our notations as much as possible, we introduce the net vector potentials and field strengths, ${\cal A}_{\mu}$ and ${\cal F}_{\mu\nu}$ (as opposed to the ordinary vector potentials and field strengths, ${A}_{\mu}$ and ${F}_{\mu\nu}$ , whose normalization is consistent with Gauss’ law):

[TABLE]

where

[TABLE]

Consider the vector potentials, calculated through the use of Eq. (159), restricting our attention to $d$ in the range from $d=2$ to $d=10$ (which holds the greatest interest in string and brane applications):

[TABLE]

It is possible to show [100] that the field strength ${\cal F}^{(2n)}_{\mu\nu}$ generated by a point charge living in a $2n$ -dimensional world is expressed in terms of the vector potentials ${\cal A}^{(2m)}_{\mu}$ due to this charge in $2m$ -dimensional worlds nearby,

[TABLE]

We mention in passing that these algebraic relationships are not only remarkably simple and elegant, but also physically surprising. The world line $z^{\mu}(s)$ of the charge generating these field configurations is described by various numbers of the principal curvatures $\kappa_{j}$ for different spacetime dimensions. To be specific, consider Eq. (169). The world line which gives rise to ${\cal A}^{(2)}_{\mu}$ is a planar curve, specified solely by $\kappa_{1}$ , while that giving rise to ${\cal A}^{(6)}_{\mu}$ is a curve characterized by five essential parameters $\kappa_{1}$ , $\kappa_{2}$ , $\kappa_{3}$ , $\kappa_{4}$ , and $\kappa_{5}$ . If we regard the world line $z^{\mu}(s)$ in ${\mathbb{M}}_{1,2n-1}$ as the basic object, then both projections of this curve onto lower-dimensional spacetimes and its extensions to higher-dimensional spacetimes are rather arbitrary. However, this arbitrariness does not show itself in Eqs. (168)–(172) linking ${\cal F}^{(2n)}_{\mu\nu}$ and ${\cal A}^{(2m)}_{\mu}$ .

To reveal the behavior of the retarded electromagnetic field at spatial infinity, we segregate in ${\cal A}^{(2n+2)}$ the term scaling as $\rho^{-n}$ by introducing the vectors

[TABLE]

and

[TABLE]

All infrared irrelevant terms are erased by this limiting procedure, so that

[TABLE]

represents the long-distance asymptotics of ${\cal F}^{(2n)}$ [96], [99], [100], [72]. To see this, it is sufficient to compare the behavior of ${\cal A}^{(2)}\wedge{\cal A}^{(2n+2)}$ and ${\cal A}^{(4)}\wedge{\cal A}^{(2n)}$. Because the most slowly falling terms of ${\cal A}^{(2n+2)}$ and ${\cal A}^{(2n)}$ scale, respectively, as $\rho^{-n}$ and $\rho^{1-n}$, the leading long-distance asymptotics of ${\cal A}^{(2)}\wedge{\cal A}^{(2n+2)}$ is given by $\rho^{1-n}$ while that of ${\cal A}^{(4)}\wedge{\cal A}^{(2n)}$ is given by $\rho^{-n}$. The same is true for the comparison of the long-distance behavior of ${\cal A}^{(2)}\wedge{\cal A}^{(2n+2)}$ and that of the remaining two-forms contained in ${\cal F}^{(2n)}$.

We write explicitly ${\mathfrak{b}}^{(2n+2)}_{\mu}$ for $n=1,2,3,4,5$ :

[TABLE]

The radiated energy-momentum is defined by

[TABLE]

where $\Sigma$ is a $(2n-1)$ -dimensional spacelike hypersurface. $\Theta^{\mu\nu}_{\rm II}$ involves only integrable singularities, and $\partial_{\nu}\Theta^{\mu\nu}_{\rm II}=0$ . Therefore, the surface of integration $\Sigma$ in (181) may be chosen arbitrarily. It is convenient to deform $\Sigma$ to a tubular surface ${T}_{\epsilon}$ of small invariant radius $\rho=\epsilon$ enclosing the world line. The surface element on this tube is

[TABLE]

Since the radiation flux through a $(2n-2)$ -dimensional sphere enclosing the source is constant for any radius of the sphere, the terms of $\Theta_{\mu\nu}$ responsible for this flux scale as $\rho^{2-2n}$ . Equation (181) becomes

[TABLE]

so that the radiation rate is given by

[TABLE]

A large list of generic formulas describing radiation of tensor fields of various ranks from an accelerated point source moving in ${\mathbb{R}}_{1,2n-1}$, which allows one to evaluate the total intensity and radiated momentum, has been given in [111].

The above results may give the impression that self-interaction in any even dimension is qualitatively the same. Indeed, it seems to be imperative that the Maxwell–Lorentz theory, generalized to $2n$ dimensions, rearranges in a standard way to bring into existence radiation and dressed particles whose momenta obey the balance equation (85). We now take $d=2$ and $d=6$ as examples which refute this impression; namely, we mean to show that the $d=2$ and $d=6$ pictures differ drastically from the $d=4$ picture [96], [99]. Two-dimensional electrodynamics is immune from rearrangement: there is neither radiation nor dressing. Six-dimensional electrodynamics does rearrange, but the upshot is surprising. The six-momentum of a dressed particle is not defined uniquely; this quantity is given by two different expressions $\mathfrak{p}^{\mu}$ and $p^{\mu}$. Each is the six-momentum in a particular context. If we take the balance equation (85), then the dressed particle is represented by $\mathfrak{p}^{\mu}$, but if Newton’s second law (88) is regarded as the equation of motion for the dressed particle, its dynamical state is specified by $p^{\mu}$.

3.2.2 ${\mathbb{R}}_{1,1}$

The retarded vector potential, generated by a charged point particle,

[TABLE]

and the associated retarded field strength,

[TABLE]

are not singular, even if $F_{\mu\nu}$ is discontinuous on the world line. Besides, $F_{\mu\nu}$ is independent of acceleration.

The stress-energy tensor for the field (186) is

[TABLE]

Here, the completeness relation $v^{\mu}v^{\nu}-u^{\mu}u^{\nu}=\eta^{\mu\nu}$ stemming from the fact that $v^{\mu}$ and $u^{\mu}$ form a basis in ${\mathbb{R}}_{1,1}$ has been used. It is evident that $\partial_{\mu}\Theta^{\mu\nu}=0$ . Expression (187) contains the term $-\frac{1}{4}\,e^{2}c^{\mu}c^{\nu}$ . Is it possible to interpret it as radiation? Although this term meets three conditions (64), (65), and (67), the fourth condition (68) is violated because $-\frac{1}{4}\,e^{2}c^{\mu}c^{\nu}$ is similar in its spatial behavior to the rest of the stress-energy tensor (187), contrary to the requirement that the radiation be asymptotically separated from the bound part. Hence, the electromagnetic radiation is absent from ${\mathbb{R}}_{1,1}$ .
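The completeness relation used above can be checked directly. The sketch below assumes the rapidity parametrization $v^{\mu}=(\cosh\nu,\sinh\nu)$, $u^{\mu}=(\sinh\nu,\cosh\nu)$ for the timelike and spacelike unit vectors and the metric ${\rm diag}(1,-1)$; the names are illustrative.

```python
import math

ETA = ((1.0, 0.0), (0.0, -1.0))  # metric of R_{1,1}

def basis(nu):
    """Unit timelike vector v and orthogonal unit spacelike vector u at rapidity nu."""
    v = (math.cosh(nu), math.sinh(nu))
    u = (math.sinh(nu), math.cosh(nu))
    return v, u

def completeness_defect(nu):
    """Largest entry of |v^mu v^nu - u^mu u^nu - eta^{mu nu}|."""
    v, u = basis(nu)
    return max(abs(v[i] * v[j] - u[i] * u[j] - ETA[i][j])
               for i in range(2) for j in range(2))
```

The defect vanishes to rounding accuracy for any rapidity, which is the statement that $v^{\mu}$ and $u^{\mu}$ span ${\mathbb{R}}_{1,1}$.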

The problem of two particles in ${\mathbb{R}}_{1,1}$ is readily translated into the problem of two parallel plates of an immense planar capacitor in ${\mathbb{R}}_{1,3}$. There is only an electric field ${\bf E}$ between the plates, which is constant for any separation and velocity of the plates. The same is true for a system of $N$ charged particles which can be thought of as a system of $N$ parallel charged plates of infinite extent. An important point is that we are dealing with a well-defined problem only for systems whose total charge $Q=\sum e_{I}$ is vanishing, otherwise infrared divergence of the self-energy ensues. Indeed, restricting our consideration to a single-particle system, $Q=e\neq 0$, we find

[TABLE]

In contrast, if $Q=0$ , then the integration range in every integral of the type shown in Eq. (188) is finite, as Gauss’ law suggests, and the integrals turn out to be convergent.

The resulting dynamics is thus not subject to rearrangement. The equation of motion for the $I$th particle in which the general retarded solution to Maxwell’s equations is used reads

[TABLE]

The set of $N$ ordinary differential equations of the form of Eq. (189) is integrable. Exact solutions to (189) [99] show that every particle moves along a hyperbolic world line. Therefore, the Maxwell–Lorentz electrodynamics in ${\mathbb{R}}_{1,1}$ is locally reversible despite the fact that the retarded boundary condition has been switched on.

Surprisingly, however, the global dynamical picture is irreversible. Since this subtle feature of the two-dimensional dynamics, underlying the mechanism of self-interaction in classical strings, will be reviewed in Sec. 5, we will defer its consideration until then.

It is interesting to compare the situation in a genuine two-dimensional world with that in an effective two-dimensional world which arises when a charged particle is constrained to move along a straight line in ambient space. The Abraham–Lorentz–Dirac equation (86) then reduces to Eq. (143), whence it follows that the effective dynamics is irreversible. To explain the discrepancy, let us note that only kinematical aspects of the effective description are in fact two-dimensional, whereas self-interaction still remains four-dimensional, because its features are attributable to the four-dimensional Liénard–Wiechert field developed in ambient spacetime.

Why is the Maxwell–Lorentz electrodynamics in ${\mathbb{R}}_{1,1}$ unaffected by rearrangement? What is the dissimilarity between ${\mathbb{R}}_{1,3}$ and ${\mathbb{R}}_{1,1}$ that may render unstable systems stable? The automorphism group of ${\mathbb{R}}_{1,3}$ is the semidirect product of the Lorentz group SO$(1,3)$ and the translation group $T_{4}$, while that of ${\mathbb{R}}_{1,1}$ is the semidirect product of SO$(1,1)$ and $T_{2}$. The geometrical dissimilarity stands out: SO$(1,3)$ is non-Abelian, and SO$(1,1)$ is Abelian. It seems plausible that just this distinctive feature resolves the contradiction between the manifestations of electromagnetic self-interaction in ${\mathbb{R}}_{1,3}$ and ${\mathbb{R}}_{1,1}$.

3.2.3 ${\mathbb{R}}_{1,5}$

To grasp the distinctive features of self-interaction in the six-dimensional Maxwell–Lorentz theory, let us turn to the ultraviolet behavior of the retarded field due to a point charge,

[TABLE]

The electromagnetic six-momentum $P^{\mu}$ would result from integrating $\Theta^{\mu\nu}$ over a five-dimensional spacelike hypersurface. But the obstacle to this integration is that $F_{\mu\nu}$ exhibits non-integrable singularities on the world line. By (190) and (191),

[TABLE]

so that

[TABLE]

Since the element of measure on a five-dimensional spacelike hyperplane is proportional to $\rho^{4}d\rho$ , the integration of $\Theta^{\mu\nu}$ results in cubic and linear divergences. It is clear from (192) and (193) that the cubic divergence occurs even in the static case. In contrast, the linear divergence, which owes its origin to the terms of $F_{\mu\nu}$ scaling as $\rho^{-2}$ , appears only for curved world lines, that is, in the case that either $a^{\mu}$ or ${\dot{a}}^{\mu}$ , or both are nonzero. This implies that the Poincaré–Planck action for a bare particle (32) must be augmented by the addition of terms containing higher derivatives of $z^{\mu}$ to absorb the linear divergence.
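The power counting for the static self-energy can be sketched for general $d$. The example below assumes that the static field of a point charge scales as $\rho^{-(d-2)}$, as Gauss’ law implies, that the energy density is quadratic in the field, and that the measure on a $(d-1)$-dimensional hyperplane is proportional to $\rho^{d-2}d\rho$; the function name is hypothetical.

```python
def static_self_energy_divergence(d):
    """Power k such that the static self-energy behaves as epsilon**(-k) for k > 0."""
    field_power = d - 2                 # |F| ~ rho**-(d-2) for a static point charge
    density_power = 2 * field_power     # Theta ~ F**2
    measure_power = d - 2               # rho**(d-2) d(rho)
    # Integrating rho**(measure_power - density_power) from the cutoff epsilon
    # upward gives epsilon**(measure_power - density_power + 1).
    return density_power - measure_power - 1
```

The result is $k=d-3$: a linear divergence for $d=4$ and a cubic one for $d=6$, in agreement with the static case discussed above. For $d=2$ the power is negative, so there is no ultraviolet divergence and the integral grows at large $\rho$ instead. The acceleration-dependent terms responsible for the linear divergence in $d=6$ are not covered by this static estimate.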

The Lagrangian with higher derivatives is said to describe rigid particles. The simplest reparametrization invariant Lagrangian for a rigid bare particle is

[TABLE]

where $m_{0}$ and $\nu_{0}$ are real parameters. The corresponding six-momentum is

[TABLE]

On dimensional grounds, it is possible to show that the linearly divergent term arising from the integration of $\Theta^{\mu\nu}$ involves ${v}^{\mu}$ and ${\dot{a}}^{\mu}$ in exactly the same way as the second term of (195) does. The cubic and linear divergences are thus eliminated through the respective renormalization of $m_{0}$ and $\nu_{0}$ .

The classical dynamics governed by the action (32)–(34) proves inconsistent for $d>4$ because ultraviolet divergences of the self-energy of a point charge proliferate with $d$, while the Poincaré–Planck term (32) does not involve enough free parameters to eliminate all the divergences through the redefinition of these parameters. If we mean to explore ${\mathbb{R}}_{1,2n-1}$ with $n>2$, preserving the laws of Maxwell’s electrodynamics encoded in the Schwarzschild and Larmor terms (33) and (34), we have to use a rigid particle dynamics. This statement has been justified from various perspectives [96], [67], [87], [158], [65], [29].

The regular way of deriving the equation of motion for a dressed charged particle is to evaluate regularized expressions for $P^{\mu}$, renormalize divergent terms, and segregate finite terms. But we take another route which requires less effort [96]. We determine the radiation rate, and then make use of the fact that the equation of motion for a dressed particle involves the projector $\stackrel{{\scriptstyle\scriptstyle v}}{{\bot}}$. This procedure is independent of a particular regularization prescription because the radiation momentum integral is always convergent.

In order to clarify the origin of the projector $\stackrel{{\scriptstyle\scriptstyle v}}{{\bot}}$, we digress for a while and remind the reader of the main implication of reparametrization invariance resulting from Noether’s second theorem [114]. Consider an infinitesimal change of the parameter of a world line,

[TABLE]

where $\epsilon(\tau)$ is an arbitrary smooth function of $\tau$ , close to zero, which becomes vanishing at the end points of integration. Variation of $\tau$ implies the corresponding variation of the world line coordinates

[TABLE]

In response to the variations (196) and (197), the action $S[z]$ varies as

[TABLE]

Let $S$ be reparametrization invariant, $\delta S=0$ . Because $\epsilon$ is an arbitrary function of $\tau$ , one concludes that

[TABLE]

This equation expresses Noether’s second theorem: if the action is invariant under the group of transformations involving an arbitrary function $\epsilon(\tau)$ , then the Eulerians ${\cal E}_{\mu}$ are linearly dependent. The identity (199) suggests that ${\cal E}_{\mu}$ contains the projection operator on a hyperplane with normal ${\dot{z}}^{\mu}$ defined by Eq. (72). This operator annihilates identically any vector parallel to ${\dot{z}}^{\mu}$ . Reparametrization invariance bears on the projection structure of the basic dynamical law for a bare particle which can be written in the form of Eq. (88). The availability of $\stackrel{{\scriptstyle\scriptstyle v}}{{\bot}}$ in the equation of motion for a dressed particle is therefore the imprint of reparametrization invariance which is to be preserved by the rearrangement.
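The linear dependence of the Eulerians can be illustrated with the simplest reparametrization invariant action, $S=-m\int d\tau\sqrt{{\dot z}^{2}}$. The sketch below assumes that the identity in question takes the form ${\dot z}^{\mu}{\cal E}_{\mu}=0$ with ${\cal E}^{\mu}\propto\frac{d}{d\tau}\left({\dot z}^{\mu}/\sqrt{{\dot z}^{2}}\right)$, and tests it on an arbitrarily chosen timelike curve in ${\mathbb{R}}_{1,1}$ with an arbitrary parametrization; the curve and the function names are illustrative only.

```python
import math

def zdot(tau):
    """Tangent of the illustrative timelike curve z = (tau + tau**3, tau**2/2)."""
    return (1.0 + 3.0 * tau * tau, tau)

def unit_tangent(tau):
    z0, z1 = zdot(tau)
    norm = math.sqrt(z0 * z0 - z1 * z1)   # sqrt(zdot^2) in the metric diag(1, -1)
    return (z0 / norm, z1 / norm)

def eulerian(tau, m=1.0, h=1e-5):
    """E^mu = m d/dtau (zdot^mu / sqrt(zdot^2)) by central difference (sign conventional)."""
    up, dn = unit_tangent(tau + h), unit_tangent(tau - h)
    return tuple(m * (up[i] - dn[i]) / (2 * h) for i in range(2))

def noether_contraction(tau):
    """zdot^mu E_mu, which should vanish identically along the curve."""
    e, z = eulerian(tau), zdot(tau)
    return z[0] * e[0] - z[1] * e[1]
```

Although the Eulerian itself is nonzero on this curved world line, its contraction with ${\dot z}^{\mu}$ vanishes up to the finite-difference error, which is the projection structure referred to above.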

With reference to the general expression for $2n$ -dimensional radiated energy-momentum, Eq. (183), we specify it to $d=6$ :

[TABLE]

To make the solid angle integration, we apply the readily derivable formulas

[TABLE]

which gives

[TABLE]

and

[TABLE]

One can easily verify the inequality

[TABLE]

which evidences that ${\cal P}^{0}$ represents positive field energy flowing outward from the source.

The bound momentum contains divergent and finite parts, ${\mathfrak{p}}^{\mu}=p_{\hskip 1.42262pt\rm div}^{\mu}+p_{\hskip 1.42262pt\rm fin}^{\mu}$ . The finite part $p_{\hskip 1.42262pt\rm fin}^{\mu}$ is free of dimensional parameters other than the overall factor $e^{2}$ , and inherits the appropriate dimension from kinematical variables:

[TABLE]

where $c_{1}$ , $c_{2}$ , and $c_{3}$ are numerical coefficients. The presence of $\stackrel{{\scriptstyle\scriptstyle v}}{{\bot}}$ in the equation of motion for a dressed particle implies the transversality condition

[TABLE]

With the identities

[TABLE]

this gives

[TABLE]

The kinematical structure of the divergent part $p_{\hskip 1.42262pt\rm div}^{\mu}$ is similar to that of the bare particle momentum (195). We therefore should renormalize $m_{0}$ and $\nu_{0}$ in $p_{\hskip 1.42262pt0}^{\mu}$ , and combine it with $p_{\hskip 1.42262pt\rm fin}^{\mu}$ , Eq. (209), to yield

[TABLE]

The six-momentum ${\wp}^{\mu}$ extracted from an external field $F^{\mu\nu}_{\hskip 0.85358pt\rm ex}$ is found by integrating the mixed term of the stress-energy tensor $\Theta^{\mu\nu}_{\hskip 0.85358pt\rm mix}$ over a tube ${T}_{\epsilon}$ of small radius $\epsilon$ enclosing the world line,

[TABLE]

whence

[TABLE]

The local energy-momentum balance reads

[TABLE]

Substituting (210), (204), and (212) in (213), we obtain the equation of motion for a dressed charged particle

[TABLE]

where

[TABLE]

In an explicit form, this equation is written as

[TABLE]

A surprising result is the occurrence of two different six-momenta $\mathfrak{p}^{\mu}$ and $p^{\mu}$, defined by Eqs. (210) and (215). Each can be recognized as the energy-momentum of the dressed particle: $\mathfrak{p}^{\mu}$ plays this role in the balance equation (213), whereas Newton’s second law (214) tells us that the dressed particle is endowed with the six-momentum $p^{\mu}$.

The rearrangement outcome, Eqs. (213) and (214), has been obtained with the aid of the condition of transversality (207), which greatly alleviates the problem. However, this trick is inadequate for $d\geq 8$ [110]. In higher dimensions, it seems impossible to avoid tedious calculations with explicit regularizations and choosing counterterms, similar to those proposed in [87], [65].

When comparing the symmetries behind the rearrangement in four and six dimensions, one further comment is in order. It has been shown in Sec. 3.1.1 that the retarded field $F$ generated by a point charge in ${\mathbb{R}}_{1,3}$, Eq. (55), is a decomposable 2-form invariant under local SL$(2,{\mathbb{R}})$ transformations. Recall that sl$(2,{\mathbb{R}})$ is equivalent to sp$(1)$, see, e.g., [17]. In contrast, the retarded field $F$ in ${\mathbb{R}}_{1,5}$, Eqs. (190) and (191), contains two exterior products. The 2-form $F$ of this structure is invariant under similar transformations forming the Sp$(2)$ group. A key geometrical distinction between these groups is that Sp$(2)$ is non-Abelian, and Sp$(1)$ is Abelian. It is conceivable that this distinction may explain the fact that the six-momentum of a dressed particle appears in two guises, $\mathfrak{p}^{\mu}$ and $p^{\mu}$, while its four-dimensional counterpart $p^{\mu}$ is uniquely defined. In Sec. 6, some evidence in support of this suggestion will arise in a wider mathematical context of the Banach–Tarski theorem.

3.3 Massless charged particles

The idea of zero-mass particles has several essential aspects. We begin with the very possibility of giving a consistent Lagrangian formulation of a modified Maxwell–Lorentz electrodynamics involving massless charged particles. Imagine a particle which is moving along a smooth null world line,

$${\dot{z}}^{2}=0\,.\qquad(218)$$

Here, $z_{\mu}(\tau)$ stands for a curve parametrized by a monotonically increasing variable $\tau$ associated with the evolution in time. We differentiate Eq. (218) to give

$${\dot{z}}\cdot{\ddot{z}}=0\,.\qquad(219)$$

Because ${\dot{z}}_{\mu}$ is null, ${\ddot{z}}_{\mu}$ may be either spacelike or lightlike, aligned with ${\dot{z}}_{\mu}$ . If ${\ddot{z}}^{2}<0$ , then the trajectory is bent. As an illustration, we refer to a particle that orbits the origin in a circle of radius $r_{0}$ at a constant angular velocity of $1/r_{0}$ . The world line is a helical null curve of radius $r_{0}$ wound around the time axis. On a large scale, the particle traverses timelike intervals.
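The circular-orbit illustration can be checked directly. The sketch below assumes the parametrization $z^{\mu}(\tau)=\left(\tau,\,r_{0}\cos(\tau/r_{0}),\,r_{0}\sin(\tau/r_{0}),\,0\right)$ with $\tau$ the laboratory time and the signature $(+,-,-,-)$; the function names are hypothetical.

```python
import math

ETA = [1.0, -1.0, -1.0, -1.0]  # signature (+, -, -, -)

def dot(a, b):
    """Minkowski scalar product of two four-vectors."""
    return sum(g * x * y for g, x, y in zip(ETA, a, b))

def velocity(tau, r0):
    """First derivative of z = (tau, r0 cos(tau/r0), r0 sin(tau/r0), 0)."""
    return [1.0, -math.sin(tau / r0), math.cos(tau / r0), 0.0]

def acceleration(tau, r0):
    """Second derivative of the same helical world line."""
    return [0.0, -math.cos(tau / r0) / r0, -math.sin(tau / r0) / r0, 0.0]

r0, tau = 2.0, 0.7
zdot, zddot = velocity(tau, r0), acceleration(tau, r0)
print(dot(zdot, zdot))    # null tangent
print(dot(zdot, zddot))   # orthogonality obtained by differentiating the null condition
print(dot(zddot, zddot))  # equals -1/r0**2, so the acceleration is spacelike
```

With these assumptions the tangent is null at every $\tau$, while ${\ddot{z}}^{2}=-1/r_{0}^{2}<0$, which is the bent-trajectory case described above.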

If ${\ddot{z}}^{2}=0$ , then ${\ddot{z}}_{\mu}$ and ${\dot{z}}_{\mu}$ are parallel, and the trajectory is straight. Although ${\ddot{z}}_{\mu}$ has nonzero components, the motion is uniform. Indeed, whatever the evolution parameter $\tau$ , the history is depicted by a straight world line aligned with the null vector ${\dot{z}}_{\mu}$ . Therefore, ${\ddot{z}}_{\mu}$ is an artefact concerning the choice of $\tau$ for parametrizing the world line rather than a quantity related to actual acceleration. To put it otherwise, planar world lines, other than straight lines, are unrelated to the history of particles moving with the speed of light. The properties of null curves are discussed at greater length in [36].

It is reasonable to suppose that massless particles move along null world lines. The Poincaré–Planck action (32) is unsuited for such particles because the four-momentum $p^{\mu}$ derived from it vanishes as $m_{0}\to 0$ , and the dynamics proves to be trivial.

In contrast, the action

[TABLE]

proposed in [39], is sound for both massive and massless particles. Here, $\eta$ is an auxiliary dynamical variable, called “einbein”. We assume that $\eta$ transforms as

[TABLE]

in response to reparametrizations ${\tau}\to{\bar{\tau}}$ . With this transformation law for $\eta$ , the action (220) is reparametrization invariant.

Varying the action (220) with respect to $\eta$ gives

[TABLE]

For $m_{0}\neq 0$ , the solution to this equation is

[TABLE]

Substitution of (223) in (220) regains (32). Therefore, the action (220) is equivalent to the Poincaré–Planck action (32) provided that the Euler–Lagrange equation (222) is taken into account.
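The elimination of the einbein can be sketched numerically. The display equations are not reproduced here, so the block assumes the Lagrangian $L=-\frac{1}{2}\left(\eta\,{\dot{z}}^{2}+m_{0}^{2}/\eta\right)$, a form consistent with the gauge condition $\eta=m_{0}/\sqrt{{\dot{z}}\cdot{\dot{z}}}$ quoted later in this section, and the Poincaré–Planck Lagrangian $-m_{0}\sqrt{{\dot{z}}\cdot{\dot{z}}}$. Function names and sample values are illustrative.

```python
import math

def einbein_lagrangian(eta, zdot_sq, m0):
    """Assumed einbein Lagrangian: -(eta * zdot^2 + m0^2 / eta) / 2."""
    return -0.5 * (eta * zdot_sq + m0**2 / eta)

def einbein_on_shell(zdot_sq, m0):
    """Stationary point of the Lagrangian with respect to eta (needs m0 != 0)."""
    return m0 / math.sqrt(zdot_sq)

def poincare_planck_lagrangian(zdot_sq, m0):
    """Lagrangian of the Poincare-Planck action, -m0 sqrt(zdot^2)."""
    return -m0 * math.sqrt(zdot_sq)

zdot_sq, m0 = 2.5, 1.3               # sample timelike velocity squared and mass
eta_star = einbein_on_shell(zdot_sq, m0)

# Central difference in eta: vanishes at the stationary point.
h = 1e-6
slope = (einbein_lagrangian(eta_star + h, zdot_sq, m0)
         - einbein_lagrangian(eta_star - h, zdot_sq, m0)) / (2 * h)
print(slope)
print(einbein_lagrangian(eta_star, zdot_sq, m0),
      poincare_planck_lagrangian(zdot_sq, m0))
```

Under these assumptions the on-shell value of the einbein Lagrangian coincides with the Poincaré–Planck one, whereas for $m_{0}=0$ the $\eta$-variation leaves only the null constraint and no equation for $\eta$ itself.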

We combine (220) with (33) and put $m_{0}=0$ to obtain the action which encodes the dynamics of a massless charged particle,

[TABLE]

Variation of (224) with respect to $\eta$ results in the basic constraint, Eq. (218), which shows that the massless particle governed by this action does move along null world lines. Note also that the resulting constraint, Eq. (218), is independent of $\eta$ .

Variation of (224) with respect to $z^{\mu}$ gives the equation of motion for a massless particle

[TABLE]

Since $\eta$ does not contribute to the other Euler-Lagrange equation, Eq. (218), this quantity is undetermined. However, we are entitled to handle the reparametrization freedom making the dynamical equations as simple as possible. For some choice of the evolution variable, the einbein can be converted to a constant, $\eta=\eta_{0}$ . Then (225) becomes

[TABLE]

which closely resembles the equation of motion for a massive particle (36).
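A quick consistency check is that an equation of motion of this type preserves the null constraint. The sketch assumes the gauge-fixed equation has the form $\eta_{0}\,{\ddot{z}}^{\mu}=e\,F^{\mu\nu}{\dot{z}}_{\nu}$; the sign and normalization are assumptions, but the conclusion rests only on the antisymmetry of $F^{\mu\nu}$. The field components below are arbitrary illustrative numbers.

```python
ETA = [1.0, -1.0, -1.0, -1.0]  # signature (+, -, -, -)

def dot(a, b):
    """Minkowski scalar product of two four-vectors."""
    return sum(g * x * y for g, x, y in zip(ETA, a, b))

def lorentz_rhs(F, zdot):
    """Contraction F^{mu nu} zdot_nu, with the index of zdot lowered by ETA."""
    lowered = [g * x for g, x in zip(ETA, zdot)]
    return [sum(F[m][n] * lowered[n] for n in range(4)) for m in range(4)]

def rate_of_zdot_squared(F, zdot, e, eta0):
    """d(zdot^2)/dtau = 2 zdot.zddot, with zddot = (e / eta0) F zdot."""
    zddot = [e / eta0 * f for f in lorentz_rhs(F, zdot)]
    return 2.0 * dot(zdot, zddot)

# Arbitrary antisymmetric array standing in for F^{mu nu}.
F = [[0.0, 1.0, -2.0, 0.5],
     [-1.0, 0.0, 0.3, -0.7],
     [2.0, -0.3, 0.0, 1.1],
     [-0.5, 0.7, -1.1, 0.0]]

null_velocity = [1.0, 0.0, 0.0, 1.0]
print(rate_of_zdot_squared(F, null_velocity, e=1.0, eta0=2.0))
```

Since ${\dot{z}}_{\mu}F^{\mu\nu}{\dot{z}}_{\nu}$ vanishes identically, a world line that starts null stays null under this evolution.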

The next important issue, concerning the high energy phenomenology, is the fact that zero-mass leptons do not appear to exist. The existence of massless charged particles is “clearly permitted by Maxwell’s equations” [37]. The same is also true for the interaction of massless particles with the Yang–Mills field. Take, for example, neutrinos which interact with the SU$(2)\times$U$(1)$ Yang–Mills field in the Standard Model. These particles were long assumed to be massless, but recent experimental data suggest that neutrinos are likely to be endowed with a finite, albeit small, mass. On the other hand, it is commonly supposed that quarks in quark–gluon plasma may reveal themselves as zero-mass particles because deconfinement triggers the chiral-symmetry-restoring phase transition, whereby quarks become massless. Were such indeed the case, the disparity between realizable zero-mass quarks and unfeasible zero-mass leptons would be of even greater concern.

However, the most important issue of the present discussion is conformal invariance. The Maxwell–Lorentz electrodynamics of massless charged particles, as will soon become clear, does not experience rearrangement. Both dressing and radiation are absent from this theory [101]. If conformal invariance is overlooked, then one may be under the wrong impression that this theory is amenable to the rearrangement [88]. It is perhaps no wonder that a “massless dressed particle” is impossible to synthesize from a massless bare particle and electromagnetic field: for lack of $m_{0}$ in a scale invariant theory, the renormalization of mass is forbidden, whence it follows that the self-energy term must necessarily vanish. However, the fact that a massless charged particle moving along a curved world line does not radiate may seem surprising if one remembers that the radiation is inevitable for an accelerated charged particle having arbitrarily small mass. To explain the difference, we remark that a conformal theory need not be conceived as a continuous limit in which the terms that violate conformal invariance fade away. As discussed above, the set of allowable histories of particles moving with the speed of light is free from planar world lines, which, by contrast, are well suited for the histories of subluminal particles. In general, the set of physically allowable smooth timelike world lines does not asymptotically approach the set of physically allowable smooth null world lines.

To establish the above assertion on the absence of the rearrangement, we follow the previous line of reasoning, taking, as the starting point, the Noether identity

[TABLE]

in which some terms are slightly modified as against Eq. (38) to accommodate the fact that the bare particle under examination moves along null world lines. The proper time $s$ is unusable as a parameter of such curves, and we should look at another variable $\tau$ , for example, the laboratory time $t$ in a particular Lorentz frame. Accordingly, the modified ${\cal E}_{\mu}$ , $\varepsilon^{\lambda}$ , and $t^{\lambda\mu}$ are

[TABLE]

while ${\cal E}^{\lambda\mu\nu}$ and $\Theta^{\lambda\mu}$ are given, as before, by Eqs. (37) and (40).

Evidently

[TABLE]

This implies that the system is invariant under the group of conformal transformations C $(1,3)$ [24], [64].

The retarded electromagnetic field due to a charge moving along a null world line is

[TABLE]

The first term $F^{\rm r}_{\mu\nu}$ (r for regular) is

[TABLE]

Here, all kinematical variables refer to the point $\tau_{\rm ret}$ . The scalar

[TABLE]

measures the separation between $z^{\mu}(\tau_{\rm ret})$ and $x^{\mu}$ . To see this, let us choose a particular Lorentz frame in which

[TABLE]

From $R\cdot{\dot{z}}=r\left(1-\cos\vartheta\right)$ it follows that $\rho$ varies smoothly from 0 to $\infty$ as $x^{\mu}$ moves away from $z^{\mu}(\tau_{\rm ret})$, except for the case that $R^{\mu}$ points in the direction of ${\dot{z}}^{\mu}$. The surface swept out by the singular ray ${R}_{\mu}$ aligned with the tangent vectors ${\dot{z}}_{\mu}$ forms a two-dimensional warped manifold ${\cal M}_{2}$.

The second term $F^{\rm ir}_{\mu\nu}$ (ir for irregular) in Eq. (232) is

[TABLE]

Since ${\dot{z}}^{2}=0$ , the irregular term $F^{\rm ir}_{\mu\nu}$ is everywhere zero except for the manifold ${\cal M}_{2}$ .

Note that the Gauss’ law integral is saturated with the ${\bf E}^{\rm ir}$ alone. Indeed, integrating ${\bf E}^{\rm ir}$ over a sphere $r=\ell$ in a Lorentz frame, in which ${\dot{z}}^{\hskip 0.85358pt\mu}$ and $R^{\mu}$ take the form shown in Eq. (236), we obtain

[TABLE]

and furthermore, the same surface integration of ${\bf E}^{\rm r}$ can be shown to be zero.

It may be worth pointing out that the factor ${\dot{z}}^{2}$ disappears from Eq. (238) because it is cancelled by the identical factor arising in the denominator owing to the solid angle integration of $\rho^{-2}$. If we had $\rho^{-s}$ with $s$ other than ${2}$, then this mechanism would fall short of the required cancellation. Consider for example the term of the stress-energy tensor $\Theta^{\rm ir}_{\mu\nu}$ built from $F^{\rm ir}_{\mu\nu}$. By (237),

[TABLE]

To define the corresponding part of four-momentum $P^{\rm ir}_{\mu}$, we integrate $\Theta^{\rm ir}_{\mu\nu}$ over the future light cone $C_{+}$, drawn from $z^{\mu}(\tau_{\rm ret})$, with the use of the surface element $d\sigma^{\mu}=R^{\mu}\rho\,d\rho\,d\Omega$, after first regularizing the integral over $\rho$ to make it convergent. In response to the solid angle integration of $\rho^{-2}$, the denominator gains the factor ${\dot{z}}^{2}$. However, this factor does not kill the factor $({\dot{z}}^{2})^{2}$ in the numerator, and hence the regularized integrand, involving the overall factor ${\dot{z}}^{2}$, vanishes. In the limit of regularization removal, we have $P^{\rm ir}_{\mu}=0$.

As to the term of stress-energy tensor containing mixed contribution of $F^{\rm r}_{\mu\nu}$ and $F^{\rm ir}_{\mu\nu}$ , it is not unduly difficult to show that this term, being contracted with the surface element of the future light cone $C_{+}$ , is annihilated by ${R}^{2}=0$ and ${\dot{z}}^{2}=0$ .

This completes the proof of our assertion that the self-energy term is vanishing.

Turning to $F^{\rm r}_{\mu\nu}$ , we first note that both invariants ${\cal P}=\frac{1}{2}\,{}^{\ast}\!F^{\rm r}_{\mu\nu}F^{{\rm r}\hskip 1.42262pt\mu\nu}$ and ${\cal S}=\frac{1}{2}\,F^{\rm r}_{\mu\nu}F^{{\rm r}\hskip 1.42262pt\mu\nu}$ are zero. In other words, $F^{\rm r}_{\mu\nu}$ is a null field over all spacetime minus the manifold ${\cal M}_{2}$ .

The term of the stress-energy tensor $\Theta_{\rm r}^{\mu\nu}$ built from $F^{\rm r}_{\mu\nu}$ ,

[TABLE]

has an integrable singularity on the world line. To evaluate the four-momentum associated with $\Theta_{\rm r}^{\mu\nu}$ , we take, as before, the surface of integration to be a tubular surface ${T}_{\ell}$ of small radius $\ell$ enclosing the world line. It remains to manage the singularity on ${\cal M}_{2}$ . A pertinent regularization prescription is to perforate a hole in the intersection of ${T}_{\ell}$ with ${\cal M}_{2}$ . Let the normal of ${T}_{\ell}$ be $n^{\mu}=\left(0,{\bf n}\right)$ , where the unit vector ${\bf n}$ is defined in Eq. (236), then

[TABLE]

so that the part of the four-momentum associated with $\Theta_{\rm r}^{\mu\nu}$ is

[TABLE]

Here, $\Lambda=4\,{\delta}^{-6}$ , and $\delta$ is a small regularization parameter, the lower limit of integration over $\vartheta$ , required for smearing the ray singularity. In Eq. (242), we have omitted finite terms which are negligibly small in comparison with the term proportional to ${\delta}^{-6}$ .

At first glance $P^{\mu}_{\rm r}$ is the four-momentum radiated by a charge moving along a null world line. But closer inspection shows that the contribution of $P^{\mu}_{\rm r}$ to the energy-momentum balance equation is absorbed by some reparametrization of the null curve. To see this, we reiterate mutatis mutandis the argument used for the derivation of Eq. (83) from the Noether identity (38) to attain

[TABLE]

where $\Sigma^{\prime}$ and $\Sigma^{\prime\prime}$ are spacelike surfaces separated by a short timelike interval. If we impose the Haag asymptotic condition

[TABLE]

then the integration over the tube $T_{R}$ approaches zero as $R\to\infty$ . Only $F^{\rm r}_{\mu\nu}$ contributes to the integrations over $\Sigma^{\prime}$ and $\Sigma^{\prime\prime}$ , and the result of such integrations is typically expressed by Eq. (242). Using (242) in (243) gives

[TABLE]

The first and the last terms have similar kinematical structures. This suggests that there is a particular parametrization ${\bar{\tau}}$ such that these terms cancel. To verify this suggestion, we go from $\tau$ to ${\bar{\tau}}$ through the reparametrization

[TABLE]

In fact, Eqs. (246) and (221) constitute a set of two functional differential equations in which ${\bar{\tau}}={X}\left(\tau\right)$ and ${\bar{\eta}}=Y\left[\eta(\tau);{\bar{\tau}}(\tau)\right]$ are the unknown quantities, and appropriate solutions to these equations can hopefully be found. By (221),

[TABLE]

and

[TABLE]

whence it follows that the term $-\frac{2}{3}\,e^{2}\Lambda\,{\ddot{z}}^{2}{\dot{z}}^{\lambda}$ disappears from Eq. (245).

We thus come to recognize that the net effect of the putative radiated four-momentum is actually removable through an appropriate reparametrization of the null world line.

We finally compare these results with those obtained in the Maxwell–Lorentz theory of massive charged particles. It is reasonable to begin with the Noether identity (227), which is universally suitable for both massive and massless cases. If $m_{0}\neq 0$, then the usual way to explore this identity further is to consider $\eta$ to be the solution (223) of the constraint equation (222), which implies that the world line is parametrized by the proper time $ds=d\tau\,\sqrt{{\dot{z}}\cdot{\dot{z}}}$. However, there is nothing to prevent us from following the above route. In doing so, we come to Eq. (243). A closer look at the integrals of $\Theta^{\lambda\mu}$ over $\Sigma^{\prime}$ and $\Sigma^{\prime\prime}$, which represent the increment of the four-momentum of electromagnetic field for the period $\tau^{\prime\prime}-\tau^{\prime}$, reveals a dramatically different situation. For a particle moving along a timelike world line,

[TABLE]

Evidently the term $-{2\over 3}\,e^{2}\,{{\stackrel{{\scriptstyle\ldots}}{{z}}}}^{\hskip 0.85358pt\lambda}$ cannot be cancelled by other terms of Eq. (249), whatever the parametrization of the world line. Furthermore, $\left({e^{2}/2\epsilon}\right){\ddot{z}}^{\lambda}$ is divergent. For this divergence to be absorbed by the mass renormalization, the gauge must be fixed, $\eta=m_{0}/\sqrt{{\dot{z}}\cdot{\dot{z}}}$, which implies that the world line is parametrized by the proper time. Accordingly, the term $-{2\over 3}\,e^{2}\,{\ddot{z}}^{\hskip 0.85358pt2}{\dot{z}}^{\lambda}$ survives in the energy-momentum balance.

It was emphasized in Sec. 1.2 that the responsibility for rearranging the initial degrees of freedom rests with instabilities peculiar to the system. For lack of the rearrangement, the system with the action (224) is stable, which is likely to be due to the conformal symmetry group C$(1,3)$. Recall that C$(1,3)$ is the lowest dimensional group containing the Poincaré group. Furthermore, C$(1,3)$ is semisimple, even though the Poincaré group is the semidirect product of the Lorentz and translation groups. Admittedly, however, why the features of this fascinating symmetry provide a way for the acquisition of stability remains something of a mystery.

Of special note is that the Yang–Mills–Wong theory of zero-mass particles carrying the pertinent non-Abelian charges also enjoys the property of conformal invariance. The absence of the rearrangement from this theory can lead to far-reaching speculations. As stated above, quarks in quark–gluon plasma are most likely to be massless. Such quarks do not emit electromagnetic and Yang–Mills radiation, and hence do not lose their energy in collisions. This might help to illuminate the fact that the quark–gluon plasma is the most perfect fluid ever observed, see, e. g., [144].

3.4 Action at a distance

The Maxwell–Lorentz electrodynamics of $N$ charged particles can be recast in terms of the direct interparticle action, proposed by Fokker [59],

[TABLE]

where capital Latin letters are used to label the particles. The distinctive feature of this formulation of electrodynamics is the presence of retarded and advanced interactions on an equal footing. Owing to the delta-function, the typical points $z_{I}$ and $z_{J}$ on the $I$th and $J$th world lines can be thought of as “interacting” if they are connected by a null interval, which is a relativistic generalization of interactions by contact occurring at zero distance. The Fokker action (250) does not contain unconstrained field degrees of freedom. It is as if particle $I$ were affected by particle $J$ directly, that is, without mediation of the electromagnetic field. Accordingly, self-interaction seems to be absent from (250).

Wheeler and Feynman assumed [153], [154] that radiation is completely absorbed. To clarify the precise meaning of this assumption, we first decompose the retarded Green’s function $D_{\rm ret}$ into even and odd parts by introducing the advanced Green’s function $D_{\rm adv}$ :

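$$D_{\rm ret}(x)=\frac{1}{2}\left[D_{\rm ret}(x)+D_{\rm adv}(x)\right]+\frac{1}{2}\left[D_{\rm ret}(x)-D_{\rm adv}(x)\right].\qquad(251)$$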

The even part $D_{P}(x)=\frac{1}{2}\left[D_{\rm ret}(x)+D_{\rm adv}(x)\right]=\delta(x^{2})$ is a solution to the inhomogeneous wave equation with the delta-function source,

[TABLE]

while the odd part $D(x)=\frac{1}{2}\left[D_{\rm ret}(x)-D_{\rm adv}(x)\right]={\rm sgn}\,(x_{0})\,\delta(x^{2})$ obeys the homogeneous wave equation

$$\Box\,D(x)=0\,.\qquad(253)$$

Turning to the dynamics underlying the Fokker action (250), we note that the interactions between particles are such that they simulate electromagnetic field between them: the vector potential and the field strength adjunct to particle $I$ are given by half the retarded and half the advanced solutions to Maxwell’s equations,

[TABLE]

where

[TABLE]

is the current density of the $I$th charged point particle. These quantities satisfy the wave equation and the Lorenz gauge condition:

[TABLE]

In contrast,

[TABLE]

is the vector potential of a free field obeying the homogeneous wave equation

[TABLE]

because $\Box_{x}D(x-y)=0$ , as indicated by Eq. (253).

With zero initial data on a spacelike hyperplane $\Sigma$, ${\bar{A}}^{\mu}_{I}|_{\Sigma}=0$ and $\left(n\cdot\partial\right){\bar{A}}^{\mu}_{I}|_{\Sigma}=0$, the solution to the Cauchy problem for the wave equation (260) is trivial, ${\bar{A}}^{\mu}_{I}(x)=0$.

Imagine for a while that only $N$ charged particles are in the Universe, and ${\bar{A}}^{\mu}(x)$ is the total free field adjunct to these particles, ${\bar{A}}^{\mu}=\sum{\bar{A}}^{\mu}_{I}$ . If ${\bar{A}}^{\mu}(x)$ vanishes at one time, then it is zero at all times. Wheeler and Feynman [153], [154] adopted

[TABLE]

as a supplementary constraint to the action (250), and interpreted it as the condition of total absorption (for an extended discussion see [117], [79]). Since then, this approach has often been referred to as the absorber theory of radiation.

However, the fact that ${\bar{A}}^{\mu}(x)$ is vanishing does not amount to the lack of radiation in the sense of the definition (64)–(66). Rather, this fact suggests that the Fokker action (250) is inadequate to describe the system completely. The system under examination is actually the union of $N$ particles undergoing the direct mutual interactions and the fabric of spacetime which is integrated in the system by Eq. (261).

Let us express the action (250) in terms of $A^{\mu}_{I}(x)$ , and vary the $I$ th world line,

[TABLE]

to obtain

[TABLE]

This equation differs from the equation of motion for a bare charged particle in that the Lorentz force exerted on particle $I$ involves the symmetric combination (half-retarded plus half-advanced) of fields due to all particles, except that for particle $I$ itself:

[TABLE]

For lack of the self field, the usual infinities associated with it do not occur, and so there is no need to renormalize $m_{I}$ .

On the other hand, Eq. (263) looks quite different from the Abraham–Lorentz–Dirac equation governing the behavior of a dressed charged particle. Note, however, that the Wheeler–Feynman condition (261) still remains untapped. One can recast (264) as

[TABLE]

where the last term is the sum over all particles. By (261),

[TABLE]

at every point on the world line of particle $I$ , and so (264) takes the form

[TABLE]

The expression in the square bracket can be elaborated further, as in Sec. 3.1.4, to give the Abraham term $\Gamma^{\mu}$ , and Eq. (263) becomes

[TABLE]
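The algebra behind this rearrangement can be sketched with plain numbers. In the block below each retarded or advanced field evaluated on the world line of particle $I$ is represented by a single illustrative component; the identity being checked, $\sum_{J\neq I}\frac{1}{2}\left(F^{J}_{\rm ret}+F^{J}_{\rm adv}\right)=\sum_{J\neq I}F^{J}_{\rm ret}+\frac{1}{2}\left(F^{I}_{\rm ret}-F^{I}_{\rm adv}\right)-\frac{1}{2}\sum_{J}\left(F^{J}_{\rm ret}-F^{J}_{\rm adv}\right)$, is the standard Wheeler–Feynman decomposition and is assumed to be what the unreproduced display equations express. Function names and sample values are hypothetical.

```python
def symmetric_force_field(ret, adv, i):
    """Half-retarded plus half-advanced field of every particle except i."""
    return sum(0.5 * (ret[j] + adv[j]) for j in range(len(ret)) if j != i)

def rearranged_field(ret, adv, i):
    """Retarded field of the others + self radiation term - total free field."""
    others_retarded = sum(ret[j] for j in range(len(ret)) if j != i)
    self_term = 0.5 * (ret[i] - adv[i])
    total_free_field = 0.5 * sum(r - a for r, a in zip(ret, adv))
    return others_retarded + self_term - total_free_field

# Illustrative single-component field values for four particles.
ret = [0.8, -1.3, 2.1, 0.4]
adv = [0.5, 0.9, -0.7, 1.6]

print(symmetric_force_field(ret, adv, 2))
print(rearranged_field(ret, adv, 2))
```

The two expressions agree for any values of the fields; imposing the absorber condition then removes the total free field and leaves the retarded fields of the other particles plus the self term that gives rise to the Abraham term.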

In summary, combining Eq. (263) with the Wheeler–Feynman constraint (266) results in the conventional equation of motion for a dressed particle in the retarded field of all other particles. Furthermore, Eq. (268) is equivalent to the local energy-momentum balance, Eq. (85),

[TABLE]

implying that radiation effects have been properly incorporated in this description.

Wheeler and Feynman assumed that the total matter in the Universe behaves as a perfect absorber, and proposed Eq. (266) as a cosmological absorber condition. If we keep track of particle $I$, then the radiation of this particle is to be completely absorbed by other particles. The absorber exerts on particle $I$ a force which is the sum of retarded forces due to other particles, and endows it with the four-momentum $p^{\mu}_{I}=m_{I}v^{\mu}-\frac{2}{3}\,e^{2}_{I}a^{\mu}_{I}$. The rearrangement of the initial degrees of freedom appearing in the action (250) has thus been attained in a somewhat meandering way. The system disguises the radiation, but the effect of dressing can be discerned in the local energy-momentum balance, Eq. (269).

A similar procedure can be readily developed for any theory with linear field equations to convert it to a theory of direct interparticle action. However, this approach is unsound for systems with nonlinear field equations, such as the Yang–Mills and Einstein equations. The reason for this is that the Green’s function method, versatile enough to get rid of reference to field degrees of freedom, is unfit for use in nonlinear theories.

3.5 Nonlinear electrodynamics

The term “nonlinear electrodynamics” is usually taken to mean a generalization of the Larmor action (34) in which the Lagrangian ${\cal L}$ is nonlinear in the invariants ${\cal S}=\frac{1}{2}\,F_{\mu\nu}F^{\mu\nu}$ and ${\cal P}=\frac{1}{2}\,F_{\mu\nu}{}^{\ast}\!F^{\mu\nu}$, where the field strength is expressed in terms of vector potentials, $F_{\mu\nu}=\partial_{\mu}A_{\nu}-\partial_{\nu}A_{\mu}$, and the dual field tensor is given by ${}^{\ast}\!F_{\mu\nu}=\frac{1}{2}\,\epsilon_{\mu\nu\alpha\beta}F^{\alpha\beta}$. The best known theory of this type, proposed by Born and Infeld [38], is characterized by

[TABLE]

Here and throughout this section, Heaviside units are adopted, so that the factor of $1/4\pi$ is absent from the field Lagrangians; ${b}$ stands for a constant having dimension of the field strength. For weak fields, ${\cal L}_{\rm BI}$ approaches $-\frac{1}{2}\,{\cal S}$ which is the Larmor Lagrangian, Eq. (34).

An unpleasant novelty related to nonlinear electrodynamics is that it allows for the creation of electromagnetic shock waves. Since coefficients of higher derivatives in the field equation are functions of $F_{\mu\nu}$ and ${}^{\ast}\!F_{\mu\nu}$, the equation for determining the normals to the characteristic surfaces may have degenerate real solutions, which is a prerequisite to shock waves. The presence of the shock waves introduces a further topological aspect associated with uncontrollable irreversibility. The nonlinear set of hyperbolic equations in the $(1+1)$-dimensional Born–Infeld theory is unique in that their characteristics cannot intersect, and hence no electromagnetic shock wave occurs [31]. Besides, the Born–Infeld theory is the only version of nonlinear electrodynamics with a sensible weak field limit which is free of birefringence, that is, only this theory describes signal propagation along a single characteristic cone, regardless of polarization [34].

The major concern over the blowup of local interactions that was voiced in the early stages of the development of the modern field theory can be settled in the framework of classical nonlinear electrodynamics. To verify this, we begin with

[TABLE]

assuming that the Lagrangian ${\cal L}\left({\cal S},{\cal P}\right)$ reduces to $-\frac{1}{2}\,{\cal S}$ in the weak field limit. Let us define the field excitation

[TABLE]

Then the Euler–Lagrange equations resulting from (271) read

[TABLE]

This set of equations should be augmented by the addition of the Bianchi identity

[TABLE]

(which is a mere restatement of the initial assumption that the field strength is expressed in terms of vector potentials), and the constitutive equations

[TABLE]

following from (272). The constitutive equations of Maxwell’s electrodynamics are linear, $E_{\mu\nu}=-F_{\mu\nu}$ . In the general case, however, Eq. (276) need not be linear in $F^{\mu\nu}$ , hence the name nonlinear electrodynamics. For example, (270) implies the following constitutive equations

[TABLE]

where

[TABLE]

Equations (273)–(276) form the entire set of equations of nonlinear electrodynamics. It is clear from (274) and (273) that a point charge generates $E^{\mu\nu}$ but evokes response through $F^{\mu\nu}$ .

The symmetric stress-energy tensor of electromagnetic field

[TABLE]

obeys the equation

[TABLE]

so that, in a region free of electric charges,

[TABLE]

Just as $F_{\mu\nu}$ is associated with the electric field intensity ${\bf E}$ and the magnetic induction ${\bf B}$ in a particular Lorentz frame, so $E_{\mu\nu}$ can be related to the electric displacement, ${\bf D}_{i}=E_{i0}$ , and the magnetic field intensity, ${\bf H}_{k}=\frac{1}{2}\epsilon_{0klm}E^{lm}$ . Let us take a look at the static case ${\bf j}=0,\,\partial{\bf D}/\partial t=0$ . Equation (274) reduces to

[TABLE]

To be specific, we turn to the Born–Infeld theory. The constitutive equation (277) becomes

[TABLE]

The Poisson equation (282) is obeyed by the Coulomb solution

[TABLE]

which is singular at $r=0$ . However, the field strength ${\bf E}$ derived from (284) with the help of (283) is regular,

[TABLE]

Here, $\ell$ is a characteristic length related to the critical field $b$ as

[TABLE]

At large $r$ , ${\bf E}(r)$ approaches the Coulomb field. Note also that ${\bf E}(r)\to{\bf D}(r)$ as $\ell\to 0$ .
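These limiting properties can be checked numerically. The sketch below assumes the standard static Born–Infeld relations in Heaviside units: $D(r)=e/4\pi r^{2}$, ${\bf E}={\bf D}/\sqrt{1+{\bf D}^{2}/b^{2}}$, and $\ell^{2}=e/4\pi b$, which combine to $E(r)=e/\left(4\pi\sqrt{r^{4}+\ell^{4}}\right)$. Since the display equations are not reproduced here, these forms and the function names are assumptions.

```python
import math

def displacement(r, e):
    """Coulomb displacement D(r) = e / (4 pi r^2) in Heaviside units."""
    return e / (4.0 * math.pi * r**2)

def critical_field(e, ell):
    """Critical field b fixed by the assumed relation ell^2 = e / (4 pi b)."""
    return e / (4.0 * math.pi * ell**2)

def bi_field_from_constitutive(r, e, ell):
    """E = D / sqrt(1 + D^2 / b^2), the assumed static constitutive relation."""
    d = displacement(r, e)
    b = critical_field(e, ell)
    return d / math.sqrt(1.0 + (d / b) ** 2)

def bi_field(r, e, ell):
    """Closed form E(r) = e / (4 pi sqrt(r^4 + ell^4)), regular at r = 0."""
    return e / (4.0 * math.pi * math.sqrt(r**4 + ell**4))

e, ell = 1.0, 0.1
print(bi_field(0.0, e, ell), critical_field(e, ell))     # field at the charge equals b
print(bi_field(50.0, e, ell) / displacement(50.0, e))    # ratio tends to 1 far away
print(bi_field(0.3, e, ell), bi_field_from_constitutive(0.3, e, ell))
```

With these assumptions the field strength at the position of the charge equals the critical field $b$, and the Coulomb form is recovered both at large $r$ and in the limit $\ell\to 0$.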

The energy density results from (279):

[TABLE]

By (283),

[TABLE]

Using (284) in (288) gives ${\Theta}_{00}\sim 1/r^{2}$ near $r=0$ , but this singularity is integrable, and hence the self-energy is finite,

[TABLE]
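The finiteness of the self-energy can be illustrated by direct quadrature. The block assumes the static energy density $\Theta_{00}=b^{2}\left(\sqrt{1+{\bf D}^{2}/b^{2}}-1\right)$ with $D=e/4\pi r^{2}$ and $\ell^{2}=e/4\pi b$, so that the total energy is $4\pi b^{2}\ell^{3}I$ with $I=\int_{0}^{\infty}\left(\sqrt{1+x^{4}}-x^{2}\right)dx$. These forms are assumptions consistent with the $1/r^{2}$ behaviour quoted above, not a transcription of the paper's equations. The substitution $x=t/(1-t)$ maps the integral onto $[0,1]$ with a smooth integrand.

```python
import math

def self_energy_integral(n=20000):
    """Midpoint rule for I = int_0^1 dt / (t^2 + sqrt(t^4 + (1 - t)^4))."""
    h = 1.0 / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * h
        total += 1.0 / (t * t + math.sqrt(t**4 + (1.0 - t) ** 4))
    return total * h

def closed_form_integral():
    """Analytic value of the same integral, Gamma(1/4)^2 / (6 sqrt(pi))."""
    return math.gamma(0.25) ** 2 / (6.0 * math.sqrt(math.pi))

def self_energy(e, ell):
    """Total field energy 4 pi b^2 ell^3 I, with b = e / (4 pi ell^2)."""
    b = e / (4.0 * math.pi * ell**2)
    return 4.0 * math.pi * b**2 * ell**3 * self_energy_integral()

print(self_energy_integral(), closed_form_integral())
print(self_energy(1.0, 1.0))
```

The dimensionless integral is approximately 1.236, so under these assumptions the self-energy equals $e^{2}I/4\pi\ell$: finite for any $\ell>0$ and divergent as $1/\ell$ in the Maxwell limit $\ell\to 0$.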

In addition, the Born–Infeld theory shows clear evidence that a static point charge is stable because it is free of tearing strains. Consider the force exerted on the charge within an infinitesimal solid angle:

[TABLE]

This $d{\bf F}$ is balanced by the infinitesimal force equal in magnitude and opposite in direction. Both are applied at the same point, so that the net effect is zero. One can then explain the stability of a point charge in the Maxwell–Lorentz theory by taking (290) as a regularized expression for the infinitesimal force, integrating this expression over solid angle, and taking the limit $\ell\to 0$.

The Born–Infeld electrodynamics may thus be understood as a modification of the Maxwell–Lorentz theory such that the product of singular distributions turns out to be well defined.

The solution (285) for a charge at rest can be readily generalized for a uniformly moving charge by a Lorentz boost. However, solutions for an arbitrarily moving charge, similar to the retarded Liénard–Wiechert solution of Maxwell’s theory, have not yet been found. We have no inkling of what the mechanism of rearrangement in nonlinear electrodynamics is.

For an extended discussion of the Born–Infeld electrodynamics we refer the reader to [145] and [28]. The history and some remarkable features of this theory can be learned from the essay [27]. One passage of it reads: “There is no evidence that their theory has any direct connection with the physical reality”. However, as soon as two years later, the Born–Infeld Lagrangian was found to be the low energy effective Lagrangian of gauge fields on open strings [60].

3.6 Nonlocal interactions

Another way of avoiding the blowup inherent in local field theories is to “smear out” the interaction over a small spacetime region. Early attempts to render the interaction nonlocal were focused on the search for suitable form factors in the interaction,

[TABLE]

The form factor $F$ was conceived as a smooth function of $(x-y)^{2}$ which looks like a sharp pulse normalized to unit area, for instance

[TABLE]

Two closely related problems of nonlocal theories of this type are causality violations and angular divergences [89]. In fact, there is much evidence that nonlocal interactions can be mediated by superluminal signals. The angular divergences occur because one must integrate over an infinite range of hyperbolic angles parametrizing the pseudoeuclidean momentum space: a form factor like that shown in Eq. (292) prevents the Wick rotation that would euclideanize the description. To meet the challenge, the support of $F(x-y)$ must be compact, and its characteristic size $\ell$ small. Note, however, that the topology of four-dimensional real Euclidean space is distinctly different from that of Minkowski space [159].

Let $F$ be a function of $(x-y)^{2}$ . Then the invariant region where the acausal effects are confined,

[TABLE]

is noncompact: near the light cone $(x-y)^{2}=0$ , the extension of this region in spatial and temporal directions is arbitrarily large. Alternatively, it is possible to use a unit vector $q^{\mu}$ for constructing a positive definite quadratic form

[TABLE]

and take $F$ to be a function of $d(x,y)$ to limit the acausal effects to a compact, invariant region $d(x,y)\leq\ell^{2}$ . This brings up the question: Where can the $q^{\mu}$ come from? We may think of $q^{\mu}$ as the four-velocity $v^{\mu}$ of some particle. However, in the absence of particles, we are forced to use a fixed unit vector $q^{\mu}$ , which would distinguish a privileged frame of reference, and violate explicit Lorentz invariance. We may also regard $q^{\mu}$ as an auxiliary unit vector, and average $F$ over directions of $q^{\mu}$ , but this procedure appears too arbitrary.
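The contrast between the two regions can be illustrated numerically. The following is a minimal sketch, assuming the metric signature $(+,-,-,-)$ and assuming that the positive definite form is $d=2\,(q\cdot\Delta)^{2}-\Delta^{2}$ with $\Delta=x-y$; this is a common choice and may differ in detail from (294). The function names and the sample separations are hypothetical.

```python
def minkowski_sq(dx):
    """Invariant interval (x-y)^2 with signature (+,-,-,-)."""
    t, x, y, z = dx
    return t * t - x * x - y * y - z * z


def dot(a, b):
    """Minkowski scalar product a.b with signature (+,-,-,-)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


def positive_form(dx, q):
    """Assumed form d = 2 (q.dx)^2 - dx^2 for a unit timelike vector q."""
    return 2.0 * dot(q, dx) ** 2 - minkowski_sq(dx)


q_rest = (1.0, 0.0, 0.0, 0.0)  # unit timelike vector, q.q = 1

# Lightlike separations of growing size: the interval stays zero,
# so these points never leave a region defined through (x-y)^2 alone,
# while the positive definite form grows without bound.
for scale in (1.0, 1e3, 1e6):
    dx = (scale, scale, 0.0, 0.0)
    print(scale, minkowski_sq(dx), positive_form(dx, q_rest))
```

In the frame where $q^{\mu}=(1,{\bf 0})$ the assumed form reduces to $t^{2}+r^{2}$, so the condition $d\leq\ell^{2}$ bounds both the temporal and the spatial extension.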

Another line of attack for overcoming these difficulties is as follows. Let a form factor be obtained by acting a function $K$ of the d’Alembertian ${\Box}$ on the Dirac delta-function,

[TABLE]

The relativistic invariance of $F(x-y)$ is apparent: $\delta^{4}(x-y)$ is invariant under Poincaré transformations, and ${\Box}\,\delta^{4}(x-y)$ shares this property. With the Fourier transform

[TABLE]

(295) becomes

[TABLE]

The radius of convergence of this series depends on $c_{n}$ . We will discuss only those power series which are convergent in the whole complex $k^{2}$ -plane. In other words, $K({-k^{2}})$ is an entire function.

If $c_{n}=0$ for $n\geq N$ , then $K({-k^{2}})$ is a polynomial, and we are led to an ordinary higher-derivative Lagrangian. This suggests that if the coefficients $c_{n}$ decrease too rapidly, even though their sequence does not terminate, then the interaction is not smeared out, but actually remains local. Such interactions are called localizable. The line of demarcation between localizable and nonlocal interactions [109], [83], [54] separates entire functions $K({-k^{2}})$ into two classes:

[TABLE]

Condition (L) was shown to be equivalent to the following bound of asymptotic growth of $K({-k^{2}})$ as $k^{2}$ approaches infinity in the complex plane:

[TABLE]

where $\epsilon$ is arbitrarily small. As for nonlocal interactions, the class of entire functions $K({-k^{2}})$ showing considerable promise as appropriate form factors satisfies the asymptotic condition

[TABLE]

for a fixed, positive $\sigma$ . With such form factors, finite nonlocal theories of scalar fields obey all general conditions of quantum field theory: unitarity, covariance, and macroscopic causality [54], [55]. Furthermore, most results of the axiomatic quantum field theory, in particular $PCT$ -invariance and connection between spin and statistics, can be extended to the case that the Wightman vacuum expectation values reveal exponential energy growth [80], which is characteristic of nonlocal interactions.

Since our interest here is with the rearrangement of classical electrodynamics, we restrict our discussion to the following modification of the action [90]:

[TABLE]

where $j^{\mu}$ is the usual four-current of a delta-function source defined in (99), and $K({\Box})$ is given by (295). To maintain the link with the Maxwell–Lorentz theory, we require that

[TABLE]

Variation of $z^{\mu}$ gives the equation of motion for a bare particle

[TABLE]

and varying $A^{\mu}$ , we obtain the equation of motion for the electromagnetic field

[TABLE]

It is instructive to begin with the static case ${\bf E}=-\nabla\phi$ . Equation (304) becomes

[TABLE]

Using

[TABLE]

in (305) gives

[TABLE]

In the next-to-last equality of Eq. (307), the Sokhotski relation has been used,

[TABLE]

in which ${\rm P}$ stands for the Cauchy principal value. The last equality of Eq. (307) has been obtained by taking Eq. (302) into account.

Let us verify that the interaction associated with entire functions $K({k}^{2})$ obeying (299) is indeed ‘‘localizable’’. To this end we examine the behavior of

[TABLE]

By Cauchy’s theorem, integration over the real axis can be replaced by integration over a semicircle $\Gamma_{R}$ of large radius $R$ at the upper half-plane ${\rm Im}\,k>0$ . Letting $k=Re^{i\vartheta}$ ,

[TABLE]

where we have taken into account (299), and

[TABLE]

In the sector $0<\vartheta\leq\pi/2$ , it is helpful to use the inequality $\sin\vartheta\geq{2\vartheta}/{\pi}$ ,

[TABLE]

For finite $r$ and $\epsilon<2r\delta/\pi$ , this expression vanishes in the limit $R\to\infty$ .
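The role of the inequality $\sin\vartheta\geq 2\vartheta/\pi$ can be illustrated on the bare exponential factor $e^{ikr}$ alone, leaving aside the growth of $K$. On the arc $k=Re^{i\vartheta}$ one has $|e^{ikr}|=e^{-Rr\sin\vartheta}$, and a sketch of the standard Jordan-type estimate reads:

```latex
\int_{0}^{\pi/2}\! e^{-Rr\sin\vartheta}\,d\vartheta
 \;\leq\; \int_{0}^{\pi/2}\! e^{-2Rr\vartheta/\pi}\,d\vartheta
 \;=\; \frac{\pi}{2Rr}\left(1-e^{-Rr}\right)
 \;\xrightarrow[R\to\infty]{}\; 0 .
```

The linear lower bound on $\sin\vartheta$ turns the integrand into a pure exponential in $\vartheta$, which can be integrated in closed form and compared directly with any competing exponential growth on the arc.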

To summarize, if $K({k}^{2})$ obeys (299), then $h_{R}(r)\to 0$ , and the potential (307) takes the form of the Coulomb potential $e/r$ . The singularity of $\phi({\bf r})$ remains unaffected when $K(-{\nabla^{2}})$ acts on $\delta^{3}({\bf r})$ .

We next turn to nonlocal interactions. Supposing that

[TABLE]

we find

[TABLE]

For $r>\ell$ , this expression vanishes in the limit $R\to\infty$ . To enquire into the situation in the region $r<\ell$ , write the solution to (305) in the form

[TABLE]

where $\theta(r)$ is the Heaviside step function indicating that the range of $r$ is reduced to the semiaxis ${\mathbb{R}}_{+}$ , and $\alpha(r)$ is a differentiable function satisfying two conditions

[TABLE]

Combining (315) with (307), we obtain

[TABLE]

where the prime stands for differentiation with respect to $r$ . The inverse of (318) is

[TABLE]

This formula is convenient for constructing $K(k^{2})$ with the required properties (302) and (313). One can show [115] that if $\alpha(r)$ is a differentiable function obeying (316) and (317), then (319) represents an entire function $K(k^{2})$ of order $\frac{1}{2}$ , which is square integrable on ${\mathbb{R}}$ , normalized to $K(0)=1$ , and whose indicatrix is $H(\vartheta)=\ell\sin\vartheta$ . Conversely, let $K(k^{2})$ be an entire function of order $\frac{1}{2}$ and type $\ell$ that is square integrable on ${\mathbb{R}}$ . Then $\alpha^{\prime}(r)$ , defined in (318), is zero for $|r|\geq\ell$ .

Therefore, to formulate the nonlocal electrodynamics (301), we may proceed from the static potential (315) with an arbitrarily chosen $\alpha(r)$ , satisfying the conditions (316) and (317), and use Eq. (319) to obtain the explicit form of $K({\Box})$ .

As a simple example of $K(k^{2})$ and $\alpha^{\prime}(s)$ we take

[TABLE]

The corresponding potential $\phi(r)$ is a truncated Coulomb potential. A similar static solution for a charged sphere of radius $\ell$ is offered by Maxwell’s electrodynamics. However, the similarity is deceptive. While the charged sphere is subject to the repulsive static forces, and cannot be stable without resort to Poincaré cohesive forces, the delta-function source generating the potential (315) with $\alpha(r)$ given by (320) is free of tearing strains,

[TABLE]
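To see how a truncated Coulomb potential can arise from a form factor, consider the following sketch. It assumes Gaussian units, the static field equation in the form $\nabla^{2}\phi=-4\pi e\,K(-\nabla^{2})\,\delta^{3}({\bf r})$, and the illustrative choice $K(k^{2})=\sin(k\ell)/(k\ell)$; these are assumptions made for illustration and need not coincide in detail with (305) and (320).

```latex
\begin{aligned}
\phi(r) &= 4\pi e\int\!\frac{d^{3}k}{(2\pi)^{3}}\,\frac{K(k^{2})}{k^{2}}\,
           e^{i{\bf k}\cdot{\bf r}}
         = \frac{2e}{\pi r}\int_{0}^{\infty}\!dk\,\frac{\sin kr}{k}\,K(k^{2}),\\[4pt]
K(k^{2}) = \frac{\sin k\ell}{k\ell}:\quad
\phi(r) &= \frac{2e}{\pi r\ell}\int_{0}^{\infty}\!dk\,
           \frac{\sin kr\,\sin k\ell}{k^{2}}
         = \frac{e}{r\ell}\,\min(r,\ell)
         = \begin{cases} e/\ell, & r<\ell,\\ e/r, & r\geq\ell . \end{cases}
\end{aligned}
```

The last step uses $\int_{0}^{\infty}dk\,\sin(ak)\sin(bk)/k^{2}=\frac{\pi}{2}\min(a,b)$ for $a,b>0$. The assumed $K$ is an entire function of $k^{2}$ with $K(0)=1$, and the resulting potential is constant inside $r<\ell$ and Coulombic outside.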

The square integrability of $K(k^{2})$ implies that the self-energy is finite:

[TABLE]

We are now in a position to consider the general case in which a point charge moves along an arbitrary timelike smooth world line. To solve the field equation (304), we adopt the retarded boundary condition, and impose the Lorenz gauge. The solution is given by

[TABLE]

where

[TABLE]

Looking at (313), we observe that $K(-k^{2})$ grows exponentially when $k^{2}\to\infty$ . This implies that the integral (323) fails to converge unless ${\tilde{\jmath}}^{\,\mu}(k)$ decreases appropriately in timelike directions. It is possible to demonstrate [90], [99] the existence of world lines such that

[TABLE]

which is to say that the integral in Eq. (323) is convergent. If the nonlocal electrodynamics is to be consistent, the allowable class of world lines must be narrowed in the following way. Let $t$ be laboratory time in a particular Lorentz frame, so that $dz^{\mu}/dt=(1,{\bf v})$ . Consider a set of timelike smooth curves $z^{\mu}(t)$ which are capable of being parametrized by a complex variable $t+iu$ , and assume that these curves lend themselves to the requirements of analyticity in the strip

[TABLE]

and integrability

[TABLE]

where ${\bf v}_{+}(t)=\frac{1}{2}[{\bf v}(t)+{\bf v}(-t)]-\frac{1}{2}({\bf v}_{\rm in}+{\bf v}_{\rm out})$ . Then this set of curves represents the class of world lines for which the asymptotic estimate (325) holds.

The retarded solutions $A^{\mu}(x)$ prove to be identical to the Liénard–Wiechert vector potentials everywhere outside a thin tube $T_{\varrho}$ of radius ${\varrho}\sim{\ell}$ enclosing the world line. All acausal phenomena are confined to a spacetime region ${\cal M}$ bounded by $T_{\varrho}$ .

We thus come to the following picture of self-interaction. The initial degrees of freedom appearing in the action (301) rearrange just as they do in the Maxwell–Lorentz theory with few exceptions concerning the region ${\cal M}$ .

Substitution of the retarded field (323) in $\Theta^{\mu\nu}$ gives a finite stress-energy tensor. Integrability is therefore not a feature unique to $\Theta^{\mu\nu}_{\rm II}$ . In fact, the segregation between $\Theta^{\mu\nu}_{\rm I}$ and $\Theta^{\mu\nu}_{\rm II}$ is confidently made only at a distance well away from the region ${\cal M}$ . To define $\Theta^{\mu\nu}_{\rm II}$ , the part of the retarded solution $F^{\mu\nu}_{\rm II}$ that scales as $\rho^{-1}$ in the limit $\rho\gg\ell$ should be substituted in $\Theta^{\mu\nu}$ . This makes it possible to reproduce most of the properties of radiation in the Maxwell–Lorentz theory, the generalized Larmor formula (75) included, for sufficiently smooth world lines, that is, for curves whose local acceleration and higher derivatives are always small, $\ell^{2}|a^{2}|\ll 1$ , $\ell^{4}|{\dot{a}}^{2}|\ll 1$ ,…

We remark parenthetically that the product of distributions ${\Box}^{-1}K({\Box})\,\delta^{4}(x)$ , whose smearing functions $K({-k^{2}})$ satisfy the asymptotic condition (299), is well defined if the appropriate basic space ${\cal Z}$ consists of slowly increasing entire functions $\chi(k^{2})$ of the complex variable $k^{2}=\xi+i\zeta$ subject to the conditions

[TABLE]

where $a$ , $b$ , $C_{n}$ , and $N$ are constants (dependent on $\chi$ ). For more details of ${\cal Z}$ see [54].

Equation (303) is free of the ultraviolet disease and hence is the well-defined equation of motion for a dressed charged particle. It is possible to bring this equation to the form of Newton’s second law, Eq. (88). Indeed, the general solution to the field equation (304) is the sum of $F_{\rm ret}$ , the retarded field due to the delta-function source smeared by $K({\Box})$ , and $F_{\rm ext}$ , a solution to the homogeneous wave equation describing an external field. Therefore, the right-hand side of Eq. (303) is decomposed into two terms, $I^{\lambda}+f^{\lambda}$ , where $I^{\lambda}$ is attributed to the effect of the retarded field, and $f^{\lambda}$ stands for the external force. Note also that the terms $m_{0}a^{\lambda}$ , $I^{\lambda}$ , and $f^{\lambda}$ are each orthogonal to $v^{\lambda}$ . This explains the occurrence of the overall projector $\stackrel{v}{\bot}$ , and the separation of $f^{\lambda}$ from the remainder, which, together with $m_{0}a^{\lambda}$ , is associated with ${\dot{p}}^{\lambda}$ . For sufficiently smooth world lines, Eq. (303) is closely approximated by the Abraham–Lorentz–Dirac equation, Eq. (86), in which $m=m_{0}+\delta m$ , with $\delta m$ being given by (322). Let $\kappa$ be a characteristic curvature of such world lines, and $\epsilon=\ell\kappa$ a small dimensionless parameter. Then Eq. (89) represents the four-momentum of a dressed particle $p^{\mu}$ accurate to $O(\epsilon^{5})$ .

It is notable that the self-energy $\delta m$ is finite as (322) suggests. The renormalization of mass, $m=m_{0}+\delta m$ , is therefore not a means for rendering the product of singular distributions well-defined. The reason for fusing two quantities of different nature, $m_{0}$ and $\delta m$ , into a single quantity, $m$ , is again the fact that the initial degrees of freedom appearing in the action (301) are unstable, which begets their gathering into a dressed particle whose inert property is expressed in terms of $m$ .

3.7 Particles with spin

By now, several approaches to the dynamics of classical charged particles with spin have been developed. All can be separated into two main groups: those characterized by the use of ordinary commuting variables describing spin degrees of freedom, and those marked by the use of anticommuting Grassmannian variables (for a review see, e.g., [129]). Frenkel [62] pioneered a relativistic Lagrangian description of classical spinning particles in an electromagnetic field. Grassmannian variables were first applied to describe spin at the classical level in [108], [21], [22], and [41].

The Maxwell–Lorentz electrodynamics can be extended to cover a particle possessing a charge $e$ and a dipole moment $m^{\mu\nu}=-m^{\nu\mu}$ by introducing the current density

[TABLE]

To specialize our discussion to the case of a magnetic dipole, we connect $m^{\mu\nu}$ with spin variables $S^{\mu\nu}$ of the particle,

[TABLE]

satisfying the Frenkel condition

[TABLE]

The particle is assumed to be an object whose nature is preserved under time evolution. In particular, the magnitude of the tensor $S^{\mu\nu}$ must be unchanged,

[TABLE]

The equation of motion for a dipole must be consistent with conditions (331) and (332).

In a particular Lorentz frame, the antisymmetric tensor $S_{\mu\nu}$ is written as $S_{\mu\nu}=({\bf N},{\bf S})$ , where $S_{0i}={\rm N}_{i}$ and $S_{ij}=-\epsilon_{ijk}\,{\rm S}_{k}$ . In the rest frame, Frenkel’s constraint (331) implies that only the components of ${\bf S}$ are nonzero, while ${\bf N}={\bf 0}$ . Equation (332) can be recast as $S^{\mu\nu}S_{\mu\nu}=2{\bf S}^{2}$ . Therefore, ${\bf S}$ may be used to mean spin as viewed by a comoving observer.
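The invariant $S^{\mu\nu}S_{\mu\nu}$ can be worked out explicitly. The following sketch assumes the metric signature $(+,-,-,-)$ and takes the time-space components of $S_{\mu\nu}$ to be ${\rm N}_{i}$ and the space-space components to be $-\epsilon_{ijk}\,{\rm S}_{k}$:

```latex
\begin{aligned}
S^{\mu\nu}S_{\mu\nu}
 &= 2\,S^{0i}S_{0i} + S^{ij}S_{ij} \\
 &= -2\,{\rm N}_{i}{\rm N}_{i}
    + \epsilon_{ijk}\,\epsilon_{ijl}\,{\rm S}_{k}{\rm S}_{l} \\
 &= 2\left({\bf S}^{2}-{\bf N}^{2}\right).
\end{aligned}
```

Here $S^{0i}=-S_{0i}$ and $S^{ij}=S_{ij}$ under the assumed signature, and $\epsilon_{ijk}\epsilon_{ijl}=2\delta_{kl}$. In the rest frame, where ${\bf N}={\bf 0}$, the invariant reduces to $2{\bf S}^{2}$.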

The symmetry properties of the extended theory have been subjected to dramatic changes. Since $m^{\mu\nu}$ has dimension of length, Maxwell’s equations cease to be invariant under the group of conformal transformations C $(1,3)$ . Furthermore, in view of (329), it seems difficult to conceive of a simple law of transformation for $m^{\mu\nu}$ in response to reparametrizations of world lines so as to preserve reparametrization invariance of the action.

The retarded solution to Maxwell’s equations with the source $j^{\mu}$ shown in (329) is

[TABLE]

All the quantities on the right-hand side are taken at the retarded instant $s_{\rm ret}$ .

It is clear from (333) that the retarded field $F^{\mu\nu}$ contains terms proportional to $\rho^{-3}$ , $\rho^{-2}$ , and $\rho^{-1}$ . Accordingly, the four-momentum $P^{\mu}$ of this field involves terms that diverge as $\epsilon^{-3}$ , $\epsilon^{-2}$ , and $\epsilon^{-1}$ in the limit of regularization removal $\epsilon\to 0$ . To be more specific, we refer to [26] where it was found that the only singular term of $P^{\mu}$ proportional to $e^{2}$ looks like

[TABLE]

singular terms proportional to $e\mu$ appear as

[TABLE]

and singular terms proportional to ${\mu}^{2}$ are of the form

[TABLE]

To absorb these divergences, we need to redefine three free parameters which are held in the particle sector of the action. But only two such parameters, namely $m_{0}$ and $S$ , are available. This bears some resemblance to the Maxwell–Lorentz electrodynamics in ${\mathbb{R}}_{1,5}$ , discussed in Sec. 3.2.3, where $P^{\mu}$ contains cubic and linear divergences. We augmented the Poincaré–Planck action by the addition of terms containing higher derivatives of $z^{\mu}$ , in other words, we adopted the rigid dynamics with a simple Lagrangian (194), which offered a means to eliminate the divergences and make the description consistent.

However, the situation is different now. As indicated above, we are dealing with a system devoid of conformal and reparametrization invariances. It is unlikely that one can contrive a simple Lagrangian for a spinning particle containing, in addition to $m_{0}$ and $S$ , a further parameter, so as to be able to eliminate the divergences. A regular approach to deriving a consistent dynamics of a spinning particle interacting with the electromagnetic field remains to be developed. Yet, at times, researchers propose ad hoc versions of the desired dynamics taking into account self-interaction effects [25], [26], [148], [133], [20], [70]. We do not delve into the essence of these heuristic approaches. We only note that the eventual outcome is given by two comparatively simple equations, the equation of motion for a dressed spinning particle, and the equation governing the precession of spin of this particle. For example, the following dynamical equations were obtained in [148]:

[TABLE]

and

[TABLE]

where $g$ is the gyromagnetic ratio defined by

[TABLE]

The rearranged dynamics, Eqs. (337) and (338), is expressed in terms of two auxiliary quantities ${\hat{F}}^{\mu\nu}$ and ${\hat{S}}^{\mu\nu}$ originating in the following way. Let ${F}^{\mu\nu}_{(-)}$ denote half the difference of the retarded and advanced fields generated by the particle. Then ${\hat{F}}^{\mu\nu}$ is defined by

[TABLE]

The definition of ${\hat{S}}^{\mu\nu}$ is more complicated. The derivative of this quantity is defined by

[TABLE]

where the auxiliary momentum variable ${\hat{P}}^{\mu}$ satisfies the equation

[TABLE]

The constraints (330), (331), and (332) therewith change, respectively, as

[TABLE]

It is not unduly difficult to verify that the dynamical equations (337) and (338) are consistent with these modified constraints.

To sum up, formulating the classical dynamics of charged spinning particles in terms of ordinary commuting variables is a challenging problem which still remains to be solved.

The surprising thing is that the Lagrangian description of charged spinning particles with the aid of anticommuting Grassmannian variables turns out to be quite simple. As an illustration, we refer to the model of [66] characterized by the use of real-valued odd elements of a Grassmann algebra $\theta^{\mu}$ and $\theta_{5}$ to describe spin degrees of freedom. An appropriate reparametrization invariant action is

[TABLE]

and the endpoint variation conditions are

[TABLE]

The variables $N$ and $M$ are Lagrange multipliers of the constraints $P^{2}-m^{2}-ieF_{\mu\nu}\theta^{\mu}\theta^{\nu}\approx 0$ and $\theta^{\mu}P_{\mu}+m\theta_{5}\approx 0$ . The even Grassmannian construction $i\theta^{\mu}\theta^{\nu}$ is similar to the spin tensor $S^{\mu\nu}$ .

A stumbling block for the study of such models lies in the following fact. Although even elements of a Grassmann algebra contain usual numbers (Dirac called them $c$ -numbers), the subset of $c$ -numbers does not exhaust all the possibilities. There are even elements different from $c$ -numbers. Solutions to the equation of motion for a spinning particle, $z^{\mu}(\tau)$ , may well be built from even Grassmannian variables appearing in the action (346) which are not $c$ -numbers, say, from $N\theta^{\mu}$ and $i\theta^{\mu}\theta^{\nu}$ . The complete collection of world lines $z^{\mu}(\tau)$ may involve curves which need not be mappings ${\mathbb{R}}\to{\mathbb{R}}_{1,3}$ . Therefore, this spinning particle lives in a realm which is not identical to Minkowski spacetime or its warpings. This realm is described in terms of arbitrary even Grassmannian variables. However, in this realm, most physical quantities are deprived of operational definition, and we do not have the slightest idea of how they can be experimentally recorded.

4 SELF-INTERACTING GAUGE FIELD SYSTEMS

Armed with the insight gained from the self-interaction problem in the Maxwell–Lorentz electrodynamics and simplified nonlinear field models possessing nontrivial topological structures, we begin to consider this problem in non-Abelian gauge field systems. Both of the studied manifestations of self-interaction can occur there. If a system does not involve particles, that is, delta-function sources of the field are missing from this system, solutions associated with phases of distinct topological structures are the sole manifestation of self-interaction. The Yang–Mills–Higgs model is an example. One phase of this system can be related to the Coleman plane waves, and the other phase can be attributed to the ’t Hooft–Polyakov magnetic monopole field. The former enjoys the property of the internal SU $(2)$ symmetry, which is spontaneously broken to U $(1)$ in the latter. These configurations differ not only in their topological setups, but also in physical properties. On the other hand, if a system contains particles interacting with a non-Abelian gauge field, the phase manifestation of self-interaction is supplemented with rearranging initial mechanical and field degrees of freedom. This phenomenon shows a general resemblance to that in the Maxwell–Lorentz theory, but occurs differently in different phases. To illustrate, in one phase of the Yang–Mills–Wong theory, accelerated dressed particles emit Yang–Mills field waves of positive energy, while in the other phase, accelerated dressed particles emit waves of negative energy. The gauge groups of these phases are different, and neither of them is a subgroup of the other. They are the compact and a noncompact real forms of the complexification of the gauge symmetry group. This trait of self-interaction in the Yang–Mills–Wong theory is called ‘‘spontaneous deformation of symmetry’’.

4.1 Self-interaction in the Yang–Mills–Higgs theory

A regular way of obtaining nontrivial static solutions of the pure SU $(2)$ gauge theory is to use the so-called Wu and Yang ansatz [157] in which space coordinates are interwoven with gauge field coordinates,

[TABLE]

’t Hooft [149] and Polyakov [123] pioneered the use of the Wu–Yang ansatz in the Yang–Mills–Higgs theory, and discovered a field configuration possessing properties of the magnetic monopole. This configuration is a finite-energy smooth solution to the SO $(3)$ gauge theory with a Higgs triplet. The Lagrangian is:

[TABLE]

where

[TABLE]

The field equations read

[TABLE]

The Higgs potential $V(\phi)$ must approach zero as $r\to\infty$ , which means that the Higgs field has a nonzero limit at spatial infinity

[TABLE]

The boundary condition (355) singles out a particular axis ${\hat{\phi}}_{a}$ in the parameter space of the SO $(3)$ group of internal symmetry for each spatial direction ${\bf n}$ , thus breaking this symmetry. Solutions subject to this condition are invariant under the group of rotations about ${\hat{\phi}}_{a}$ . Therefore, the unbroken symmetry is SO $(2)$ or, equivalently, the U $(1)$ subgroup of SO $(3)$ . The resulting U $(1)$ gauge theory can be identified with Maxwell’s theory of charged vector and scalar fields if generators $T_{a}$ of the initial SO $(3)$ group are projected on ${\hat{\phi}}^{a}$ . Then the Abelian vector potential $A^{\mu}$ associated with the local U $(1)$ gauge group is

[TABLE]

and the electric charge is

[TABLE]

Our concern here is with static spherically symmetric solutions to the field equations (353) and (354). Let us take the gauge condition $A^{a}_{0}=0$ . This implies $D_{0}\phi_{a}=0$ and $G^{a}_{0j}=0$ . With the ansatz

[TABLE]

(353) and (354) become

[TABLE]

where the prime stands for differentiation with respect to $r$ .

Combining (355) with (358) gives

[TABLE]

that is, ${\hat{\phi}}^{a}=n^{a}$ . The ansatz (359) is consistent with Eqs. (360) and (361) if $b(r)=O(1)$ as $r\to\infty$ . The ‘‘electromagnetic’’ field strength $F_{\mu\nu}$ is to be identified with the component of the Yang–Mills strength $G^{a}_{\mu\nu}$ in the direction of ${\hat{\phi}}^{a}$ , which corresponds to the unbroken U $(1)$ symmetry. Taking into account that $G^{a}_{0j}=0$ , one can show that, far from the origin,

[TABLE]

or

[TABLE]

which represents a radial static magnetic field

[TABLE]

generated by the total magnetic charge $e^{\star}=1/e$ . This solution, called the ’t Hooft–Polyakov monopole, describes the Yang–Mills field of magnetic type because ${}^{\ast}{G}^{a}_{\mu\nu}{G}_{a}^{\mu\nu}=0$ and ${G}^{a}_{\mu\nu}{G}_{a}^{\mu\nu}>0$ .

The ’t Hooft–Polyakov monopole and the Dirac monopole [51], [53] differ in their structure within a core of size $\sim\lambda/\mu$ . The Dirac monopole solution (365) has a singularity for which a point source has to be introduced explicitly in the action, while the ’t Hooft–Polyakov monopole is smooth everywhere and satisfies the field equations (353) and (354) without external sources. Outside the core these configurations are similar.

There is another class of solutions in the Yang–Mills–Higgs theory, the non-Abelian analogues of electromagnetic plane waves, originally discovered by Coleman [45]. To describe these solutions of the field equations (353) and (354), it is convenient to adopt light-cone coordinates $x^{\mu}=\left(x^{+},x^{-},x^{2},x^{3}\right)$ , $x^{\pm}=x^{0}\pm x^{1}$ . The metric, expressed in terms of these coordinates, is $ds^{2}=dx^{+}dx^{-}-(dx^{2})^{2}-(dx^{3})^{2}$ . The Coleman solutions for plane waves moving in the negative $x^{1}$ -direction are given by

[TABLE]

where the $f$ ’s and $g$ ’s are arbitrary bounded functions of $x^{+}$ , with all other components of the field strength vanishing. For these solutions, the energy density is bounded throughout spacetime, the direction of the Poynting vector is constant, the magnitude of the Poynting vector is equal to the energy density, and ${}^{\ast}{G}^{a}_{\mu\nu}{G}_{a}^{\mu\nu}=0$ , ${G}^{a}_{\mu\nu}{G}_{a}^{\mu\nu}=0$ , which allows one to classify these ${G}^{a}_{\mu\nu}$ as null-field configurations. The Coleman plane waves represent a phase distinct from the phase associated with the ’t Hooft–Polyakov magnetic monopoles. There is no bijective continuous mapping between these two configurations.
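The light-cone form of the metric used for these solutions follows directly from the definition $x^{\pm}=x^{0}\pm x^{1}$, as the following short check shows:

```latex
\begin{aligned}
dx^{+}dx^{-} &= (dx^{0}+dx^{1})(dx^{0}-dx^{1}) = (dx^{0})^{2}-(dx^{1})^{2},\\
ds^{2} &= (dx^{0})^{2}-(dx^{1})^{2}-(dx^{2})^{2}-(dx^{3})^{2}
        = dx^{+}dx^{-}-(dx^{2})^{2}-(dx^{3})^{2}.
\end{aligned}
```

A profile depending only on $x^{+}=x^{0}+x^{1}$ is constant along the lines $x^{1}=-x^{0}+{\rm const}$, which is why such solutions describe waves moving in the negative $x^{1}$ -direction at the speed of light.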

The ’t Hooft–Polyakov monopole is electrically neutral due to the combination of the gauge condition $A_{0}^{a}=0$ and the requirement that the field configurations should be static. Julia and Zee [85] have abandoned this gauge condition and used the ansatz

[TABLE]

where $c(r)$ is an unknown function. The resulting static solution exhibits nonzero $G^{a}_{0j}$ . This configuration with electric and magnetic charges, presently known as the Julia–Zee dyon, offers a further phase of the Yang–Mills–Higgs system.

For arbitrary $\mu$ and $\lambda$ , the set of ordinary differential equations (360) and (361) has never been solved analytically. However, in the limit $\mu\to 0$ , $\lambda\to 0$ , with $\mu/\lambda<\infty$ , an exact solution

[TABLE]

where $\beta$ is an arbitrary constant, was obtained by Bogomol’nyi [33], and Prasad and Sommerfield [124]. The Bogomol’nyi–Prasad–Sommerfield configuration is self-dual,

[TABLE]

where the Hodge dual field ${}^{\ast}G_{\mu\nu}^{a}$ is defined by ${}^{\ast}G_{\mu\nu}^{a}=\frac{1}{2}\epsilon_{\mu\nu\alpha\beta}G^{a\,\alpha\beta}$ . This implies that the Yang–Mills term of the stress-energy tensor $\Theta_{\mu\nu}$ is vanishing. Indeed,

[TABLE]

which shows that $\Theta_{\mu\nu}=0$ for ${}^{\ast}G_{\mu\nu}=\pm i\,G_{\mu\nu}$ . Thus, the Bogomol’nyi–Prasad–Sommerfield configuration carries zero energy and momentum.

This cursory glance at nontrivial solutions of the Yang–Mills–Higgs system suffices for present purposes. The availability of these solutions makes it clear that self-interaction in this system gives rise to several phases with different geometric and physical properties.

The literature on exact solutions of classical Yang–Mills theories is rather extensive. The reader interested in this topic may consult the books [126], [141], and [134]. A detailed treatment of monopole solutions is given in [68], [84], [46], [11]. A largely complete review of classical solutions to SU $(2)$ gauge theories that were known by the end of the 1970s is presented in the survey [5].

4.2 Self-interaction in the Yang–Mills–Wong theory

The Yang–Mills–Wong theory has many features in common with the Maxwell–Lorentz theory. This is a classical gauge field theory which describes the interaction of particles carrying non-Abelian charges with the Yang–Mills field. A closed system of $K$ particles interacting with the SU $({\cal N})$ Yang–Mills field is governed by the action [12]

[TABLE]

where $T_{a}$ are generators of SU $({\cal N})$ , $G^{a}_{\mu\nu}=\partial_{\mu}A^{a}_{\nu}-\partial_{\nu}A^{a}_{\mu}+if^{a}_{\ bc}A^{b}_{\mu}A^{c}_{\nu}$ is the field strength, and $f_{abc}$ are the structure constants of SU $({\cal N})$ . The SU $({\cal N})$ gauge group is hereafter called the ‘‘color’’ gauge group, and the classical particles go under the name of ‘‘quarks’’, with the understanding, however, that all such terms refer to the Yang–Mills–Wong theory, which may in some respects stretch the truth of the subnuclear realm.

Quarks, labelled with $I$ , carry color charges $Q_{I}$ in the adjoint representation of SU $({\cal N})$ , $Q_{I}=Q^{a}_{I}\,T_{a}$ , which can be written in terms of the basic variables $\eta_{Ij}$ in the fundamental representation,

[TABLE]

The Euler–Lagrange equations for $\eta$ and $\eta^{\ast}$ , in which the label $I$ is omitted,

[TABLE]

can be combined into

[TABLE]

Equation (374) was originally derived by Wong [156]. It shows that the color charge $Q_{a}$ shares with a top the property of precessing. Indeed, $Q_{a}$ precesses around the axis ${v}\cdot A^{a}$ in the color space.

In contrast to the electric charge $e$ , which is a constant, the color charge $Q_{a}$ is a dynamical variable governed by Eq. (374). But the color charge magnitude is a constant of motion,

[TABLE]

which follows from (374), written in the Cartan basis, because in this basis, $f_{abc}=-f_{bac}$ . Furthermore, there is good reason to look for solutions of the Yang–Mills–Wong theory satisfying the condition

[TABLE]

Abandoning this condition would pose the problem of an infinitely rapid precession of $Q_{a}$ because the retarded field $A^{a}_{\mu}$ is singular on the world line.
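The precession of the color charge and the constancy of its magnitude can be illustrated numerically for SU $(2)$ , where the structure constants reduce to $\epsilon_{abc}$ . The sketch below assumes a precession law of the form $dQ/d\tau=\omega\times Q$ , with a constant vector $\omega^{a}$ standing in for ${v}\cdot A^{a}$ ; the sign and normalization conventions, the constant $\omega$ , and the initial charge are hypothetical choices made for illustration and are not taken from Eq. (374).

```python
import math


def cross(a, b):
    """Vector product in three-dimensional color space (SU(2): f_abc = eps_abc)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def rhs(q, omega):
    """Assumed precession law dQ/dtau = omega x Q, omega^a standing for v.A^a."""
    return cross(omega, q)


def rk4_step(q, omega, h):
    """One classical Runge-Kutta step for the precession equation."""
    k1 = rhs(q, omega)
    k2 = rhs(tuple(q[i] + 0.5 * h * k1[i] for i in range(3)), omega)
    k3 = rhs(tuple(q[i] + 0.5 * h * k2[i] for i in range(3)), omega)
    k4 = rhs(tuple(q[i] + h * k3[i] for i in range(3)), omega)
    return tuple(q[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6.0
                 for i in range(3))


def norm(q):
    """Euclidean magnitude of the color charge vector."""
    return math.sqrt(sum(c * c for c in q))


omega = (0.0, 0.0, 1.0)   # hypothetical constant precession axis
q = (1.0, 0.0, 0.5)       # hypothetical initial color charge
q0_norm = norm(q)
for _ in range(1000):
    q = rk4_step(q, omega, 0.01)

# The magnitude drifts only by the integration error, since
# d(Q.Q)/dtau = 2 Q.(omega x Q) vanishes identically.
print(abs(norm(q) - q0_norm))
```

The conservation of the magnitude rests only on the antisymmetry of the structure constants, so the same argument applies to any gauge group; the SU $(2)$ case is used here because the precession can be pictured as an ordinary rotation.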

Varying $A^{\mu}$ in the action (371) gives the Yang–Mills equations:

[TABLE]

Before proceeding further let us recall that knowing the retarded solution to Maxwell’s equations with the source involving a single point charge,

[TABLE]

its extension to the case that the source is composed of $K$ charges follows immediately:

[TABLE]

Allowing for linear combinations of solutions with arbitrary real coefficients in Eq. (379) is tantamount to stating that electric charges $e_{I}$ take arbitrary real values. By contrast, the superposition principle does not apply to the Yang–Mills equations unless they become Abelian, and hence linearize. Non-Abelian solutions of the Yang–Mills equations with the single-quark source are prevented from superposing, and we are forced to solve Eq. (377) for each $K$ individually.
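The failure of superposition can already be seen at the level of the field strength. The following sketch uses the definition of $G^{a}_{\mu\nu}$ given with the action (371) and writes the potential as a sum of two configurations; the labels $(1)$ and $(2)$ are introduced here for illustration only:

```latex
G^{a}_{\mu\nu}\big[A_{(1)}+A_{(2)}\big]
 = G^{a}_{\mu\nu}\big[A_{(1)}\big] + G^{a}_{\mu\nu}\big[A_{(2)}\big]
 + i f^{a}_{\ bc}\left(A^{b}_{(1)\mu}A^{c}_{(2)\nu}
                      + A^{b}_{(2)\mu}A^{c}_{(1)\nu}\right).
```

The cross term vanishes when both potentials take values in a set of mutually commuting generators, because the corresponding structure constants are zero. In that case the field strength is additive, which is the Abelian situation referred to above.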

A systematic method for finding exact retarded solutions of Eq. (377) with the source composed of $K$ quarks moving along arbitrary timelike smooth world lines was proposed and developed in a series of papers [91], [93], [94], [95], [96], [99], and rediscovered in part in [137]. Without going into the details of this method, we simply present typical solutions and discuss their properties bearing on self-interaction.

There are two kinds of retarded solutions to Eq. (377), Abelian and non-Abelian. Abelian solutions are defined on a set of ${\cal N}-1$ commuting matrices $H_{a}$ which form the Cartan subalgebra of the Lie algebra su $({\cal N})$ . To build non-Abelian solutions, we need extended subalgebras of su $({\cal N})$ , containing the Cartan subalgebra.

Consider the simplest case that the SU $(2)$ Yang–Mills field is generated by a single quark moving along a timelike smooth world line. The retarded Abelian solution is

[TABLE]

where $q$ is an arbitrary real parameter whereby the color charge of the quark is measured, $Q=qT_{3}$ . This solution has much in common with the Liénard–Wiechert vector potential (378), in particular the field $G^{\mu\nu}$ evaluated from (380) is of electric type.

The corresponding retarded non-Abelian solution is

[TABLE]

Here $T_{a}$ $(a=1,2,3)$ , are three generators of SU $(2)$ , which can be expressed in terms of the Pauli matrices, $T_{a}=\frac{1}{2}\sigma_{a}$ , and $\kappa$ is an arbitrary real nonzero parameter. The field strength associated with this vector potential is

[TABLE]

where $U^{\mu}=-\lambda v^{\mu}+\rho a^{\mu}$ which is the same as that given by Eq. (52).

With the prescription that observable color singlets must involve either ‘‘ $+$ ’’ or ‘‘ $-$ ’’ representatives of the non-Abelian solutions, not their mix, one can deduce that the configuration defined by Eqs. (382)–(383) qualifies as a Yang–Mills field of magnetic type,

[TABLE]

The Yang–Mills equations determine not only the retarded non-Abelian field (381), but also the magnitude of the color charge that generates this field. Indeed, the quantity

[TABLE]

appearing in (381) is ordered by the structure of the Yang–Mills equations (377).

The solution (381) acquires the form $A_{\mu}={\cal A}_{\mu}^{a}\,{\cal T}_{a}$ , with all coefficients ${\cal A}_{\mu}^{a}$ pure imaginary, when written in the matrix basis

[TABLE]

Elements of this basis obey the commutation relations

[TABLE]

which underlie the sl $(2,{\mathbb{R}})$ Lie algebra. The color space becomes a pseudoeuclidean space with the metric $\eta_{ab}={\rm diag}\,(-1,1,-1)$ . The automorphism group of this space is SO $(2,1)$ . On the other hand, the gauge group of the solution (380) is the initially chosen SU $(2)$ .

Should we adopt the initial gauge group Sp $(1)$ , rather than SU $(2)$ or SO $(3)$ , we would come to identical results owing to the equivalence of three complex Lie algebras

[TABLE]

their real compact forms

[TABLE]

and their real noncompact forms

[TABLE]

see, e.g., [17].

Imagine for a moment that the Universe contains a single quark. The system ‘‘the quark plus its own Yang–Mills field’’ exists in two phases which are distinguished by their groups of gauge symmetry: SU $(2)$ and SL $(2,{\mathbb{R}})$ . These phases will be conventionally referred to as ‘‘hot’’ and ‘‘cold’’.

A closer look at SU $(2)$ and SL $(2,{\mathbb{R}})$ shows that neither of them is a subgroup of the other. The origin of SL $(2,{\mathbb{R}})$ bears no relation to spontaneous symmetry breakdown. SU $(2)$ and SL $(2,{\mathbb{R}})$ are the compact and a noncompact real forms of the complex group SL $(2,{\mathbb{C}})$ . Invariance of the action (371) under SU $(2)$ automatically entails its invariance under the complexification of this group, SL $(2,{\mathbb{C}})$ . However, a complex-valued Yang–Mills field may seem problematic in the classical context, particularly where observable quantities, such as energy, are involved. Only real forms of SL $(2,{\mathbb{C}})$ appear to be satisfactory as gauge groups. The emergence of a solution invariant under a real form of SL $(2,{\mathbb{C}})$ different from the initial SU $(2)$ is a phenomenon specific to the Yang–Mills–Wong theory. We call it the ‘‘spontaneous symmetry deformation’’. The cold and hot phases differ from each other not only in their symmetry; a cold quark generates the Yang–Mills field of magnetic type, while a hot quark generates the Yang–Mills field of electric type.
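The distinction between the compact and noncompact real forms can be made concrete with $2\times 2$ matrices. The sketch below checks the su $(2)$ commutation relation for $T_{a}=\frac{1}{2}\sigma_{a}$ and compares the trace form ${\rm tr}(XY)$ on two real forms of sl $(2,{\mathbb{C}})$ . The real basis `SL2R` is an assumed illustrative choice, not necessarily the basis ${\cal T}_{a}$ used in the text, so the overall sign and ordering of the resulting metric are convention dependent.

```python
def mul(A, B):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(tuple(sum(A[i][k] * B[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))

def scale(c, A):
    """Multiply every entry of A by the number c."""
    return tuple(tuple(c * x for x in row) for row in A)

def commutator(A, B):
    """Matrix commutator [A, B] = AB - BA."""
    AB, BA = mul(A, B), mul(B, A)
    return tuple(tuple(AB[i][j] - BA[i][j] for j in range(2)) for i in range(2))

def max_abs_diff(A, B):
    """Largest entrywise deviation between two matrices."""
    return max(abs(A[i][j] - B[i][j]) for i in range(2) for j in range(2))

def trace_form(basis):
    """Matrix of tr(X Y) over a basis; real for the bases used here."""
    return [[(mul(X, Y)[0][0] + mul(X, Y)[1][1]).real for Y in basis]
            for X in basis]

# Pauli matrices and the generators T_a = sigma_a / 2.
S1 = ((0, 1), (1, 0))
S2 = ((0, -1j), (1j, 0))
S3 = ((1, 0), (0, -1))
T = [scale(0.5, s) for s in (S1, S2, S3)]

# Compact real form su(2): anti-Hermitian matrices i*T_a.
SU2 = [scale(1j, t) for t in T]
# Noncompact real form sl(2,R): real traceless matrices (assumed basis).
SL2R = [T[0], scale(1j, T[1]), T[2]]

print(trace_form(SU2))   # negative definite
print(trace_form(SL2R))  # indefinite
```

The trace form is negative definite on su $(2)$ and indefinite on sl $(2,{\mathbb{R}})$ , with one diagonal entry of opposite sign to the other two. This is the algebraic counterpart of the pseudoeuclidean color space mentioned above.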

While on the subject of systems composed of $K$ quarks and their field evolving in the non-Abelian regime, we note that the Yang–Mills–Wong theory of such systems can be consistently formulated for the color gauge group SU $({\cal N})$ with ${\cal N}\geq K+1$ . To illustrate, we refer to a system of two quarks whose initial gauge group is assumed to be SU $(3)$ . A retarded non-Abelian solution that describes the field due to two quarks moving along arbitrary timelike smooth world lines is

[TABLE]

where $H_{l}$ and $E^{\pm}_{mn}$ are generators of SU $(3)$ in the Cartan basis, which can be expressed in terms of the Gell-Mann matrices:

[TABLE]

$R_{1}^{\mu}=x^{\mu}-{z}^{\mu}_{1}(s_{1})$ and $R_{2}^{\mu}=x^{\mu}-{z}^{\mu}_{2}(s_{2})$ are, respectively, null four-vectors drawn from points ${z}^{\mu}_{1}(s_{1})$ and ${z}^{\mu}_{2}(s_{2})$ on the world lines of quarks 1 and 2, where the signals were emitted, to the point $x^{\mu}$ , where the signals were received.

Expression (391) is imaginary-valued in the color basis

[TABLE]

or in the Cartan basis spanned by $H_{n}$ and $E_{mn}^{\pm}$ . The ${\cal T}_{n}$ ’s are traceless real $3\times 3$ matrices satisfying the commutation relations of the Lie algebra sl $(3,{\mathbb{R}})$ . Thus, the gauge group of the non-Abelian solution (391) is actually SL $(3,{\mathbb{R}})$ .

The structure of retarded non-Abelian solutions to the Yang–Mills equations with the source composed of $K$ quarks closely resembles that of (391). These solutions are the sum (not an arbitrary superposition) of single-quark terms, as exemplified by Eq. (391) in which two single-quark terms add up to give the retarded Yang–Mills field generated by two quarks in the cold phase. The spontaneous symmetry deformation, responsible for the emergence of the gauge group which is a noncompact real form of the complex group SL $({\cal N},{\mathbb{C}})$ , occurs universally in all systems of $K$ quarks governed by the action (371).

The corresponding retarded Abelian solutions

[TABLE]

are linear combinations of single-quark solutions (380) with arbitrary real coefficients $q_{I}^{n}$ . The gauge group of these configurations is the initial SU $({\cal N})$ .

Varying $z_{I}^{\mu}$ in the action (371) gives the equation of motion for the $I$th bare quark

[TABLE]

Since we are to study the rearrangement in hot and cold phases, the retarded Abelian and non-Abelian solutions to the Yang–Mills equations with the source composed of $K$ quarks should be substituted into Eq. (395). However, the singularities of these solutions on the world lines preclude the direct execution of this plan. As before we invoke the Noether identity

[TABLE]

where

[TABLE]

$t^{\mu\nu}$ is given by (41), ${\cal E}^{a}_{\mu}$ and $\varepsilon^{\lambda}$ are respectively the left-hand sides of (377) and (395).

In the hot phase, the Yang–Mills equations linearize, and become almost identical to Maxwell’s equations. Accordingly, all results of Sec. 3.1 are reproduced with the only replacement $e^{2}\to q^{2}$ . The degrees of freedom appearing in the action (371) are rearranged on the extremals subject to the retarded condition to give dressed quarks and Yang–Mills radiation closely resembling such entities in electrodynamics. The behavior of a dressed quark is governed by the Abraham–Lorentz–Dirac equation (86),

[TABLE]

where $\tau_{0}$ is a characteristic time interval defined by (90) in which $e$ is replaced by $q$ .

As to the cold phase, three more points need to be made. First, a special magnitude for the color charge of every quark in a $K$-quark system evolving in the non-Abelian regime is selected by the Yang–Mills equations:

[TABLE]

Our principal interest here is with the overall minus sign of expression (399).

Second, the nonlinearity of the Yang–Mills equations is nevertheless compatible with the fact that retarded non-Abelian solutions are given by the sum of single-quark terms. Because all retarded non-Abelian solutions share this common property, the stress-energy tensor of the Yang–Mills field is written as

[TABLE]

where $\Theta_{I}^{\mu\nu}$ is built from the field generated by the $I$th quark, and $\Theta_{IJ}^{\mu\nu}$ contains mixed contributions of the fields due to the $I$th and $J$th quarks. Expression (400) is similar to that of $\Theta^{\mu\nu}$ in the Maxwell–Lorentz theory. We are thus entitled to reiterate the procedure used in Sec. 3.1 to reveal the rearrangement of Yang–Mills–Wong systems evolving in the non-Abelian regime.

Third, conformal invariance is apparently violated by the linearly rising terms of $A_{\mu}$ containing constants $\kappa$ which have dimension $({\rm length})^{-2}$ . Note, however, that

[TABLE]

whence it follows that any color singlet, such as $\Theta^{\mu\nu}$ , is free of the contributions violating conformal invariance. This is because the linearly rising terms depend upon either $E_{mn}^{+}$ or $E_{mn}^{-}$ , but not both. Although the linearly rising terms of $A_{\mu}$ contribute to the field strength $G_{\mu\nu}$ , as viewed in Eq. (383), conformal invariance is recovered on the level of observables. For instance, the force exerted on the $I$th quark by all other quarks

[TABLE]

involves only the ‘‘Liénard–Wiechert part’’ of the field strength $G_{\mu\nu}^{{\rm LW}}$ . The explanation is simple. The force (402) includes the scalar product of two color vectors $G_{\mu\nu}-G_{\mu\nu}^{{\rm LW}}$ and $Q_{I}$ . They are not arbitrary; the exact solutions constrain these vectors to be orthogonal to each other.

On this basis we arrive at the following conclusions [91], [92], [95], [99]. In the cold phase, an accelerated quark gains, rather than loses, energy by emitting the Yang–Mills radiation. To see this, one has to derive the emitted four-momentum

[TABLE]

making clear the fact that

[TABLE]

which is construed as absorbing convergent waves of positive energy, or, alternatively, as emitting divergent waves of negative energy. Here, $m$ is the renormalized mass of the $I$th quark (from here on, the label $I$ is omitted), and $\ell$ stands for a characteristic length

[TABLE]

It is interesting to compare this parameter with the characteristic length $\tau_{0}$ in the Maxwell–Lorentz electrodynamics which is thought of as an effective theory resulting from quantum electrodynamics at long distances. The validity of this effective theory is limited by a cutoff related to the Compton wavelength of the electron, $\lambda_{e}=3.86\cdot 10^{-11}$ cm. At shorter distances, the effect of pair creation becomes appreciable. Likewise, we may understand the Yang–Mills–Wong theory as an effective theory of low-energy quantum chromodynamics, and associate the cutoff with the Compton wavelength of the quark, $\lambda_{q}$ . But unlike $\tau_{0}$ , which is proportional to $e^{2}\approx 1/137$ , the characteristic length in the cold phase $\ell$ is inversely related to $g^{2}$ . In the strong coupling regime $g\sim 1$ , the parameter $\ell$ is of the order of the Compton wavelength of the quark. However, if $g\ll 1$ , then $\ell\gg\lambda_{q}$ , so that all phenomena specified by $\ell$ fall in the range of validity of this classical theory, being immune to the effect of pair creation.
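The quoted cutoff scale can be checked with a short calculation. The sketch below evaluates the reduced Compton wavelength $\hbar/mc$ from the standard value $\hbar c\approx 197.327$ MeV fm. The electron mass of $0.511$ MeV is the standard value, and the quark input of $300$ MeV is taken as an illustrative constituent mass; the helper name is hypothetical.

```python
HBAR_C_MEV_FM = 197.327  # hbar * c in MeV * fm
FM_TO_CM = 1.0e-13       # one femtometre in centimetres

def reduced_compton_cm(mass_mev):
    """Reduced Compton wavelength hbar / (m c) in cm for a rest energy in MeV."""
    return HBAR_C_MEV_FM / mass_mev * FM_TO_CM

lambda_e = reduced_compton_cm(0.511)   # electron
lambda_q = reduced_compton_cm(300.0)   # assumed constituent quark mass
print(lambda_e, lambda_q)
```

The electron value reproduces $\lambda_{e}\approx 3.86\cdot 10^{-11}$ cm, which shows that the figure in the text refers to the reduced Compton wavelength. With the assumed quark mass, $\lambda_{q}$ comes out somewhat below one femtometre.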

The four-momentum of a dressed quark in the cold phase is

[TABLE]

and therefore

[TABLE]

If the acceleration exceeds its critical value ${a}^{2}_{c}=-\ell^{-2}$ (which is another way of stating that the momentum transfer is greater than $\ell^{-1}$ ), the quark becomes a tachyon. Note, however, that light constituent quarks have mass of about $300$ MeV which is close to the deconfinement transition temperature $T_{c}\approx 200\pm 50$ MeV. With reference to what was said immediately after Eq. (405), this suggests that the attainment of ${a}^{2}_{c}$ may trigger a transition between the cold and hot phases rather than produce a tachyonic state.

The local energy-momentum balance

[TABLE]

applies to the cold phase. Here, $p^{\mu}$ and ${\cal P}^{\mu}$ are defined respectively by Eqs. (406) and (403), and ${\dot{\wp}}^{\mu}=-f^{\mu}$ , with $f^{\mu}$ being given by (402). According to this balance, the four-momentum $d{\wp}^{\mu}=-f^{\mu}ds$ , extracted from an external field during an infinitesimal interval $ds$ , is used for changing the four-momentum of a dressed cold quark $dp^{\mu}$ and emitting the Yang–Mills radiation four-momentum ${d{{\cal P}}}^{\mu}$ with negative energy content.

Equation (408) can be rewritten to give the equation of motion for a dressed quark

[TABLE]

The only qualitative difference between the equation of motion for a dressed quark in the cold phase, Eq. (409), and that in the hot phase, Eq. (398), is the overall sign of the parenthesized term. To appreciate this difference, let us assume that $f^{\mu}=0$ . While the general solution to equation (398) takes the runaway form (145), the general solution to equation (409) proves to be

[TABLE]

Here, $V^{\mu}$ and $U^{\mu}$ are constant four-vectors such that $V\cdot U=0$ , $V^{2}=-U^{2}=1$ , and $\nu_{0}$ and $w_{0}$ are arbitrary parameters. This self-decelerated solution should be supplemented with the Haag asymptotic condition (74); otherwise the radiated four-momentum (404) will be divergent. This requirement is fulfilled only for $w_{0}=0$ . Therefore, a free dressed quark governed by Eq. (409) moves along a straight world line $v^{\mu}=$ const.
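The contrast between runaway and self-decelerated motion can be traced in a worked reduction for rectilinear motion. The sketch assumes the signature $(+,-,-,-)$ , a rapidity parametrization $\alpha(s)$ of the four-velocity, and a generic positive constant $b$ standing for the coefficient of the parenthesized term in Eqs. (398) and (409). The value of $b$ in each phase is not reproduced here, and the upper and lower signs are meant to mimic the hot and cold phases respectively.

```latex
\begin{align}
v^{\mu} &= V^{\mu}\cosh\alpha + U^{\mu}\sinh\alpha , \qquad
u^{\mu} \equiv V^{\mu}\sinh\alpha + U^{\mu}\cosh\alpha , \qquad
u^{2} = -1, \quad u\cdot v = 0, \\
a^{\mu} &= \dot\alpha\, u^{\mu}, \qquad a^{2} = -\dot\alpha^{2}, \qquad
\dot a^{\mu} = \ddot\alpha\, u^{\mu} + \dot\alpha^{2}\, v^{\mu}, \\
\dot a^{\mu} + a^{2} v^{\mu} &= \ddot\alpha\, u^{\mu}, \\
m\, a^{\mu} \mp b\,\bigl(\dot a^{\mu} + a^{2} v^{\mu}\bigr) = 0
&\;\Longrightarrow\; m\,\dot\alpha \mp b\,\ddot\alpha = 0
\;\Longrightarrow\; \dot\alpha = w_{0}\, e^{\pm s/T}, \qquad T = b/m .
\end{align}
```

With the upper sign the acceleration grows exponentially in the future, which is the runaway behavior. With the lower sign it dies out as $s\to\infty$ but grows without bound as $s\to-\infty$ , so that only $w_{0}=0$ gives a vanishing acceleration in both asymptotic regions.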

5 CLASSICAL SELF-INTERACTING STRINGS

Do classical strings undergo self-interaction? At first glance, strings are systems equipped with enough symmetry to be unstable. This is indeed the case. But the mechanism for revealing self-interaction is rather subtle.

For simplicity, we restrict our attention to bosonic strings. We remind the reader of some elements of their description [71], [122]. The points of a string are specified by spacetime coordinates $X^{\mu}$ . During its motion, the string sweeps out a two-dimensional surface in Minkowski space, $X^{\mu}=X^{\mu}(\sigma,\tau)$ , called the world sheet. The coordinates $\tau$ and $\sigma$ parameterize the world sheet: $\sigma$ labels the position of a point on the string and $\tau$ measures its time evolution. World sheets are assumed to be timelike, smooth surfaces, that is, a two-dimensional plane tangent to the world sheet is spanned by a timelike and a spacelike vector, ${\dot{X}}_{\mu}={\partial X_{\mu}}/{\partial\tau}$ and ${X^{\prime}}_{\!\mu}={\partial X_{\mu}}/{\partial\sigma}$ . By analogy with the action for a particle, Eq. (32), which is proportional to the length of the world line, the action for a string is taken to be proportional to the area of the world sheet:

[TABLE]

A classical string moves so as to minimize the area of the world sheet, with initial and final positions of the string being fixed,

[TABLE]

Taking the boundary conditions

[TABLE]

we arrive at the Euler–Lagrange equations

[TABLE]

nonlinear partial differential equations for $X^{\mu}$ . This is not the whole story, however. The change of variables

[TABLE]

where $F$ and $G$ are smooth functions, leaves the action (411) invariant. Transformations (415) form the gauge group of the string. To eliminate the gauge freedom of the string, one may impose two gauge fixing conditions. A convenient choice is

[TABLE]

whose geometrical significance is that the coordinate lines $\tau=$ const and $\sigma=$ const are orthogonal and uniformly parametrized, hence the name ‘‘orthonormal gauge’’. With the gauge (416), the Euler–Lagrange equations (414) simplify

[TABLE]

String coordinates in the orthonormal gauge obey the wave equation. This result makes it clear that the string dynamics at the world sheet is devoid of self-interaction. The same is true for superstrings.

The boundary conditions (413) become

[TABLE]

These are Neumann boundary conditions. Using (416) and (418), we obtain

[TABLE]

that is, end points of strings obeying these boundary conditions move at the speed of light.
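These statements can be checked on the textbook example of a rigidly rotating open string. The sketch below assumes the standard form of the orthonormal gauge conditions, $\dot{X}\cdot X^{\prime}=0$ and $\dot{X}^{2}+{X^{\prime}}^{2}=0$ , the parameter range $0\leq\sigma\leq\pi$ , and the signature $(+,-,-)$ in a three-dimensional slice of Minkowski space. The particular solution and all function names are illustrative choices.

```python
import math

def minkowski(a, b):
    """Scalar product with signature (+, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2]

def X(tau, sigma):
    """Rigidly rotating open string of unit half-length."""
    return (tau, math.cos(sigma) * math.cos(tau), math.cos(sigma) * math.sin(tau))

def X_dot(tau, sigma):
    """Derivative of X with respect to tau."""
    return (1.0, -math.cos(sigma) * math.sin(tau), math.cos(sigma) * math.cos(tau))

def X_prime(tau, sigma):
    """Derivative of X with respect to sigma."""
    return (0.0, -math.sin(sigma) * math.cos(tau), -math.sin(sigma) * math.sin(tau))

def gauge_residuals(tau, sigma):
    """Left-hand sides of the two assumed orthonormal gauge conditions."""
    d, p = X_dot(tau, sigma), X_prime(tau, sigma)
    return minkowski(d, p), minkowski(d, d) + minkowski(p, p)

def wave_residual(tau, sigma, h=1e-3):
    """Finite-difference estimate of X_tautau - X_sigmasigma, component by component."""
    c = X(tau, sigma)
    tt = [(X(tau + h, sigma)[i] - 2 * c[i] + X(tau - h, sigma)[i]) / h**2 for i in range(3)]
    ss = [(X(tau, sigma + h)[i] - 2 * c[i] + X(tau, sigma - h)[i]) / h**2 for i in range(3)]
    return max(abs(t - s) for t, s in zip(tt, ss))

def speed(tau, sigma):
    """Coordinate speed of the string point labelled by sigma."""
    d = X_dot(tau, sigma)
    return math.hypot(d[1], d[2]) / d[0]

print(speed(0.3, 0.0), speed(0.3, math.pi / 2), speed(0.3, math.pi))
```

The end points $\sigma=0,\pi$ satisfy the Neumann condition $X^{\prime}=0$ and move at unit speed, while the midpoint $\sigma=\pi/2$ is at rest.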

Alternatively, one may adopt Dirichlet boundary conditions

[TABLE]

which imply that $\delta X^{\mu}=0$ in the last term of (412). This term can also be set to zero if we impose periodic boundary conditions

[TABLE]

These relations are suitable for closed strings in the orthonormal gauge.

A free open string can be coupled to an external electromagnetic field by adding an interaction term to the free action. This term is to be chosen in a form preserving most, or, better still, all symmetries of the free action. The only Poincaré and gauge invariant expression is

[TABLE]

where $e$ stands for the electric charge of the string, and $F_{\mu\nu}=\partial_{\mu}A_{\nu}-\partial_{\nu}A_{\mu}$ is the external electromagnetic field. Because

[TABLE]

(422) equals

[TABLE]

plus two terms at $\tau=\tau_{1}$ and $\tau=\tau_{2}$ , which do not contribute to the Euler–Lagrange equations. It is clear from (423) that the charge of an open string is located at its ends. As for a closed string, the charge may be distributed over its entire length.
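The reduction of the bulk interaction term to the string ends rests on a chain-rule identity, written out here as a worked check. The overall normalization of the interaction term is not reproduced; only the structure of the integrand is used, with $A_{\mu}$ evaluated at $X(\tau,\sigma)$ .

```latex
\begin{align}
\partial_{\tau}\bigl(A_{\nu} X'^{\nu}\bigr)
  &= \partial_{\mu}A_{\nu}\,\dot X^{\mu} X'^{\nu} + A_{\nu}\,\dot X'^{\nu}, \\
\partial_{\sigma}\bigl(A_{\mu} \dot X^{\mu}\bigr)
  &= \partial_{\nu}A_{\mu}\,X'^{\nu}\dot X^{\mu} + A_{\mu}\,\dot X'^{\mu}, \\
\partial_{\tau}\bigl(A_{\nu} X'^{\nu}\bigr) - \partial_{\sigma}\bigl(A_{\mu} \dot X^{\mu}\bigr)
  &= \bigl(\partial_{\mu}A_{\nu} - \partial_{\nu}A_{\mu}\bigr)\dot X^{\mu} X'^{\nu}
   = F_{\mu\nu}\,\dot X^{\mu} X'^{\nu}.
\end{align}
```

Integrating over the world sheet, the $\partial_{\sigma}$ term leaves $A_{\mu}\dot{X}^{\mu}$ evaluated at the two values of $\sigma$ that label the ends, with opposite relative signs in this convention. These are line integrals of the potential along the world lines of the end points, as for point charges. The $\partial_{\tau}$ term leaves only contributions at $\tau_{1}$ and $\tau_{2}$ .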

Adding (423) to the free action leaves the Euler–Lagrange equations unchanged, but the Neumann boundary conditions (418) become

[TABLE]

This consideration illuminates a peculiar feature of string interactions: open strings interact with each other at their ends. On the quantum level, strings interact locally, without mediation of long-range fields. Open strings may join when their ends contact, a single open string may spontaneously split into two pieces, or become closed, or emit a closed string, etc. Joining and splitting are the basic interactions of strings. This form of interaction respects all symmetries of free strings.

It is significant that joining and splitting of strings change the topological structure of their world sheets.

The question now arises of whether joining and splitting are realizable in the classical picture. It is highly improbable that free open strings can move in such a way as to bring their ends into contact with each other at some point of a three-dimensional arena. To be more specific, the probability measure of such events is zero. That is why joining of open strings may be ignored in the classical context. Furthermore, a classical string seems to be incapable of splitting. Strings can be indefinitely stretched without any evidence of being favourably disposed towards splitting. There is no elastic limit for extended objects governed by the action (411). The only dimensional parameter in the action is an overall factor $T/2\pi$ which defines the scale of length. Classical strings are thus immune from compulsory splittings. It remains to see whether classical strings can split at random. A close look at the realization of Laplace’s determinism in the classical picture shows that this is the case.

The classical picture is associated with Laplace’s determinism. Of course, classical statistical mechanics invokes probability theory, but the reason for this is that uncertainties of this description may be attributed to lack of knowledge of actual deterministic histories of macroscopic systems which have too many degrees of freedom to be completely controlled.

Worthy of mention are also chaotic systems. Although chaotic dynamics is formulated by means of probability theory, ‘‘chaotic’’ is not to be confused with ‘‘random’’. Classical chaotic systems are governed by deterministic laws, but their histories are given by highly tangled trajectories. Motions displaying extreme sensitivity to initial conditions are taken to be chaotic. Complexity effects in the behavior of unstable classical systems are a major manifestation of the state of chaos. The apparent indeterminism in the behavior of chaotic systems is then fictitious; it is due to imperfect knowledge of initial conditions.

The methods of statistical physics and chaotic dynamics have considerable utility in string problems of experimental interest, notably in cosmic strings [151], [6]. However, in the discussion that follows, we omit these topics, and focus on the amenability of classical strings to splitting at random.

Laplace’s determinism holds for a given classical system if the Cauchy problem for the dynamical equations of this system has a unique solution. This requirement is generally believed to be imperative in classical physics. Nevertheless, there are classical systems that run counter to Laplace’s determinism. Examples of systems whose behavior can be regarded as truly random are given below [102].

Let two particles be moving towards each other along a straight line. Having spent their kinetic energy on overcoming the interparticle repulsion by the instant of their meeting, these particles merge into a single point aggregate.

Since our interest is with the final stage of this head-on collision when the particles move slowly, the use of the nonrelativistic approximation seems to be accurate. The two-particle problem can be brought to a one-particle problem by introducing the relative coordinate $q=x_{2}-x_{1}$ , the reduced mass $m=m_{1}m_{2}/(m_{1}+m_{2})$ , and the potential energy $U(q)$ . The problem is then to describe the motion of this particle in $U(q)$ , so that its velocity vanishes on its arrival at the top of the potential hill, see Fig. 1.

Let the top be located at $q=0$ , $U_{\rm max}=U(0)$ , and the instant that the particle comes to this point be $t=0$ . The fact that the velocity is vanishing at $q=0$ implies that the total energy is zero,

[TABLE]

The time it takes for the particle to arrive at the top is therefore

[TABLE]

What is the behavior of the particle after its arrival at the top? If the integral in (426) diverges, then the question of the subsequent evolution does not arise because the ascent takes an infinite period. Such is the case for a function $U(q)$ analytic at $q=0$ , say for $U(q)=-U_{0}\,q^{2}$ . However, the integral is finite for

[TABLE]

and the like.
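The dividing line between the two cases can be seen in a worked estimate of the climb time. The sketch assumes a power-law hill $U(q)=-U_{0}|q|^{2\alpha}$ with $U_{0}>0$ , which may differ in notation from the examples of the text, and a starting point $q_{0}>0$ ; the zero-energy condition gives $\dot{q}^{2}=-2U(q)/m$ .

```latex
\begin{align}
t &= \sqrt{\frac{m}{2}} \int_{0}^{q_{0}} \frac{dq}{\sqrt{-U(q)}}
   = \sqrt{\frac{m}{2U_{0}}} \int_{0}^{q_{0}} q^{-\alpha}\, dq , \\
t &= \sqrt{\frac{m}{2U_{0}}}\; \frac{q_{0}^{\,1-\alpha}}{1-\alpha} < \infty
   \qquad (0 < \alpha < 1), \\
t &= \sqrt{\frac{m}{2U_{0}}}\; \lim_{\epsilon\to 0} \ln\frac{q_{0}}{\epsilon} = \infty
   \qquad (\alpha = 1).
\end{align}
```

The quadratic hill, $\alpha=1$ , is thus reached only after an infinite time, whereas any hill with a sharper top, $\alpha<1$ , is climbed in a finite time.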

Equation (425) is invariant under time reversal. Furthermore, $q(t)=0$ is another solution to (425). Therefore, if the climb takes a finite period of time, then an infinity of options is available: after staying at the top for an arbitrary period of time ${\cal T}$ , the particle can start to descend, realizing the reversed order of events. Analytically,

[TABLE]

where ${\tt Q}\,(t)$ is the inverse of $t(q)$ defined in (426). Going back to the initial two-particle problem, we see that the aggregate of two merged particles spontaneously disintegrates into its constituents after a lapse of an arbitrary period ${\cal T}$ , and the particles move apart.

By the Picard theorem, the solution to the Cauchy problem for the ordinary differential equation (425) with the initial condition $q(0)=0$ is unique if the Lipschitz condition holds: $\sqrt{-U(q)}<C\,|q|$ . Clearly, this inequality fails for $U(q)$ given by any of Eqs. (427)–(429).
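The resulting non-uniqueness can be exhibited explicitly. The sketch below assumes the linear hill $U(q)=-k|q|$ with $k>0$ , a hypothetical example chosen for its closed-form solution, and verifies that for every waiting time ${\cal T}$ the trajectory that rests at the top and then slides down satisfies the same zero-energy equation $\dot{q}=\sqrt{-2U(q)/m}$ with the same initial data $q(0)=0$ .

```python
import math

K = 1.0  # slope of the assumed linear hill U(q) = -K * |q|
M = 1.0  # reduced mass

def q(t, T):
    """Trajectory that stays at the top until t = T and then descends."""
    return 0.0 if t <= T else 0.5 * (K / M) * (t - T) ** 2

def q_dot(t, T):
    """Velocity along the same trajectory."""
    return 0.0 if t <= T else (K / M) * (t - T)

def residual(t, T):
    """Deviation from the zero-energy equation q_dot = sqrt(2 K |q| / M)."""
    return q_dot(t, T) - math.sqrt(2.0 * K * abs(q(t, T)) / M)

def lipschitz_ratio(x):
    """Ratio sqrt(-U(x)) / |x|; a Lipschitz bound would keep it finite as x -> 0."""
    return math.sqrt(K * abs(x)) / abs(x)

for T in (0.0, 1.0, 2.5):
    worst = max(abs(residual(0.05 * n, T)) for n in range(101))
    print(T, q(3.0, T), worst)
```

All members of the family solve the same Cauchy problem yet differ at later times, while the ratio $\sqrt{-U(q)}/|q|$ grows without bound as $q\to 0$ , in line with the failure of the Lipschitz condition.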

The potentials $U(q)$ which can be visualized as hills are thus divided into two classes: the $U$ ’s whose top is a point of unstable equilibrium in the conventional sense, and the $U$ ’s whose top is a point of an ‘‘over-unstable’’ equilibrium. The state of equilibrium on the former top is kept until a small external perturbation occurs. By contrast, the state of equilibrium on the latter top can be violated spontaneously, with no external cause. The Lipschitz condition is sufficient but not necessary for stability against spontaneous violation. Convergence of the integral in Eq. (426) may serve as a necessary condition for an unstable equilibrium to be classified as over-unstable.

One may disregard this argument for at least two reasons. First, the potentials $U(q)$ shown in (427)–(429) are unlikely to bear on physical reality. Second, time reversal is crucial for the spontaneous breakdown of equilibrium to occur. But the dynamics of an accelerated charged particle is dissipative because it radiates electromagnetic energy. A similar situation holds for particles carrying non-Abelian charges whose dynamics is also irreversible. Therefore, the exact solution (430) is no longer valid for such particles.

Both objections are disproved if we turn to ${\mathbb{R}}_{1,1}$ . First, Eq. (185) shows that the time component of the retarded vector potential is given by $A_{0}=-e\,|\,q\,|$ which falls into the type of Eq. (427) on putting $\alpha=\frac{1}{2}$ , see Fig. 1 (b). Second, it was established in Sec. 3.2.2 that charged particles in ${\mathbb{R}}_{1,1}$ do not radiate, and all processes are locally reversible. Therefore, the above indeterministic phenomenon is feasible here.

Indeed, let us choose the barycentric frame, assume that the particles have equal masses $m$ and charges $e$ , and use the notation $w=e^{2}/m$ . Then the exact solution to the Cauchy problem for the set of equations governing a closed system ‘‘two charged particles plus electromagnetic field in ${\mathbb{R}}_{1,1}$ ’’, having zero total energy, is given by

[TABLE]

which describes two world lines $z^{\mu}_{1}(s)$ and $z^{\mu}_{2}(s)$ that coalesce at $s=s^{\ast}$ , and separate at $s=s^{\ast\ast}=s^{\ast}+{\cal T}$ . Here, $s^{\ast}$ and ${\cal T}$ are arbitrary positive constants. If $s^{\ast}$ and $s^{\ast\ast}$ are different and finite, then (431) and (432) describe the history of the aggregate with finite life time. If $s^{\ast\ast}\to\infty$ , we obtain the history of a stable aggregate formed at a finite instant. In the limit $s^{\ast}\to-\infty$ , the solution tells us that the aggregate arose in the infinitely remote past, and its decay occurs at a finite instant. For $s^{\ast}\to-\infty$ and $s^{\ast\ast}\to\infty$ , the solution becomes a straight line corresponding to an absolutely stable aggregate, and for $s^{\ast}=s^{\ast\ast}$ , the solution describes an aggregate existing during a single moment. We thus have a continuum of solutions because ${\cal T}$ is arbitrary. In physical terms, the aggregate can disintegrate quite accidentally at any instant after its formation.

It is significant that the solution (431) and (432) can be realized only on condition that the total energy of the system is zero. Clearly the initial data of the Cauchy problem describing the collision of two charged particles which merge into a single aggregate after their meeting constitute a measure zero set. In contrast, the spontaneous disintegration of the aggregate is a physically tangible event. The global dynamics of the electromagnetic realm in ${\mathbb{R}}_{1,1}$ is thus effectively irreversible: the formation of the discussed aggregates is highly improbable, whereas their disintegration may well take place. The usual outcome of electromagnetic self-interaction in the classical picture, the violation of invariance under time reversal, has emerged again.

This analysis gives us an inkling of the self-interaction mechanism peculiar to classical strings. A similarity of the above two-particle system living in ${\mathbb{R}}_{1,1}$ to strings lies in the fact that the potential energy of a string varies linearly with distance between two somehow labelled points of this string. A closed charged string may then be thought of as a chain whose elements are point aggregates of merged charged particles. We assume that this assemblage is feasible, but leave aside the particular way in which it is made. This enables us to avoid the foregoing conundrum: to ensure that two colliding particles amalgamate in a single aggregate, their total energy must be exactly zero, and so the initial data of the corresponding Cauchy problem constitute a measure zero set. To settle this question, we switch from a particular element of the chain to a continual set of identical aggregates constituting a closed string, and assume that the availability of this set is afforded by its cardinality. We thus reason in the opposite way by saying that if it is granted that a free closed charged string is capable of spontaneous splitting into two closed charged strings, then extending analytically the history of this disintegration back in time, according to Eqs. (431) and (432), we would restore the prehistory, and this would reinforce the statement that the rest energy of the aggregate whose splitting is responsible for the occurrence of a new string is indeed zero.

Therefore, classical self-interaction of a closed charged string manifests itself in its capability of spontaneously creating other closed charged strings.

On the other hand, the mechanism underlying the history shown in (431) and (432) is unsound for joining of classical strings. To see this, imagine two closed charged strings coming in contact at some point $x^{\mu}$ . Consider the colliding points of the strings just before their meeting at $x^{\mu}$ in the barycentric frame. The condition that zero total energy is necessary for these points to merge appears extremely exotic. To aggravate matters, the collision happens in ${\mathbb{R}}_{1,3}$ to which the exact solution (431) and (432) is unrelated.

The vast disparity between joining and splitting of strings in the classical picture can be regarded as an effective violation of the property of time reversal invariance: the emission of closed charged strings is a unidirectional process.

6 SELF-INTERACTION IN GENERAL RELATIVITY

Self-interaction of gravitating systems is much different from that of other classical gauge field systems. Restricting our discussion to General Relativity (GR), we note that a closed gravitating system has an infinitely large number of topological phases which unceasingly change, as exemplified by the occurrence of more and more black holes during the history of the Universe. An important point is that these transmutations are irreversible. Once converted into a black hole, a massive star can never regain its previous state. It seems likely that the irreversibility is the only common property which is shared by gravitating and other gauge field systems.

The other manifestation of self-interaction, the rearrangement, is missing from GR. Indeed, the effect of the rearrangement can be summarized in the local energy-momentum balance, Eq. (130). Of particular interest is the case $f^{\mu}=0$ , in which the rate of change of the energy-momentum of a dressed particle equals the negative of the emission rate, Eq. (147). This is tantamount to the statement that the action–reaction principle holds for the given system. But the action–reaction principle is precisely what is incompatible with the equivalence principle underlying GR. To see this, one should observe that a particle of mass $m$ is governed by the geodesic equation

[TABLE]

which is independent of $m$ , while the field equation with the delta-function source

[TABLE]

shows that the greater $m$ is, the stronger the generated gravitational field. Although the gravitational field acts on every particle in a uniform way, whatever the value of $m$ , the influence of particles on the state of the gravitational field is different for different $m$ . This is contrary to the action–reaction principle [43].

The failure of the action–reaction principle can be clarified using the old reasoning by Planck [119] who regarded this principle as the rationale of momentum conservation. By Noether’s first theorem [114], the energy-momentum is conserved due to invariance of the action under spacetime translations. But the requirement of translational invariance does not apply to curved manifolds, and so gravitating systems are generally devoid of conserved momentum and energy. Furthermore, the very construction of momentum and energy inspired by Noether’s first theorem is no longer defined.

To avoid this conclusion, one normally turns to field-theoretic treatments of gravity, which go back to Rosen [132]. Such treatments are feasible if the gravitational field can be granted to be ‘‘sufficiently weak’’,

[TABLE]

Here, a second-rank tensor field $\phi_{\mu\nu}$ is assumed to be defined in a flat background ${\mathbb{R}}_{1,3}$ with the metric tensor $\eta_{\mu\nu}$ , and has small components in every Lorentz frame, $|\phi_{\mu\nu}|\ll 1$ . The symmetry properties of ${\mathbb{R}}_{1,3}$ afford energy-momentum conservation through the standard Noether argument. The field-theoretic treatment is valid as long as the mapping of $g_{\mu\nu}$ into $\phi_{\mu\nu}$ shown in (435) is bijective and smooth, that is, every curved spacetime configuration, associated with a gravitational effect, can be smoothly covered with a single coordinate patch. Bimetric theories of gravitation have been the objective of much recent research (for a review see [138]).

GR leaves room for both weak and strong gravity. Strong gravitational effects are associated with large warpings of spacetime, that is, with large values of the components of $R^{\lambda}{}_{\mu\nu\rho}$. However, a characteristic curvature whereby such warpings might be rated as “drastic” is absent from GR. It seems reasonable to base the criterion for discriminating between strong and weak gravity on whether or not the topology of spacetime can be altered [43]. According to this criterion, the gravitational field is weak if the topology of spacetime is identical to that of ${\mathbb{R}}_{1,3}$, otherwise it is strong. The weakness of the gravitational field is thus a qualitative rather than quantitative concept.

The field equations (434) tell nothing about the topological properties of their solutions because differential equations are local in character. A global solution can be formed by gathering its infinitesimal pseudoeuclidean fragments. The topology of the resulting solution may differ from that of ${\mathbb{R}}_{1,3}$ if the assembly is subject to a restrictive boundary condition. To illustrate, we refer to the Schwarzschild metric [142],

[TABLE]

where $r_{\rm S}={2G_{\rm N}M}$ is the Schwarzschild radius, and $d{\Omega}$ is the round metric on a sphere $S^{2}$. A three-dimensional spacelike surface $\Sigma_{3}$ endowed with this metric has a twofold geometric interpretation. First, it looks like a “bridge” between two otherwise Euclidean spaces, and, second, it may be regarded as the “throat of a wormhole” connecting two distant regions in one Euclidean space in the limit when the separation of the wormhole mouths is very large compared to the circumference of the throat [63].
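
The bridge picture can be made concrete with a standard embedding. The sketch below assumes the usual Schwarzschild coordinates, the signature $(+,-,-,-)$, and $d\Omega^{2}=d\theta^{2}+\sin^{2}\theta\,d\varphi^{2}$; these conventions may differ from those of Eq. (436).

```latex
% Schwarzschild line element in standard coordinates
ds^{2} = \Bigl(1-\frac{r_{\rm S}}{r}\Bigr)dt^{2}
       - \Bigl(1-\frac{r_{\rm S}}{r}\Bigr)^{-1}dr^{2}
       - r^{2}\,d\Omega^{2} ,
% Induced (positive-definite) metric on the slice t = const
d\ell^{2} = \Bigl(1-\frac{r_{\rm S}}{r}\Bigr)^{-1}dr^{2} + r^{2}\,d\Omega^{2} ,
\qquad r \ge r_{\rm S} ,
% Equatorial section embedded in Euclidean space with cylindrical coordinates (r, \varphi, z)
z^{2} = 4\,r_{\rm S}\,(r - r_{\rm S}) .
```

On this surface $1+(dz/dr)^{2}=(1-r_{\rm S}/r)^{-1}$, so the embedding reproduces the radial part of $d\ell^{2}$. The two branches $z>0$ and $z<0$ are asymptotically flat sheets joined at the throat $r=r_{\rm S}$, whose circumference is $2\pi r_{\rm S}$.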

The price for the multiplicity of topologically distinct phases in GR is the absence of the Killing vector fields responsible for energy and momentum conservation. Nevertheless, some models of the Universe can be equipped with the total energy-momentum $P^{\mu}$. An “island of matter surrounded by emptiness” is a good case in point [112]. Whatever the topological setup of the island, spacetime is asymptotically flat, which provides a means for the Hamiltonian formulation of this system [7], [8], [9]. More specifically, one supposes that $g_{\mu\nu}$ approaches $\eta_{\mu\nu}$ at spatial infinity sufficiently rapidly, namely

[TABLE]

The second condition shown in Eq. (437) is needed for the Lagrangian function

[TABLE]

with the Lagrangian density

[TABLE]

to be convergent. Note that the volume integral in (438) diverges for the Schwarzschild solution expressed in terms of the original Schwarzschild coordinates, appearing in (436), because ${\cal L}=O(1)$ as $r\to\infty$. But the use of isotropic coordinates (see, e.g., [103]), which convert the Schwarzschild metric (436) to

[TABLE]

gives ${\cal L}=O(1/\varrho^{4})$ . This ensures the convergence of the Lagrangian (438). This example shows that both the asymptotic flatness

[TABLE]

and a good choice of coordinates share in the responsibility for the convergence of additive quantities such as the Lagrangian and Hamiltonian.
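
For reference, the change of radial variable behind this example can be written out. The following sketch uses the textbook form of isotropic coordinates with the signature $(+,-,-,-)$; it is not a transcription of Eq. (440), whose conventions may differ.

```latex
r = \varrho\,\Bigl(1+\frac{r_{\rm S}}{4\varrho}\Bigr)^{2} ,
\qquad
ds^{2} = \Bigl(\frac{1 - r_{\rm S}/4\varrho}{1 + r_{\rm S}/4\varrho}\Bigr)^{2}dt^{2}
       - \Bigl(1+\frac{r_{\rm S}}{4\varrho}\Bigr)^{4}
         \bigl(d\varrho^{2} + \varrho^{2}\,d\Omega^{2}\bigr) .
```

In these coordinates the spatial metric is conformally flat. In Cartesian-like coordinates its components differ from the flat ones by terms of order $r_{\rm S}/\varrho$, and their first derivatives fall off as $1/\varrho^{2}$, which is consistent with the stated decay of ${\cal L}$ if the density is quadratic in first derivatives of the metric.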

Armed with the Hamiltonian formulation of Arnowitt–Deser–Misner [7], [8], [9], one would expect the total energy-momentum $P^{\mu}$ to be unambiguously defined. Let us check whether this expectation is correct, restricting ourselves to $E=P^{0}$ for simplicity.

The total energy

[TABLE]

with ${\cal H}$ being a cumbersome construction immaterial for the present discussion (see, e. g., [58]) can be transformed [127] to a simple expression,

[TABLE]

Here, the integral is taken over a 2-dimensional surface at spatial infinity.

Schoen and Yau [139] and Witten [155] were able to demonstrate that an isolated gravitating system having non-negative local mass density has non-negative total energy $E$. Applying the surface integral (443) to the Schwarzschild configuration generated by a point particle of mass $m$, one arrives at the conclusion that

[TABLE]
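
The value of $E$ for the Schwarzschild configuration can be checked directly. The sketch below assumes the standard Arnowitt–Deser–Misner surface integral written for the positive-definite spatial metric $\gamma_{ij}$ in asymptotically Cartesian coordinates, and the isotropic form $\gamma_{ij}=(1+r_{\rm S}/4\varrho)^{4}\delta_{ij}$; the symbol $\gamma_{ij}$, the normalization, and the index placement are assumptions and may differ from Eq. (443).

```latex
E = \frac{1}{16\pi G_{\rm N}}\oint_{\varrho\to\infty} dS_{i}\,
    \bigl(\partial_{j}\gamma_{ij} - \partial_{i}\gamma_{jj}\bigr) ,
\qquad
\gamma_{ij} \simeq \Bigl(1+\frac{r_{\rm S}}{\varrho}\Bigr)\delta_{ij} ,
% with dS_i = n_i \varrho^2 d\Omega and n_i = x_i/\varrho
\partial_{j}\gamma_{ij} - \partial_{i}\gamma_{jj}
  = -2\,\partial_{i}\frac{r_{\rm S}}{\varrho}
  = \frac{2\,r_{\rm S}\,n_{i}}{\varrho^{2}} ,
\qquad
E = \frac{1}{16\pi G_{\rm N}}\cdot 4\pi\cdot 2\,r_{\rm S}
  = \frac{r_{\rm S}}{2G_{\rm N}} = M .
```

With $M=m$ this reproduces the conclusion above. Only the $1/\varrho$ tail of the spatial metric contributes to the integral, so a coordinate change that alters this tail can alter $E$.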

Meanwhile we should take proper account of the freedom to foliate spacetime into spacelike sections consistent with asymptotic symmetries of the manifold. Is it possible to relax the asymptotic condition (437) in such a way as to preserve the asymptotic flatness (441) and the convergence of pertinent additive quantities in this Hamiltonian formulation? To be more precise, let us proceed from the metric $g_{\mu\nu}$ with the asymptotic behavior (437), and map the initial grid of spatial coordinates, $\{x_{i}\}$ , into a new one, $\{{\tilde{x}}_{i}\}$ ,

[TABLE]

where $f$ is an arbitrary regular function subject to the following conditions:

[TABLE]

and

[TABLE]

Condition (447) is necessary and sufficient for the mapping (445) to be invertible because

[TABLE]

To illustrate, we refer to a mapping proposed in [47],

[TABLE]

where $\alpha$ and $\epsilon$ are arbitrary nonzero numbers, and $l$ is an arbitrary parameter of dimension of length. This is a bijective monotonic regular mapping $r\to{\tilde{r}}$ which becomes $1$ as $\epsilon\to 0$ . The leading asymptotic terms of spatial components of the metric and those of the Christoffel symbols are

[TABLE]

It follows that the Lagrangian density behaves as

[TABLE]

which provides the convergence of the volume integral (438) and other additive quantities of this kind. Note also that the asymptotic flatness, Eq. (441), is still the case.

It is instructive to apply the mapping (445), with $f$ defined in (449), to the Schwarzschild metric written in terms of isotropic coordinates (440). Denisov and Solov’ev [47] showed that the total energy of the Schwarzschild configuration generated by a point particle of mass $m$ can take any value greater than or equal to $m$ when $\alpha^{2}$ runs through ${\mathbb{R}}_{+}$,

[TABLE]

We thus see that the total energy of gravitational systems with nontrivial topological contents depends on a particular foliation of spacetime. The same is true for the total momentum of such gravitational systems.

The occurrence of ill-defined additive quantities in GR closely parallels that in the Banach–Tarski theorem [14]. The theorem states that a ball in ${\mathbb{R}}_{3}$ can be split into finitely many disjoint subsets which can then be put back together through continuous movements of the pieces, without changing their shape and without running into one another, to yield a ball twice as large as the original, or, in more abstract terms, arbitrary bounded sets with nonempty interiors in ${\mathbb{R}}_{3}$ are equidecomposable. Both the Banach–Tarski decomposition-reassembly and the formation of a black hole derive from a topological reorganization by which the three-dimensional measures of the geometrical layouts become poorly defined. The measure appearing in the Banach–Tarski theorem is the ordinary volume of the balls (more precisely, Lebesgue measure), while the measure in the gravitational energy-momentum problem is the measure of integral quantities like that given by Eq. (442). When one turns to the surface integral for the total energy, Eq. (443), the situation may be likened to the paradoxical duplication or enlargement of spheres discovered by Hausdorff [74].

The usual objection against addressing our concern with the Banach–Tarski paradox in the physical context is that material bodies are made of atoms; the partitioning procedure of a mathematically continuous ball is unrelated to their actual disintegration. Therefore, it is impossible to cut up a pea into finitely many pieces and then reassemble them to form a Sun-sized ball. However, this objection overlooks one important instance, black holes. Each isolated stationary black hole is completely specified by three parameters: its mass $m$, angular momentum $J$, and electric charge $e$. Whatever the content of a system which collapses under its own gravitational field, the exterior of the resulting black hole is described by a Kerr–Newman solution. All individual geometric features of the collapsing system disappear in the black hole state [76]. Furthermore, the event horizon, which is meant to personify the black hole, is stripped of the grain structure that was inherent in its precursor, the collapsing system, and the black hole appears as a perfect object.

It is therefore interesting to enquire into why the values of the functionals (442) and (443) are foliation-dependent for black holes in the light of the analyses which are lumped together as the “Banach–Tarski theorem” (the subject has been detailed in [152]).

A central idea in obtaining a paradoxical decomposition of a set is to get such a decomposition in an isometry group acting on the set, and then transfer it to the set. If a bounded set can be decomposed in a paradoxical way with respect to a group $G$, then $G$ contains free subgroups. In particular, a ball in ${\mathbb{R}}_{3}$ is SO$(3)$-paradoxical because SO$(3)$ acts as a free non-Abelian isometry group. On the other hand, Banach’s theorem [13] states that no paradoxical decompositions exist in ${\mathbb{R}}$ and ${\mathbb{R}}_{2}$. The class of groups whose actions preserve isometry-invariant, finitely additive measures of the bounded sets, the so-called amenable groups, is found to be fairly extensive, containing all solvable groups. A subclass of this class of particular interest comprises Abelian groups.
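
The notions used here can be stated precisely. The following sketch gives the standard textbook definition of a paradoxical set, together with one well-known pair of rotations generating a free subgroup of SO$(3)$; the symbols $E$, $A_{i}$, $B_{j}$, $g_{i}$, $h_{j}$, $\phi$, and $\rho$ are introduced for illustration only and do not appear elsewhere in this review.

```latex
% E is G-paradoxical if, for some pairwise disjoint subsets
% A_1,...,A_n, B_1,...,B_k of E and some g_1,...,g_n, h_1,...,h_k in G,
E = \bigcup_{i=1}^{n} g_{i}(A_{i}) = \bigcup_{j=1}^{k} h_{j}(B_{j}) .
% Two rotations through the angle \arccos(1/3), about the z- and x-axes,
% generate a free subgroup of rank 2 in SO(3):
\phi^{\pm 1} = \frac{1}{3}
\begin{pmatrix} 1 & \mp 2\sqrt{2} & 0 \\ \pm 2\sqrt{2} & 1 & 0 \\ 0 & 0 & 3 \end{pmatrix},
\qquad
\rho^{\pm 1} = \frac{1}{3}
\begin{pmatrix} 3 & 0 & 0 \\ 0 & 1 & \mp 2\sqrt{2} \\ 0 & \pm 2\sqrt{2} & 1 \end{pmatrix}.
```

No nontrivial reduced word in $\phi^{\pm 1}$ and $\rho^{\pm 1}$ equals the identity. An Abelian group contains no such pair, since any two of its elements commute.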

The interplay between measure theory and group theory discovered in the framework of the Banach–Tarski theorem mirrors that in GR in the following way [43]. The failure of the action–reaction principle suggests that the dynamics encoded by Eqs. (433) and (434) is unstable. This instability is a prerequisite to the formation of topologically nontrivial manifolds, in particular those associated with black holes. Much of the current interest in the physics of black holes refers to the model of an island Universe satisfying the requirement of asymptotic flatness, Eq. (441). However, the asymptotic flatness leaves room for a wide range of foliations of a curved spacetime manifold compatible with the condition that all additive observables be convergent. The grids of coordinates realizing these foliations can be interconverted by diffeomorphisms which respect the asymptotic symmetry of the manifold. The effect of rearranging gravitational degrees of freedom is entrusted to the properties of the group of asymptotic symmetry. It is generally believed that the Bondi–Metzner–Sachs group [35], [135], [136] is just this group. What counts is that the Bondi–Metzner–Sachs group is a non-Abelian isometry group. This is the reason why the energy–momentum of gravitational systems with nontrivial topological contents is not uniquely defined; in particular, the total energy of the Schwarzschild configuration is controlled by the foliation parameter $\alpha$, Eq. (452).

ACKNOWLEDGMENTS

The first impetus to summarize the state of the art in the self-interaction problem of classical gauge theories came to me from Asim Barut in 1994. The idea of writing a review of this kind came up again in my discussions with Rudolf Haag in 1998. He advised me to extend the project to cover all attendant issues so that the text would meet the needs of senior students. Furthermore, he wrote a letter of support to the International Science and Technology Center with recommendations to allot funds for writing my book [99]. The Springer Verlag published the book in 2007, which was a partial implementation of the initial plan. Both Barut and Haag found the notion of rearrangement particularly promising. I would like to dedicate the present paper to the memory of these remarkable personalities and outstanding theorists.

Bibliography

[2] Abraham, M., 1903, “Prinzipien der Dynamik des Elektrons,” Ann. Phys. (Leipzig) 10, 105-179.
[3] Abraham, M., 1904, “Zur Theorie der Strahlung und des Strahlungsdruckes,” Ann. Phys. (Leipzig) 14, 236-287.
[4] Abraham, M., 1905, Theorie der Elektrizität. Vol. II. (Teubner, Leipzig).
[5] Actor, A., 1979, “Classical solutions of SU(2) Yang–Mills theories,” Rev. Mod. Phys. 51, 461-525.
[6] Anderson, M. R., 2003, The Mathematical Theory of Cosmic Strings (Institute of Physics, Bristol).
[7] Arnowitt, R., S. Deser, and C. W. Misner, 1960 a, “Canonical variables for general relativity,” Phys. Rev. 117, 1595-1602.
[8] Arnowitt, R., S. Deser, and C. W. Misner, 1960 b, “Energy and the criteria for radiation in general relativity,” Phys. Rev. 118, 1100-1104.
