Special Relativity from Soft Gravitons
Mark P. Hertzberg, McCullen Sandora

TL;DR
This paper demonstrates that local, non-Lorentz invariant theories with massless spin 2 particles inherently require special relativity's energy-momentum relation, suggesting relativity's structure can emerge from spin 2 interactions.
Contribution
It proves that locality and soft gauge invariance in theories with massless spin 2 particles enforce the exact special relativistic energy-momentum relation, even without initial Lorentz invariance.
Findings
Universal graviton couplings are required.
The energy-momentum relation must be exact.
Relativity structure can emerge from spin 2 interactions.
Abstract
We study all translationally and rotationally invariant local theories involving massless spin 2 and spin 1 particles that mediate long range forces, allowing for general energy relations and violation of boost invariance. Although gauge invariance is not a priori required to describe non Lorentz invariant theories, we first establish that locality requires `soft gauge invariance'. Then by taking the soft graviton limit in scattering amplitudes, we prove that in addition to the usual requirement of universal graviton couplings, the special relativistic energy-momentum relation is also required and must be exact. We contrast this to the case of theories with only spin particles, where, although we can still derive charge conservation from locality, special relativity can be easily violated. We provide indications that the entire structure of relativity can be built up from spin 2…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Special Relativity from Soft Gravitons
Mark P. Hertzberg, McCullen Sandora
Institute of Cosmology, Department of Physics and Astronomy,
Tufts University, Medford, MA 02155, USA
Abstract
We study all translationally and rotationally invariant local theories involving massless spin 2 and spin 1 particles that mediate long range forces, allowing for general energy relations and violation of boost invariance. Although gauge invariance is not a priori required to describe non Lorentz invariant theories, we first establish that locality requires ‘soft gauge invariance’. Then by taking the soft graviton limit in scattering amplitudes, we prove that in addition to the usual requirement of universal graviton couplings, the special relativistic energy-momentum relation is also required and must be exact. We contrast this to the case of theories with only spin particles, where, although we can still derive charge conservation from locality, special relativity can be easily violated. We provide indications that the entire structure of relativity can be built up from spin 2 in this fashion.
Introduction:— Special relativity is in spectacular agreement with all current observations. It provides a beautiful unification of space and time. Combined with quantum mechanics, it explains much of what we observe in the world. When applied to massless spin 1 particles, it describes the structure of the electroweak and strong forces, and when applied to massless spin 2 particles, it describes the structure of gravitation.
However, one may enquire what is the origin of special relativity, or more specifically, the Lorentz symmetry; is it a structure that can be deformed easily at low energies; is it an accidental symmetry that must be exact at low energies but might be violated at high energies? In the Standard Model of particle physics, Colladay and Kostelecky Colladay:1998fq and Coleman and Glashow Coleman:1998ti found that one can easily deform the Lorentz symmetry. In fact 46 new CPT even couplings at the dimension 4 level are allowed, without affecting the unitarity of the theory, the degree of freedom counting, or leading to any known pathology. Of course such deformations are highly constrained by experiment Mattingly:2005re , but it is interesting that it can be done so easily in theories with only spin particles.
In this Letter we would like to point out the tremendous theoretical difficulty of violating the Lorentz symmetry when massless spin 2 is included. In particular, we will allow our matter species and massless spin 2 particle to carry any dispersion relation, arbitrary violations of boost invariance, and we will prove using Weinberg’s method Weinberg:1964ew of demanding consistent soft graviton scattering that the special relativistic dispersion relation is required. We provide indications that the full structure of relativity can be built up too. Our only assumptions will be that we have translational and rotational invariance in some frame and that interactions avoid instantaneous action at a distance.
Spin:– Compatibility with quantum mechanics and rotation invariance demands that particles transform under a unitary representation of the rotation group . These representations are organized by spin in the usual fashion . Furthermore, there are two distinct classes of representations for particles with spin : (i) One class of representations, usually called “massive”, fill out the full set of spins along, say, the -axis as , i.e., all components of angular momentum. (ii) Another class of representations, usually called “massless”, only fill out the spins parallel to the direction of motion, known as helicity with . We note that the presence of these classes do not rely upon the presence of the Lorentz symmetry, but only the rotation symmetry. Without boost invariance, the “massive” representations do transform in a complicated fashion, while the “massless” representations transform in a relatively simple fashion. Note that neither representation is a priori gapped or gapless.
The quantum state of a massless spin 2 particle is specified by its momenta and helicity as . In order to describe how it transforms under rotations, one needs to introduce a symmetric polarization matrix . In order to project down to only 2 helicities, one can demand that the polarization matrix is transverse-traceless
[TABLE]
Note that these constraints are manifestly rotationally invariant, so we have cut down to the correct number of degrees of freedom, while maintaining our assumed space-time symmetry. The situation is analogous for massless spin 1, which can be described in a manifestly rotationally invariant way by a polarization vector that is transverse . Hence, unlike the Lorentz invariant case, there is no a priori reason to introduce gauge invariance into these descriptions.
Long Range Interactions:— We would like to build a theory that contains long range interactions. However, if we attempt to do so using the above spin 1 or spin 2 particles we encounter a problem. To see this, consider the propagator for these particles (we assume parity here for simplicity, but our results do not rely upon this)
[TABLE]
where the index structure enforces these to be transverse (and traceless for spin 2) and are dispersion relations for the spin 1 and spin 2 particles, respectively.
Now consider scattering between some matter particles via the exchange of a single spin 1 or spin 2 particle. Since the propagator carries indices, we are required to contract with some vector current or matrix for spin 1 and spin 2, respectively. This gives the following contribution to the action for the matter degrees of freedom from tree-level exchange, which we show now for the spin 1 case, and extend to spin 2 in the Appendix:
[TABLE]
where we have also allowed for the exchange of a non-dynamical scalar that mediates some type of Coulomb interaction between charge density . This second term is included for two reasons: (i) it is compatible with rotation invariance and does not introduce any additional degrees of freedom, and (ii) the first term evidently cannot by itself mediate a Coulomb-like interaction since it involves the currents associated with moving particles, and hence the first term does not give rise to a force between static charges, while the second term can.
For any current that does not trivially vanish in the soft limit, the first term here is invariably non-local due to the structure, whose Fourier transform is long ranged. The Coulomb interaction being associated with a non-dynamical field is evidently also non-local. Both lead to instantaneous action at a distance.
Local Interactions:— In this work, we will impose the most basic version of locality: we demand our theories do not have instantaneous action at a distance. In order for this to be possible, we require that the non-locality in the above action cancels out.
To do so, there must be a constitutive relationship shared between and ; this relationship must be linear since both terms are of the same order. One can check that the most general form allowed is
[TABLE]
where is some function that mixes the two fields together. This equation reduces to the familiar charge conservation equation if , but is different otherwise.
If we take the classical particle limit for the charge density , and vary the action with respect to , we obtain the force applied to a test charge at position . Let us assume that for all times the charge and current densities vanish, but are suddenly non-zero at time . In order to avoid instantaneous action at a distance, every derivative of () must vanish when the test charge is separated from the rest of the charges. In order for this to occur, every coefficient of in (4) must be a local function if we expand in inverse powers of .
By using the constitutive relationship to relate the and terms, and the fact that only polynomials of have local Fourier transforms (derivatives of delta functions), we find the following conditions must be satisfied by the functions :
[TABLE]
where are polynomials in , is a constant, and is a mass that must vanish unless . If we compute the Coulomb interaction between a pair of charges by Fourier transforming we obtain
[TABLE]
Hence in order to have a long range interaction we require . We assume this going forward and can set . We then find the following properties: (i) there is an exact Coulomb potential at large distances (plus negligible contact interactions), (ii) the photon is necessarily gapless, and (iii) we can integrate up the constitutive relationship (5) over all space and find global (though usually not local) charge conservation.
Soft Gauge Invariance:— The above exchange can be obtained from a complete theory that allows for external photons. By introducing creation and annihilation operators to describe multi-particle states and Fourier transforming to position space , we can construct the following Lagrangian density for spin 1 fields
[TABLE]
where and . One can check that this gives precisely the exchange action in (4). Note that we have included a Lagrange multiplier to project out the longitudinal mode of .
Now locality restricts the form of to that given above in Eqs. (6-8). If we focus on only the terms with the lowest number of spatial derivatives, we see that the kinetic term for the spin 1 field organizes into and the theory becomes endowed with a gauge invariance
[TABLE]
(combined with familiar gauge transformations for the matter sector) with for slowly varying , which we refer to as ‘soft gauge invariance’.
A similar result holds for spin 2 particles, where we can again Fourier transform . In this case we need to firstly introduce a non-dynamical scalar to mediate a long ranged Newtonian force, and secondly we need a non-dynamical vector . Together they allow the theory to avoid instantaneous action at a distance when introduced appropriately, with similar properties to the above: (i) an exact Newtonian potential at large distances and (ii) a gapless graviton with . Moreover, all are organized into a symmetric matrix , again endowed with a soft gauge invariance (see the Appendix)
[TABLE]
Intuitively, the presence of the soft gauge invariance is required to ensure that the non-dynamical fields are mixed with the propagating degrees of freedom such that long range forces inherit the finite speed of propagation of the spin 2 or spin 1 particles.
Spin 1 Interacting Theories:— For spin 1, we can construct various Lorentz violating, but local Lagrangians. If we truncate to just two derivatives, then in fact we obtain exact gauge invariance. As an example, consider the following Lagrangian for massless spin 1 coupled to multiple species of fermions
[TABLE]
where is the usual covariant derivative and the matrices encode arbitrary limiting speeds of propagation as follows:
[TABLE]
which violate Lorentz (but not gauge) invariance.
We note that this Lorentz violating construction carries over immediately to Yang-Mills fields by simply dressing up the spin 1 gauge fields with color indices and introducing a Lie algebra structure to encode self interactions in the usual way. Furthermore, this can be used to introduce various forms of Lorentz violation in the Standard Model at the dimension 4 level Colladay:1998fq ; Coleman:1998ti .
Spin 2 Interacting Theories:— For spin 2, it is significantly more complicated to construct consistent theories. One possibility is to simply couple the linearized Riemann tensor directly to matter Wald:1986bj ; Hertzberg:2016djj . These models typically have problems with causality Hertzberg:2016djj ; Hertzberg:2017abn , but in any case do not mediate long range forces, and will not be further explored here. Instead we would like to explore leading order interactions. We know that one can introduce a consistent theory if one assumes the Lorentz symmetry, as this leads to general relativity. But, if we do not assume the Lorentz symmetry, then there are many challenges and open questions: (i) Does the propagation speed of the matter sector need to agree with the propagation speed of the graviton? (ii) What restrictions are placed on a general dispersion relation for the matter particles? (iii) What changes occur if we include various types of matter species; fermions and gauge/vector bosons? (iv) What constraints apply to the graviton’s dispersion relation? (v) What possible structures could we utilize to build a consistent theory, including conserved currents? (vi) Is the equivalence principle still required for consistency?
In order to address these questions in a systematic fashion, we now utilize Weinberg’s technique of studying a generic scattering process involving the emission of a soft graviton Weinberg:1964ew .
Soft Graviton Emission:— Consider a general scattering processes involving external particles including a graviton. The amplitude is known to simplify in the soft graviton limit; namely it factorizes into the amplitude for particles times a piece describing the emission of the graviton from an external leg, as depicted in Figure 1, as follows:
[TABLE]
where is the polarization matrix of the outgoing soft graviton, is some matrix associated with the th matter particle that the graviton is coupled to, the subscript refers to initial particles, and the subscript refers to final particles. Note that we have taken each interaction vertex to involve the same virtual and external particles. If this were not the case and the two particles had different dispersion relations, then the virtual particle would be highly off-shell and would not contribute appreciably to the above sum in the soft limit.
Constraint from Locality and Unitarity:— Under a soft gauge transformation the polarization matrix of the external graviton transforms as
[TABLE]
where is the graviton’s 4-momenta. In order for the scattering amplitude to be associated with a local and unitary theory, it must be left unchanged. This ensures that the longitudinal modes are consistently removed and not being sourced by .
Furthermore, in the soft limit the respective denominators of (15) can be simplified by Taylor expanding to in the graviton’s momentum. The need for soft gauge invariance leads to the constraint
[TABLE]
where is a type of “momentum” with an upstairs index; more precisely it is
[TABLE]
Now Eq. (17) must be valid for any graviton momentum, regardless of its direction and for any set of momenta for the particles. The only way this is possible is for the graviton momenta to cancel out of numerator and denominator. It is easy to see that this is only possible if the matrix that the graviton momenta is contracted with must be proportional to the vector in each of its indices. The most general form of is therefore
[TABLE]
where we have allowed for a prefactor that can depend on any property of the th particle that is a scalar under rotations, including energy. Inserting this into (17) gives the conservation laws
[TABLE]
Dispersion Relation:— The conservation law is . Since the only conserved scalars are the energy itself and charges, the function must take the form
[TABLE]
where is a universal coupling and is some charge associated with the th particle. The conservation law is
[TABLE]
Now the only conserved 3-vectors are the 3-momentum and angular momentum of the particles. But since must be parallel to the 3-momentum, then this is the only viable option. Hence we have
[TABLE]
where is some universal coefficient. Using (21) and the fact that on-shell , we can integrate up this equation to give
[TABLE]
where is some constant of integration.
Galilean Symmetry?:— To explore this dispersion relation, let us begin by considering the special case in which . By dividing throughout by the charge we obtain
[TABLE]
where and . Hence we discover the dispersion relation of Newtonian mechanics, and since the coupling , we see a connection to Newtonian gravity. Furthermore we see that the mass of particles is required to be conserved as it is the re-scaled conserved charge . So in fact (25) is the most general structure compatible with Galilean symmetry.
Lorentz Symmetry:— The above option that leads to Galilean symmetry among the matter sector is only a viable option if one restricts attention to processes in which all gravitons are taken to zero momentum. However, we are also allowed to consider hard gravitons as part of the particles participating in the scattering process, in addition to the one soft graviton. Since gravitons exchange momentum with the rest of the particles, they are required to be part of the momentum conservation law, and hence the dispersion relation (24) must apply to gravitons too. Since the gravitons have at low momenta, then self consistency demands that
[TABLE]
and for the graviton. We then discover that not only does the graviton’s dispersion relation start linear for small momenta, it is required to stay exactly linear for all momenta.
Furthermore, for any particles that carry non-zero charge , we can always absorb the charge into the definition of the energy by the replacement
[TABLE]
since this maps a conserved energy into another conserved energy. Then without loss of generality the dispersion relation for all particles can be put into the form
[TABLE]
So we find the graviton’s speed sets a universal speed limit, and we can replace . Hence we discover the complete and most general energy-momentum relation allowed is that of the familiar special relativistic form.
Then by noting that energy is the generator of time translations and momentum is the generator of spatial translations, we readily obtain that is invariant. This provides the hyperbolic structure of Minkowski space, boost invariance, and the Lorentz transformations. This goes a long way toward constructing special relativity from the ground up.
Soft Photon Emission:— One may compare the above analysis to theories only involving particles with spin . It is well known that with a massless spin 1 particle, which we shall refer to as a “photon”, requiring consistent soft scattering is still highly constraining Weinberg:1964ew .
If we consider a soft outgoing massless spin 1 particle with matter obeying an arbitrary dispersion relation, then the matrix element is still given by (15) with the replacement . Here is the polarization vector of the outgoing soft photon and is some vector associated with the th matter particle. The locality and unitarity constraint for the soft photon
[TABLE]
then leads to a version of (17) with in the numerator, namely
[TABLE]
In order for this to be satisfied for any photon and matter momenta, we need to cancel out, which requires
[TABLE]
where again we have allowed for a prefactor that is some scalar under rotations. Inserting this into (30) gives the single conservation law . Again using the fact that the only conserved scalars are charges and energy, we have the general solution
[TABLE]
where is a charge and is some universal mass scale.
In the language of Lagrangians, these terms are associated with the interaction terms
[TABLE]
where the first term is the familiar coupling of the photon to a current and the second (Lorentz violating) term couples the photon to the time-components of the energy-momentum tensor. We do not know if the second term possesses a non-linear completion, but we shall not focus on this term here. What is most important is that the dispersion relation has completely dropped out of this analysis, and hence one can couple (at least using the term) a massless photon to any Lorentz violating (an example was seen earlier in Eqs. (13-14)). On the other hand, this is impossible when spin 2 is included.
Discussion:— An interesting suggestion made in Khoury:2013oqa is that the Lorentz symmetry might emerge as an accidental symmetry associated with massless spin 2 particles. In that work, only the graviton with only a 2 derivative action was studied. In this work, we have shown that even with an arbitrary number of matter species and higher derivatives, the relativistic dispersion relation must be exact in the IR and UV, rather than merely accidental.
It remains to be proven if all higher order interaction terms with matter must necessarily obey the Lorentz symmetry and to construct the entire Einstein-Hilbert action with matter species, but we believe it is impressive that the leading order interactions and the free sector are forced to carry the Lorentz symmetry exactly.
Moreover, there are plausible reasons why typical interactions may be Lorentz invariant: (i) Suppose we consider a bosonic field that acquires some condensate configuration, plus small fluctuations . Then consider configurations where the length and time scale over which the condensate changes is slow compared to the soft graviton. Then from our work, the fluctuations should be found to be described by a Lorentz invariant theory, as they look like free fields at the quadratic order. This usually only arises from a fully Lorentz invariant interaction term to begin with. (ii) Non-Lorentz invariant interaction terms may renormalize lower dimension operators and be inconsistent with the relativistic dispersion relation derived here. (iii) It is possible that any Lorentz violating process is merely spontaneous breakdown.
Finally, our result that any local and unitary effective theory (in a universe like ours that includes gravity) must obey the special relativistic dispersion relation exactly, brings into serious question the viability of various proposed modifications of special relativity. Some examples include (i) the Lorentz violating construction of the Standard Model Colladay:1998fq ; Coleman:1998ti , (ii) Lorentz violating constructions of quantum gravity, such as Horava-Lifschitz Horava:2009uw , (iii) the construction of various deformed versions of relativity AmelinoCamelia:2000mn ; Magueijo:2001cr , and (iv) some alternatives to inflation that appeal to an arbitrarily widened light cone at high energies Moffat:1992ud .
Acknowledgments:— We would like to thank Jose Blanco-Pilado, John Donoghue, Gary Goldstein, Ali Masoumi, Mohammad Namjoo, and Masaki Yamada for useful discussions, and the Tufts Institute of Cosmology.
Email Correspondence:— [email protected], [email protected]
Appendix:– Here we extend the locality analysis to the spin 2 case. The most general interaction from tree-level graviton exchange is
[TABLE]
where . For non-localities to cancel, we need the constitutive relations: and . Imposing locality we find that these functions must be related to polynomials as:
[TABLE]
So a long range force requires . Hence the graviton must be massless, and we can set , and we need . By Fourier transforming to the local field representation , we can construct the following Lagrangian:
[TABLE]
For ease of notation, we have defined the dot product between two matrices as . The functions here are related to the above as , , , and . Demanding that the spin 2 exchange arises from an action places further consistency conditions on the functions , , and . Note that in our convention, the gravitational couplings are included in the sources .
We then find that to leading order in a derivative expansion, the first, second, sixth and seventh terms in (42) assemble into , which is invariant under the gauge transformation , . Likewise, the seventh and eighth terms in (42) are invariant under the gauge transformation , (up to a total derivative). A matrix can then be assembled as , , and we obtain soft gauge invariance, as reported in Eq. (12).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) D. Colladay and V. A. Kostelecky, Phys. Rev. D 58 , 116002 (1998) [hep-ph/9809521].
- 2(2) S. R. Coleman and S. L. Glashow, Phys. Rev. D 59 , 116008 (1999) [hep-ph/9812418].
- 3(3) D. Mattingly, Living Rev. Rel. 8 , 5 (2005) [gr-qc/0502097].
- 4(4) S. Weinberg, Phys. Rev. 135 , B 1049 (1964).
- 5(5) R. M. Wald, Phys. Rev. D 33 , 3613 (1986).
- 6(6) M. P. Hertzberg, ar Xiv:1610.03065 [hep-th].
- 7(7) M. P. Hertzberg and M. Sandora, ar Xiv:1702.07720 [hep-th].
- 8(8) J. Khoury, G. E. J. Miller and A. J. Tolley, Class. Quant. Grav. 31 , 135011 (2014) [ar Xiv:1305.0822 [hep-th]].
