Gravitational rotation of polarization: Clarifying the gauge dependence and prediction for double pulsar
Ue-Li Pen, Xin Wang, and I-Sheng Yang

TL;DR
This paper clarifies the gauge dependence of gravitational polarization rotation, emphasizing the importance of observer-lens relative motion, and predicts a measurable effect in the double pulsar system, significantly larger than previous estimates.
Contribution
It resolves confusion in the literature regarding polarization rotation and provides an updated, larger prediction for the effect in the double pulsar system based on fundamental general relativity principles.
Findings
Rotation angle estimated at ~10^{-7} rad for the double pulsar
The effect depends explicitly on observer-lens relative motion
Previous predictions underestimated the effect by ten orders of magnitude
Abstract
From the basic concepts of general relativity, we investigate the rotation of the polarization angle by a moving gravitational lens. Particularly, we clarify the existing confusion in the literature by showing and explaining why such rotation must explicitly depend on the relative motion between the observer and the lens. We update the prediction of such effect on the double pulsar PSR J0737-3039 and estimate a rotation angle of . Despite its tiny signal, this is orders of magnitude larger than the previous prediction by Ruggiero and Tartaglia, which apparently was misguided by the confusion in the literature.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Gravitational rotation of polarization:
Clarifying the gauge dependence and prediction for double pulsar
Ue-Li Pen
Canadian Institute of Theoretical Astrophysics, 60 St George St, Toronto, ON M5S 3H8, Canada.
Canadian Institute for Advanced Research, CIFAR program in Gravitation and Cosmology.
Dunlap Institute for Astronomy & Astrophysics, University of Toronto, AB 120-50 St. George Street, Toronto, ON M5S 3H4, Canada.
Perimeter Institute of Theoretical Physics, 31 Caroline Street North, Waterloo, ON N2L 2Y5, Canada.
Xin Wang
Canadian Institute of Theoretical Astrophysics, 60 St George St, Toronto, ON M5S 3H8, Canada.
I-Sheng Yang
Canadian Institute of Theoretical Astrophysics, 60 St George St, Toronto, ON M5S 3H8, Canada.
Perimeter Institute of Theoretical Physics, 31 Caroline Street North, Waterloo, ON N2L 2Y5, Canada.
Abstract
From the basic concepts of general relativity, we investigate the rotation of the polarization angle by a moving gravitational lens. Particularly, we clarify the existing confusion in the literature by showing and explaining why such rotation must explicitly depend on the relative motion between the observer and the lens. We update the prediction of such effect on the double pulsar PSR J0737-3039 and estimate a rotation angle of . Despite its tiny signal, this is orders of magnitude larger than the previous prediction by Ruggiero and Tartaglia Ruggiero and Tartaglia (2007), which apparently was misguided by the confusion in the literature.
I Introduction
Einstein’s theory of General Relativity has been the dominant theory of gravity for a century. Many of its signature outcomes, such as light-bending, orbital precession, and gravitational waves, have been confirmed by precision tests in astronomy Dyson et al. (1920); Kramer et al. (2006); Weisberg and Taylor (2005); Abbott et al. (2016). One of the last pending tests is the gravitational rotation of the polarization angle. Since gravity directly affects the spacetime geometry, a gravitational lens not only can bend the light ray, it may also rotate the polarization. Such gravitational rotation of polarization has not been measured so far. 111The E-mode and B-mode in CMB are defined as the relative angle between polarization and gradient. The observed leading order effect is a consequence of a rotated gradient but a fixed polarization, thus it does not count. The actual rotation of polarization contributes to sub-leading effects, which are analyzed in Dai (2014), and it is unclear whether we will eventually observe them. One obvious reason is that it is usually very small. The rotation angle of a linearly polarized light ray, caused by a gravitational lens, is suppressed by two small numbers.
[TABLE]
Here is the mass of the lens, is the impact parameter—the shortest distance when the light ray pass near the lens, and is the velocity of the lens. Unless the light goes through somewhere comparable to the Schwarzschild radius, the first factor is small. Unless the velocity is almost relativistic, the second factor is small.
Luckily, an almost edge-on, compact pulsar binary system, such as the double pulsar PSR J0737-3039, can be a very strong candidate to measure such effect. Pulsar signals are often highly polarized, allowing precise measurements of its rotation. This system has an almost edge-on orbit which allows the impact parameter to be very small at the superior conjunction. Finally, its compact orbit, with a two-hour period, means a large velocity. One main point of this paper is to show that one can expect to have from double pulsar, which might be observable given a dedicated observation campaign.
The gravitational rotation of polarization from this double pulsar system was previously studied in Ruggiero and Tartaglia (2007). They however derived a much smaller number which is incorrect. In fact, various theoretical derivations of this rotation of polarization have probably brought more confusion than clarity since its first appearance in Skrotskii (1957). It was summarized in Brodutch et al. (2011) that three different values of can be derived from existing literature for seemingly identical physical situations. Many authors disagreed on “whether there is a nonzero rotation in Schwarzschild metric”, while none of them correctly pointed out that it is not even a well-defined question to ask. Although Eq. (1) has been derived by some authors, such as in Kopeikin and Mashhoon (2002); Dai (2014), they did not explicitly explain why it is the correct result.
In this paper, we will resolve the confusions by deriving Eq. (1) from the very basic concept of general relativity—parallel transports. It turns out that , despite being a number, is not a gauge-invariant scalar. It is actually an projection of an tensor. The rotation is a gauge-invariant property associated with a light ray that starts and ends far away from the lens, but depends on which we project to. Therefore it is natural for to be gauge-dependent. *More precisely, the rotation of polarization depends on the frame of the observer. * Such observer/gauge-dependence was the source of confusions. For example, one cannot simply ask whether there are rotations in Schwarzschild metric without specifying who the observer is. For an observer at rest, there is indeed zero rotation; for a moving observer, however, the rotation will be nonzero.
We also identify the “correct” gauge to compute such rotation, in which the answer will agree with the actual observation of double-pulsar signal from the earth. It involves comparing the polarization of two signals, and both are measured in the rest frame of the earth. Thus it is only natural that we calculate in the earth (observer’s) frame. The in Eq. (1) is the relative velocity between the observer and the lens.
In Sec.II, we will provide an operational definition of polarization rotation from the basic principles in general relativity and explain the inevitable observer-dependence. In Sec.III, we will derive Eq. (1) in the small rotation limit for one spin-less, point-mass lens. We then generalize it into multiple lenses even with spins. In Sec.IV, we will describe the observable effects on double pulsar. In Sec.V, we conclude with a discussion of an appropriate observation campaign of future detection.
II Definition from Scratch
II.1 Parallel Transport around a Loop
Intuitively, one can imagine the polarization as a vector attached to a light ray that is spacelike and orthogonal to the direction of propagation. Let k be the null vector of the light ray and e be a polarization vector, a parallel transport of e should be valid in the geometric optic limit, i.e.
[TABLE]
When the rotation is small, using the Born approximation, we can integrate along a light ray from point A to point B to get the change of polarization vector.
[TABLE]
where is the original vector and e is the change. Straightforwardly, could serve as a definition of how much a polarization vector has been rotated. This however, cannot be the full story. That is because Eq. (3) literally compares two vectors on the tangent spaces of two different points,. Such comparison is mathematically meaningless. The two vectors must be in the same tangent space to provide a meaningful rotation. Another way to state the same problem is that the value of connections, , depends on the coordinate choice. By choosing a different gauge, one can change the value of the line integral of Eq. (3) to any value.
Many literature used Eq. (3) and computed its value in one very natural gauge, for example Kopeikin and Mashhoon (2002); Dai (2014). We can probably call that the asymptotic Minkowski gauge or the Schwarzschild gauge, in which falls of to zero asymptotically away from matter sources as they would in the Schwarzschild coordinate. It turns out that Eq. (3) in such gauge happens to give the correct, gauge-independent answer. Here we will explain why.
First of all, parallel transport is not limited to null rays. One vector at a point can be parallel transported along any path by an integral similar to Eq. (3). After we parallel transport from to along a null ray, we can again parallel transport the resulting vector in along another curve back to . If we compare the final vector back at with the initial vector, the result is a loop integral.
[TABLE]
Since now we are comparing two vectors on the tangent space of one point, it is mathematically meaningful. It is also gauge invariant. The gauge freedom allows us to change the value of locally but not globally, as there will be constraints. Parallel transport in any close loop is indeed one of those constraints. Its answer carries the gauge-invariant information about spacetime curvature enclosed by such loop, and it must be gauge invariant.
Eq. (3) is just providing a convenient way to evaluate such gauge-invariant loop integral. Assume that we are in asymptotic Minkowski space, and both points and are in the asymptotic region. We can then choose this path from back to to stay in the asymptotic region. In the asymptotic Minkowski gauge that asymptotically, the segment of integral through the asymptotic region contributes nothing. Therefore, the line integral, Eq. (3), in the asymptotic Minkowski gauge, gives exactly the gauge-invariant answer of the loop integral in Eq. (4).
Furthermore, when there are multiple matter sources in asymptotic Minkowski space, there is a well-defined, common asymptotic Minkowski gauge. That simply means is only nonzero near sources, decays away from individual sources like in a Schwarzschild metric, and the contribution from each source superimposes in the region far away from all sources. Computing a gauge-invariant loop integral in this gauge allows us to identify the contributions to rotation from individual sources, since only the segments of integral near sources have nonzero contributions.
An actual observation, as we depict that in Fig.1, is closely related to a loop integral. What we have is a source (pulsar) which constantly emits a fixed (albeit unknown) polarization. We measure the polarization during a usual time, which is a light ray from to . And then we compare it with the polarization measured when its binary companion passes very close to the line of sight, which is another light ray from to . We take the difference between these two measurements, which is a loop integral if we add two extra time-like segments.
[TABLE]
The two time-like segments, and , and the light ray , are all far away from the companion. They all contribute nothing in the asymptotic Minkowski gauge. Thus the above loop integral can be calculated in the asymptotic Minkowski gauge as only the line integral . Such line integral actually only has contribution near the companion, thus it is indeed the rotation of polarization caused by the passage of the companion. 222 This simplified story is true when both the emitter and receiver are light so we can ignore their contribution to . In reality, points , are near the pulsar, and , are on earth, so neither is in the asymptotic region. Thus the time-like segments can be nonzero, and the null segments will have extra contribution near the end points. Nevertheless, the extra contributions to the null segments will cancel each other. The time-like contributions has nothing to do with the companion, and they are degenerate with an intrinsic variation of pulsar signal or the telescope receiving function. We can simply observe and fit such behaviour when the companion is not passing through the line-of-sight, and subtract it from the data. Thus treating them as in the asymptotic region is a simple way to show the contribution from the companion without loss of generality.
II.2 Observer Dependence
In the previous section, we explained that a given vector, parallel-transported along a loop, goes through a gauge-invariant rotation. In the asymptotic Minkowski gauge, such rotation can be calculated along a light ray. This however, does not resolve all the confusion in the rotation of polarization. The next problem is which vector do we rotate? Polarization lives in a 2-dimensional plane, which cannot be uniquely determined by a light ray. Thus there is no unambiguous answer to the question of “how much rotation of polarization has a light ray gone through.” We need both the light ray and the observer’s 4-velocity to determine the 2-dimensional plane on which the polarization lives. Therefore, “how much rotation” must have an observer dependent answer.
Mathematically, we can see that Eq. (3) is the leading order effect of a small rotation matrix.
[TABLE]
where
[TABLE]
By definition of a rotation matrix, has to be anti-symmetric, which can be verified explicitly.
[TABLE]
Note that we have to drop the boundary term in the above integral, which is allowed because this is effectively a loop integral as we explained in the previous section. This demonstrates that there is actually a full rotation, , that is associated with a loop, therefore a light ray which starts and ends in asymptotic Minkowski region.
This rotation is the gauge-invariant property associated with the light ray, but it does not yet determine the rotation of polarization, which is only an . It also contains extra information such as the deflection of the light ray itself. Particularly, one needs to specify a two-dimensional plane of polarization to determine which to project to. For any observer, the polarization vectors are orthogonal to both the incoming light ray and its own worldline. Thus a projection to the co-dimension-two surface orthogonal to the light ray and the observer 4-velocity is the desired rotation of polarization. Therefore, it is natural and necessary that polarization rotation depends on the observer frame. This also explains the dependence in Eq. (1), which has to be the velocity of the lens in the observer’s frame.
One last possible confusion is why such dependence is on the observer instead of the source, since they seem to play equivalent roles in the integral of Eq. (3). Again, we remind the readers that the apparent line integral in Eq. (3) is a convenient illusion. The physically meaningful rotation is defined by a loop, where one compares a vector to its parallel transported outcome after going around the loop. Thus there does exist one unique point at which the rotation is defined, which is where the two polarizations are being compared. In practice, we will have no idea about the actual polarization when the signal is emitted at the pulsar. All we know are the polarizations we received on earth, so this is the frame we have to choose.
II.3 Beyond Born Approximation
The actual effect we will calculate in the rest of this paper will be quite small, so Born approximation is justified. Nevertheless, the above abstract explanation must still be true beyond the Born approximation, and we will spend this subsection to demonstrate that.
It is straightforward to actually solve the parallel transport equation, Eq. (2), instead of using the Born-approximation integral in Eq. (3). A loop-parallel transport back to the same point is obviously an rotation. So one can see that up to getting the rotation, everything we said in the previous section directly generalizes beyond Born approximation. The only question is that we have used the unique to determine the projection in the Born approximation. Now the direction of light is also deflected significantly, . Do we still have an unambiguous way to determine which to project into?
The answer is yes, and this is how we do it. First of all, the observer’s 4-velocity reduces down to . The light rays, before and after the deflection, and , are also reduced down to two spacelike vectors in the observer’s frame, and . As long as , there is a unique, minimal rotation that aligns them. 333Note that there are many rotations which can align them, but there is a unique minimal rotation, that is rotating along the direction orthogonal to both of them, . Aligning and also merges their polarization planes, in which an rotation is uniquely defined. Thus, one can see that even beyond Born approximation, the rotation of polarization is still a well-defined, unambiguous, observer-dependent projection of a covariant tensor.
III Explicit calculation
III.1 Point Mass
We will treat the gravitational lens as a point mass and model it with a Schwarzschild metric in the isotropic form, expanded to the leading order of .
[TABLE]
where , and the Newton constant is conveniently set to . Instead of studying an arbitrary light ray in the above coordinate, we will shift and boost this metric such that the lens has arbitrary position and velocity, and the relevant light ray is aways . In principle, we need six parameters, i.e. . By applying symmetries, we can further simplify the problem so that eventually only three will be needed.
First we use shift symmetries in and to set , which simply means that is defined as the time when the light ray is closest to the lens. Next, we set , so instead of letting the lens to have an -velocity, the asymptotic observer who measures the polarization will have a nonzero -velocity. This changes nothing because the light ray is in the direction, . Independent of what -velocity the observer has, the plane orthogonal to both the light ray and the observer will be the - plane. Thus we are always calculating the rotation of polarization on the - plane. Finally, using rotational symmetry on the - plan, we can set , leaving the remaining three parameters to be , and .
Employing these symmetries significantly simplifies the problem, and we illustrate the final situation in Fig.2 Since the lens is the centre of the coordinate in Eq. (III.1), we need to apply the appropriate coordinate transformation to accommodate our symmetry choice.
[TABLE]
The resulting metric becomes
[TABLE]
where .
While calculating the connections,
[TABLE]
we can treat the first as the flat metric since we are only keeping the leading order result. This applies to any that is not hit by a derivative in the calculation, e.g. the one in Eq. (8). We also assume that both the null ray direction and the polarization direction are only changed by a small amount, i.e. the Born approximation. Thus we can compute by Eq. (8) along the undeflected light ray . Many components of are zero due to our choice of symmetries, so it is straightforward to see that
[TABLE]
For , we can perform the integral and keep only the leading order value to get
[TABLE]
Here is the lens’ angular momentum with respect to , i.e. the point where the light ray was closest to the lens. The direction of the light ray determines which component of we care about, which is the -component in our symmetry choice. The final, generalized format of Eq. (14) should be straightforward from our symmetry choice. A more explicit calculation in Kopeikin and Mashhoon (2002) led to the same result.
Furthermore, let us imagine that there is a continuous source of signals. The light along the direction continues to shine while the lens is moving in its constant velocity in the direction. The light ray which is closest to the lens will get the maximum rotation. In other words, we get maximum rotation when the lens’ velocity is orthogonal to its distance to the light way. This corresponds to in the above calculation.
[TABLE]
This maximum rotation along a source trajectory is the promised result in Eq. (1).
III.2 General Case
The above point-mass calculation assumes that it carries no spin. Many papers employed a Kerr metric instead to calculate how the angular momentum from the spin also contributes to the rotation of polarization. In the limit of small rotations, we can instead generalize the above result without explicitly starting from a Kerr metric. That is because Eq. (11) allows superposition when all lenses are not moving too fast and not too close to the light ray. At the leading order (ignoring sub-leading velocity terms), the metric of multiple moving point masses are given by
[TABLE]
Their contributions to the total rotation also superimpose linearly. If we further assume that the masses are distributed in a small enough region such that their locations stay the same during the passage of the light ray, the answer is very simple.
[TABLE]
Since their velocities are small, they are roughly in the same location after the light ray goes through all of them, thus do not matter at all.
Under these assumptions, Eq. (18) provides the general answer to any mass and velocity distribution. By the uniqueness theorem, the effect from a Kerr metric of mass and spin can be mimicked by a two-particle system at the leading order.
[TABLE]
Taking while holding fixed, we get
[TABLE]
The spin of the point mass contributes in exactly the same way as its “orbital” angular momentum around the light ray. Intriguingly, although Kopeikin and Mashhoon (2002) agrees with Eq. (14), they claimed that the spin does not contribute at all. We cannot see any physical reason for such statement since Eq. (20) seems to be the most natural result.
IV Example: Double Pulsar
Before putting in the actual numbers, let us first give a rough estimation on the maximal rotation we can get from the double pulsar system. Recall that the Schwarzschild radius of the sun is roughly , and these neutron stars are slightly larger. A typical neutron star is slightly smaller than 10 times its own Schwarzschild radius, so we take the radius of the lens neutron star to be . If it is a slow pulsar, we take the spin period to be about . If the spin aligns with the line of sight, we estimate its contribution in Eq. (20) as
[TABLE]
We have used both and to make this quantity to have the unit of length2, which makes it easier to calculate the unitless . Similarly, assume the binary orbit is , velocity is about speed of light, and the orbital tilt is degree, so the impact parameter is roughly , then the orbital contribution is roughly
[TABLE]
In this case, the spin contribution is negligible.
If the lens is a recycled (fast) neutron star, its the period would be , and its spin angular momentum is increased by a factor of . If it is also aligned with the line of sight, the spin contribution will be comparable to the orbital contribution. Fortunately, fast pulsars are usually spun-up by accretion from the companion, so its spin is usually aligned with the orbital plane. In our case, it means that the spin is almost perpendicular to the line-of-sight, so its contribution is again negligible.
Therefore, in either case, we can estimate the maximal rotation from the orbital contribution only, which is about
[TABLE]
The actual value for double pulsar is slightly larger. In Fig. (3), we calculated the signal using the orbital information of double pulsar system PSR J0737-3039 Kramer et al. (2006). We ignore the spin contribution since they are negligible as we explained. We can see that during a rotation period, we can expect a maximal rotation of polarization () at about . This happens when the companion (lens) is almost in front of the pulsar. It is well-known that at this moment, there will also be an eclipse, so one may worry that we cannot actually see this maximal rotation. We specifically zoomed in and blacked-out the eclipsed. We found that the peak of the curve is significantly wider than the eclipse duration. Thus for a (relatively) long duration before and after the eclipse, we can still observe . We can also see that although the double-pulsar is already quite edge-on, one can hope to get luckier and discover another system whose inclination angle is even closer to degree. The resulting rotation of polarization can be even larger.
V Observational considerations
Such a small swing in polarization angle will be challenging to detect. For parameters of PSR J0737-3039 with a polarized fraction of about 50 % Demorest et al. (2004), raw thermal sensitivity requires a signal-to-noise of at least in polarization. For a pulsar self-noise dominated telescope such as FAST Jin et al. (2007) or SKA Smits, R. et al. (2009) with a band width of GHz, it requires seconds of on target on-pulse integration to achieve a detection. This is a substantial commitment of resources. On top of that, there are practicalities one must consider carefully. First of all, any given object can be only seen for a limited about of time each day. This observable duration for FAST could be short at the low declination of PSR J0737-3039, so SKA is likely the more suitable facility. At a duty cycle of about 20%, this requires a few years of observations with the full phase 2 telescopes. During superior conjunction, the companion’s magnetosphere partially absorbs the pulsar, and plasma faraday effects may also complicate the analysis. 444In principle, the plasma effect is frequency dependent, therefore distinguishable from the gravitational effect which is achromatic.
Despite the substantial efforts required, the pulsar is likely to be monitored extensively for other reasons. Thus a detection may be eventually achieved over decades of SKA operations. Alternatively, other more optimal systems may be discovered, for example pulsar-black hole systems. Our estimates indicates that this effect is in principle observable in the foreseeable future. The gravitational Faraday effect can be added to the wish-list of pulsar tests of general relativity.
Acknowledgements.
We thank John Antoniadis and Michael Kramer for informative discussions. We also thank the hospitality of Max-Planck Institute for Radio Astronomy during the Scintillometry Workshop. This work is supported by the Canadian Government through the Canadian Institute for Advance Research and Industry Canada, and by Province of Ontario through the Ministry of Research and Innovation. The Dunlap Institute is funded through an endowment established by the David Dunlap family and the University of Toronto.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ruggiero and Tartaglia (2007) M. L. Ruggiero and A. Tartaglia, Mon. Not. Roy. Astron. Soc. 374 , 847 (2007), eprint astro-ph/0609712.
- 2Dyson et al. (1920) F. W. Dyson, A. S. Eddington, and C. Davidson, Philosophical Transactions of the Royal Society of London Series A 220 , 291 (1920).
- 3Kramer et al. (2006) M. Kramer, I. H. Stairs, R. N. Manchester, M. A. Mc Laughlin, A. G. Lyne, R. D. Ferdman, M. Burgay, D. R. Lorimer, A. Possenti, N. D’Amico, et al., Science 314 , 97 (2006), ISSN 0036-8075, eprint http://science.sciencemag.org/content/314/5796/97.full.pdf, URL http://science.sciencemag.org/content/314/5796/97 .
- 4Weisberg and Taylor (2005) J. M. Weisberg and J. H. Taylor, ASP Conf. Ser. 328 , 25 (2005), eprint astro-ph/0407149.
- 5Abbott et al. (2016) B. P. Abbott et al. (Virgo, LIGO Scientific), Phys. Rev. Lett. 116 , 061102 (2016), eprint 1602.03837.
- 6Dai (2014) L. Dai, Phys. Rev. Lett. 112 , 041303 (2014), eprint 1311.3662.
- 7Skrotskii (1957) G. V. Skrotskii, Soviet Physics Doklady 2 , 226 (1957).
- 8Brodutch et al. (2011) A. Brodutch, T. F. Demarie, and D. R. Terno, Phys. Rev. D 84 , 104043 (2011), eprint 1108.0973.
