Metrics for next-generation gravitational-wave detectors
Evan D Hall, Matthew Evans

TL;DR
This paper evaluates various configurations of future gravitational-wave detector networks, emphasizing how detector types influence performance metrics like sky localization, to guide optimal network design for diverse science goals.
Contribution
It systematically compares three-detector networks with different detector types, revealing that detector composition impacts performance more than geographic placement.
Findings
Detector type significantly affects network performance.
Network configuration influences sky localization accuracy.
Type of detectors is more critical than their geographic location.
Abstract
Gravitational-wave astrophysics has the potential to be transformed by a global network of longer, colder, and thus more sensitive detectors. This network must be constructed to address a wide range of science goals, involving binary coalescence signals as well as signals from other, potentially unknown, sources. It is crucial to understand which network configurations---the number, type, and location of the detectors in the network---can best achieve these goals. In this work we examine a large number of possible three-detector networks, variously composed of Voyager, Einstein Telescope, and Cosmic Explorer detectors, and evaluate their performance against a number of figures of merit meant to capture a variety of future science goals. From this we infer that network performance, including sky localization, is determined most strongly by the type of detectors contained in the network,…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11| Code | Location | Lat. | Long. | |
|---|---|---|---|---|
| H | Hanford, USA | |||
| L | Livingston, USA | |||
| V | Pisa, Italy | |||
| I* | India | |||
| K | Kamioka, Japan | |||
| E* | Europe | |||
| A* | Western Australia | |||
| U* | Utah, USA |
| Network | H | L | V | I | K | E | A | U |
|---|---|---|---|---|---|---|---|---|
| HLV | aL | aL | AdV | — | — | — | — | — |
| 3Voy | Voy | — | Voy | Voy | — | — | — | — |
| ET/2Voy | — | Voy | — | — | Voy | ET | — | — |
| CE/2Voy | — | — | Voy | — | Voy | — | — | CE |
| ET/CE/Voy | — | — | — | Voy | — | ET | — | CE |
| 2CE/Voy | — | — | — | — | Voy | — | CE | CE |
| ET/2CE | — | — | — | — | — | ET | CE | CE |
| 3CE | — | — | — | — | — | CE | CE | CE |
| 3ET | — | — | — | — | — | ET | ET | ET |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Metrics for next-generation gravitational-wave detectors
Evan D. Hall
LIGO Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Matthew Evans
LIGO Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Abstract
Gravitational-wave astrophysics has the potential to be transformed by a global network of longer, colder, and thus more sensitive detectors. This network must be constructed to address a wide range of science goals, involving binary coalescence signals as well as signals from other, potentially unknown, sources. It is crucial to understand which network configurations—the number, type, and location of the detectors in the network—can best achieve these goals. In this work we examine a large number of possible three-detector networks, variously composed of Voyager, Einstein Telescope, and Cosmic Explorer detectors, and evaluate their performance against a number of figures of merit meant to capture a variety of future science goals. From this we infer that network performance, including sky localization, is determined most strongly by the type of detectors contained in the network, rather than the location and orientation of the facilities.
I Introduction
Gravitational-wave science, which started in earnest with the first detection of gravitational-waves in 2015 Abbott et al. (2016), has immense unexplored potential. Aiming to exploit this potential, the gravitational-wave community is already considering designs for next generation of detectors Abernathy et al. (2011); Abbott et al. (2017). It is clear that to get the most science out of the gravitational-wave signals we detect, a network of large-scale observatories will be required. However, at present there are many unanswered questions about how best to construct such a network.
In this paper we will address several of the fundamental questions required to maximize the scientific potential of a terrestrial gravitational-wave detector network. These questions include: Which science goals are sensitive to the location and orientation of the detectors in the network, and in what way? How the does design of the network’s constituents impact its output? What science can be done with a heterogeneous mix of second and third generation detectors?
Instrument and facility designs often focus on optimizing the performance of single detectors rather than networks. Such optimizations have relied on metrics such as the inspiral range—the distance out to which each detector could detect a model system (usually a 1.4–1.4 binary coalescence) with a certain signal-to-noise ratio Finn and Chernoff (1993)—or the detector’s strain sensitivity Miao et al. (2014). A variety of cosmological generalizations of these range metrics have been developed Chen et al. (2017); one such generalization, the response distance, is shown in Fig. 1. Effective strain sensitivities are shown in Fig. 2, taking into account effects from frequency-dependent antenna patterns Essick et al. (2017) and multi-interferometer detectors.
Systematic studies of gravitational-wave detector network optimization, measured quantitatively via a set of metrics, go back more than a decade. Searle et al. Searle et al. (2002) considered how coincident detections of binary neutron stars could be augmented by the addition of another facility to the the existing gravitational-wave network. Schutz Schutz (2011) proposed three figures of merit for a gravitational-wave network: the triple-coincidence detection rate, the isotropy of detections across the sky, and the typical localization of events on the sky. Raffai et al. Raffai et al. (2013) optimized the facility placement of a set of triangular (Einstein-Telescope-like) detectors as well as the placement of a LIGO facility in India; this procedure was then generalized by Hu et al. Hu et al. (2015). Michimura et al. Michimura et al. (2018) optimized the optical configuration of the Kagra detector Aso et al. (2013) with respect to the sky localization performance of the global advanced detector network.
More generally, others Mills et al. (2018); Vitale and Evans (2017); Vitale and Whittle (2018) have already examined the performance of a set of plausible third-generation gravitational-wave networks against some set of metrics. The binary-neutron-star (BNS) localization capabilities of networks with third-generation detectors has been explored by Mills et al. Mills et al. (2018). The binary-black-hole (BBH) parameter estimation capabilities—including masses, spins, redshift, and localization—of networks with third-generation detectors has been explored by Vitale and Evans Vitale and Evans (2017) and Vitale and Whittle Vitale and Whittle (2018).
While these works evaluate the performance of some networks, and optimize a few assemblies of detectors for some performance metrics, they do not address the critical questions posed earlier in this section. In this work, we present in full the performance of a large ensemble of networks against a list of metrics, in order to see the full landscape of network performance. This reveals which metrics require careful optimization, and which are relatively insensitive to network configuration.
II Metrics that connect network parameters with science goals
The questions raised in Sec. I naturally drive one to “optimally” choose the network parameters—the number, type, location, and orientation of third-generation instruments across the globe—based on what information is needed to address specific science goals. These goals are quite diverse; e.g., determining the history of star formation, testing corrections to general relativity, uncovering the nuclear physics inside neutron stars and supernovae, and constraining the dark energy equation of state. Each of these may require different kinds of observational information or levels of precision. In addition, each of these science goals involves a variety of data analysis techniques and scientific products whose connection to the network parameters is not immediately obvious.
For these reasons, network optimization studies Hu et al. (2015); Raffai et al. (2013); Michimura et al. (2018) often focus on optimizing a smaller number of metrics, by which we mean intermediate data products that have a reasonably clear dependence on the network parameters and which then feed into the more specialized analysis that is used to address the science goals. Such metrics include signal-to-noise ratio, polarization sensitivity, and source localization.
In addition to choosing a set of metrics, one must also consider whether to optimize these metrics for the best events, or the typical events. Certain studies benefit from the large statistics of the total population of events, thereby suggesting that metrics should be optimized for the median event. Other studies benefit from collecting a few of the loudest events, thereby suggesting that metrics should be optimized for loudest events. Still other studies require collecting a few rare or special events, which may not be captured in the generic metrics; these studies may need to optimize the median event so as not to lose out on the rare events.
In the following section we choose a few example metrics and explore how different third-generation networks perform. Where appropriate, we examine how networks perform for both the average (median) event and the best 10% of events.
III Metrics
We have chosen several metrics relating to parameter estimation for coalescing binaries, as well as metrics intended to evaluate network performance for neutron-star/supernova physics and stochastic backgrounds:
the sky localization area of 1.4–1.4 binary neutron star coalescences at redshift , a metric useful for assessing the feasibility of electromagnetic followup; 2. 2.
the signal-to-noise ratio of 30–30 binary black hole coalescences at redshift , a metric for quantifying the quality of the most frequent events; 3. 3.
the distance uncertainty (or equivalently, redshift uncertainty) of the 30–30 binary black hole coalescences at , a metric useful for assessing the constraining power of third-generation networks on the stellar evolution history of the universe; 4. 4.
the inclination angle uncertainty of the same 30–30 systems at , which is representative of the network’s ability to distinguish between gravitational-wave polarizations; 5. 5.
the signal-to-noise ratio of a high-frequency strain signal, a metric useful for studies of the post-merger signal in neutron star coalescences, and for galactic supernovae; and 6. 6.
the level of stochastic background that could be observed with after one year of observation time.
Note that while these metrics have chosen to be representative of the basic kinds of signal searches and parameter estimation problems that are carried out with gravitational-wave detectors, they feed more broadly into overarching science goals. For example, for tests of general relativity, the SNR performance for binary black-hole systems as quantified by metric (2) is related to the precision with which one can perform spectroscopy on black hole ringdown signals Berti et al. (2016), and the metric (4) is related to polarization-discrimination ability required to carry out a broad class of polarization-based relativity tests Will (2014).
Our strategy for exploring the above metrics is as follows. First, we write down a list of possible three-facility configurations using various combinations of second- and third-generation detectors 111Note the distinction between detector and facility: while each Cosmic Explorer facility is assumed to contain one Cosmic Explorer detector, each Einstein Telescope facility contains three Einstein Telescope detectors, arranged in an equilateral triangle.; in the end we select nine such combinations. For each of these nine configurations, we generate an ensemble of possible networks by allowing the locations of the third-generation facilities (ET or CE) to vary randomly across the globe 222We do not take into account likely geographical constraints such as oceans. This results in some highly improbable networks, but it means that our exploration of network topology is exhaustive: one should not expect to find a geographically forbidden network that vastly outperforms the geographically realistic ones., and by allowing the Voyager detectors to be chosen randomly from one of five facilities (Hanford, Livingston, Pisa, India, and Kamioka). For each of the configurations involving at least one ET or CE facility, we generate 200 random networks; for the three-Voyager configuration we generate 10 networks (the maximum possible by choosing 3 facilities from a set of 5); and for the HLV configuration we use only the one existing network with two LIGO detectors at Hanford and Livingston and one Virgo detector Acernese et al. (2015) at Pisa. Any network realization with the facilities placed too close together (area of the planar triangle spanned by the facilities is less than , where is the radius of the Earth) is rejected. In total, this results in more than 1000 networks whose performance we evaluate. In the subsequent plots, the random networks are shown as circles, with the color of each circle corresponding to the nine configurations given above.
For each of the nine configurations we generate a single, plausible network using the facility coordinates given in Tab. 1, with detectors assigned to facilities according to Tab. 2. These are shown as colored stars in the subsequent plots.
For the metrics involving binary coalescences, we generate systems distributed isotropically in the sky (via the HEALPix scheme Gorski et al. (2005)333https://healpix.sourceforge.io), and with random inclinations. The components of the binary are non-spinning and have equal mass. For each of the networks, we evaluate each metric against all of these systems, resulting in a distribution of 768 values; we then extract the median value of this distribution, and the value delimiting the best 10% of the distribution. No SNR cuts or trigger thresholds are applied.
The stochastic metric results in only one value per network, so no computation of quantiles is necessary.
III.1 Localization of neutron-star binaries at
Out to redshift , one may reasonably expect binary neutron star coalescence events to pass through the earth every year, even assuming a pessimistic local merger rate density 100\text{,}\mathrm{G}\mathrm{p}\mathrm{c}^{-3},\mathrm{y}\mathrm{r}^{-1}$$. These events, if sufficiently localized, will be within the followup capabilities of next-generation telescopes.
With a network of three separate detector facilities, events can be localized to an ellipse on the sky. In this work we compute the localization area (90% confidence interval) via the basic Fisher matrix procedure described by Singer and Price (Singer and Price, 2016, §B), with uniform priors.
In Fig. 3 we plot the resulting distributions for the best 10% of localizations, with the networks sorted by the area spanned by the three facilities. As expected, the localization performance scales inversely with the area. It is also clear that each addition of a 3G facility to the network significantly improves the network’s localization capability. A network composed of two 2G facilities and one 3G facility is roughly a factor of 3 better than a baseline set of three 2G facilities. Including two 3G facilities offers an order of magnitude improvement over the baseline, while a full 3G network offers another factor of 4 or 5, with a significant fraction of localizations below .
In Fig. 4 we plot the best 10% of localizations against the median localizations. This result shows that these two metrics are highly correlated, indicating that there is no need to trade high performance on the best events for good performance on the majority of events, at least for localization capability.
III.2 Signal-to-noise ratios of black-hole binaries at
To evaluate the signal-to-noise ratio (SNR) performance of our ensemble of networks, we compute the matched-filter network SNRs for a population of 30–30 binary black hole coalescences at distributed isotropically in the sky and with random inclinations. Fig. 5 shows the performance of the ensemble of networks by plotting each network’s median SNR against its best 10% SNR. Unlike the distribution of localizations, the distribution of SNRs has comparatively little dependence on the location and orientation of the detectors; instead, the network performance is determined predominantly by the network composition, with a scatter .
For the networks consisting of two or more third-generation facilities, a slight anticorrelation of the median and best 10% SNRs can be observed. This can be explained as follows: in all cases the network SNR is dominated by the SNR of the third-generation detector(s) (see Fig. 1). For networks with only one third-generation facility, the network SNR is therefore determined by the antenna pattern of the single third-generation facility regardless of its location and orientation. For networks with two or three third-generation facilities, if these facilities are placed so that their antenna patterns mostly overlap, they will jointly detect events at the antenna pattern maxima with good SNR at the expense of events that are incident close to the antenna pattern minima. This leads to enhanced SNR for the best 10% of events and diminished SNR for the median events. Conversely, if these facilities are placed so that their antenna patterns have little overlap, then the network antenna pattern is more isotropic, leading to better SNR for the median events and worse SNR for the best 10% of events.
III.3 Redshift and inclination angle uncertainties for black-hole binaries at
Precise measurements of black hole redshifts at can constrain the redshift distribution of black hole binaries, and hence the star formation rate and the delay time from star formation to merger Vitale and Farr (2018). Measurement of the redshift is also critical to determining the source-frame mass of the component black holes, and thus their astrophysical origin.
The distance information in a binary coalescence signal is partially degenerate with the inclination angle (iota) of the binary relative to the line of sight. Disentangling the two requires good discrimination of the polarization of the incident wave. To quantify the polarization discrimination ability of a particular network, we can collect the the detector response tensors Dhurandhar and Tinto (1988), each with five independent components, into an matrix and compute its condition number (appendix A). The condition number of the network describes how well the network can reconstruct the parameters of the incoming wave from the detector signals Mohanty et al. (2006); Rakhmanov (2006).
Fig. 6 shows the median redshift uncertainties for 30–30 black hole binaries coalescing at , plotted against the network condition number. Fig. 7 then shows the median versus best 10% redshift uncertainties for 30–30 black hole binaries coalescing at . The equivalent plots for the are shown in Figs. 8 and 9.
The primary conclusion which can be drawn from Figs. 7 and 9 is that a network optimized for the best events will also perform well for typical events, which is similar to the conclusion for sky localization from Fig.4. Figs. 6 and 8, on the other hand, indicate that while there is some dependence on network condition number, distance and inclination angle uncertainties are not strong functions of detector location and orientation.
III.4 Signal-to-noise ratios at high frequency
Gravitational-wave observations at the kilohertz scale can reveal information about nuclear processes from newly merged neutron stars (the so-called “post-merger” phase of the waveform) Clark et al. (2016); Chatziioannou et al. (2017) and the physics behind core-collapse supernovae Gossan et al. (2016). There is considerable uncertainty in the waveforms produced from these events. Therefore, we construct a metric consisting of a uniform strain 1\text{\times}{10}^{-25}\text{,}{\mathrm{Hz}}^{-1}$$ from to and zero elsewhere; this frequency range encompasses the frequency spectra predicted from multiple neutron-star post-merger models and the expected peak gravitational-wave emission frequencies from core-collapse supernovae.
The resulting distributions of signal-to-noise ratios, plotted as median versus best 10%, is shown in Fig. 10. Changing the uniform strain to a frequency-dependent strain does not substantively alter the trends shown in Fig. 10. The conclusion from this plot is very similar to that of SNR for binary black hole systems in Fig. 5: while there is an anti-correlation between the best and the median SNR for networks involving 3G detectors, the magnitude of the effect is too small to be a strong driver of network design choices.
III.5 Stochastic background
A major goal of gravitational-wave astronomy is to detect a stochastic background from unresolved binary systems, and perhaps from primordial fluctuations in the early universe Regimbau et al. (2017). This background appears as a continuous, broadband signal that is correlated between the detectors in a network. It can be expressed in terms of a strain power spectral density , but is more often expressed as a strain energy density .
As a metric, we consider the maximum background that can be resolved with after one year of integration time Allen and Romano (1999), assuming the background is white [] as expected for a primordial background. Ultimately, the performance of the network in constraining a stochastic background is determined by the network’s effective stochastic strain sensitivity, whose power spectral density (PSD) is given by Thrane and Romano (2013)
[TABLE]
where is the overlap reduction function between detectors and , and are the detectors’ strain PSDs, and is the total number of detectors in the network.
The resulting limits on for different networks are shown in Fig. 11, where the networks are again sorted by their condition number. For the networks comprising single-detector facilities (i.e., no ETs), one should expect an anticorrelation between condition number and the limit on since the network’s effective noise is minimized when its overlap reduction functions are maximized [Eq. (1)], which occurs for colocated and co-oriented detectors. On the other hand, the networks containing Einstein Telescope facilities already contain triplets of colocated detectors, for which the overlap reduction functions are large regardless of how the facilities are placed. Thus the stochastic constraining power of the networks with ET facilities show essentially no dependence on the condition number.
Additionally, we find that the network performance for constraining an unresolved white dwarf binary [] is tightly correlated with the constraining power on .
IV Discussion
In this paper we explore a wide variety of potential future gravitational-wave detector networks and quantify their performance in terms of metrics that determine their scientific potential. Rather than engaging in a black-box optimization, we connected network performance to physically meaningful quantities, such as network area, which allowed us to draw general conclusions about the importance of various aspects of the network.
First, the performance of a three-facility network is determined primarily by the type of detectors it contains, rather than the location and orientation of the facilities. This is particularly evident for the signal-to-noise ratio metrics. For the localization metrics, the network area has a large effect on the performance, but nonetheless this effect is subdominant to the effect of the network composition.
Second, there is a steady progression of scientific potential as more third-generation facilities are added to an existing network of evolved second generation detectors. This is to say that a single 3G facility will not be sufficient to do all of the science accessible to a full 3G network, nor do we need to have three 3G facilities in order to expand our gravitational-wave potential beyond the second generation.
Lastly, there is not much difference between ranking networks by their median performance or the performance for the best 10% of events. In the signal-to-noise ratio metrics one observes a slight anti-correlation, particularly for networks with two or three third-generation facilities, but this is a effect that is subdominant to sky localization—as seen in Fig. 3, the largest networks (area ) typically achieve a factor of better BNS localization than networks whose area is similar to the area of the current HLV network ().
We offer several caveats for the results presented here. First, the parameter estimation is Fisher-matrix-based, and the results may differ from a full numerical parameter estimation; this is particularly true for the distance and inclination metrics, where priors play an important role. Second, the systems presented here are localized primarily using timing information; for heavier and higher-redshift systems, amplitude and phase information become important. This means that the relative orientations of different facilities may become important for these systems. Third, time-dependent antenna pattern effects were not considered.
With these caveats, we conclude that when designing a three-facility next-generation network, one must think carefully about which detectors should comprise the network. However, one has a great deal of freedom to choose the locations and orientations of the facilities.
Appendix A Network condition number
This appendix describes the network condition number used in the main text Mohanty et al. (2006); Rakhmanov (2006). A detector network can be viewed as a linear system that transforms an incoming gravitational-wave strain tensor into a set of detector signals. The condition number of this linear transformation quantifies how well the strain tensor can be reconstructed from the detector signals, with a smaller condition number indicating better reconstruction.
The astrophysical strain incident on a detector network is described by a symmetric, traceless Cartesian tensor of order 2; its matrix representation is
[TABLE]
with the additional constraint .
Correspondingly, the response tensor of the th detector to is also a symmetric, traceless Cartesian tensor of order 2, defined as the difference of the outer products of the detector’s arm vectors and :
[TABLE]
The (weighted) signal appearing in the th detector is then given by the contraction of and into a scalar:
[TABLE]
The weights account for the fact that in a heterogeneous network, some detectors respond to the incident strain tensor with greater fidelity than others. In this work we choose to weight the signals by the detector noise performance at : w^{(i)}=1/\sqrt{S^{(i)}(100\text{,}\mathrm{Hz})}, where is the power spectral density of the detector’s strain noise.
A symmetric, traceless, order-2 tensor in has five independent components, meaning that both and the detector tensors can be written in terms of five basis vectors Dhurandhar and Tinto (1988) 444Note, however, that a strain tensor produced by a single source on the sky in fact has only four independent components—the sky locations and , and the plus- and cross-polarization amplitudes and Guersel and Tinto (1989). Similarly, a detector with two equal-length arms also only has four independent components, two angles and to describe the orientation of the X arm and and for the Y arm.. The Cartesian matrix representation of one such (orthonormal) basis is
[TABLE]
With this representation, the system of equations (4) can instead be written as a matrix equation
[TABLE]
where are the elements of a 5-element column vector, and are the elements of an matrix .
One can then define the condition number of the network as
[TABLE]
where is the maximum singular value, and is the pseudoinverse of .
Acknowledgements.
EDH is supported by the MathWorks, Inc. NSF award PHY-1836814 supports third-generation gravitational-wave detector network research in the United States. The authors thank R. Eisenstein for careful reading of the manuscript, as well as S. Vitale, T. Callister and M. Isi for helpful discussions. This article has been assigned the document number LIGO-P1800304.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Abbott et al. (2016) B. P. Abbott et al. (Virgo, LIGO Scientific), Phys. Rev. Lett. 116 , 061102 (2016) , ar Xiv:1602.03837 [gr-qc] . · doi ↗
- 2Abernathy et al. (2011) M. Abernathy, F. Acernese, P. Ajith, B. Allen, P. Amaro-Seoane, N. Andersson, S. Aoudia, P. Astone, B. Krishnan, L. Barack, et al. , “Einstein gravitational wave telescope conceptual design study,” http://www.et-gw.eu/index.php/etdsdocument (2011).
- 3Abbott et al. (2017) B. P. Abbott et al. (LIGO Scientific), Class. Quant. Grav. 34 , 044001 (2017) , ar Xiv:1607.08697 [astro-ph.IM] . · doi ↗
- 4Finn and Chernoff (1993) L. S. Finn and D. F. Chernoff, Phys. Rev. D 47 , 2198 (1993) , ar Xiv:gr-qc/9301003 [gr-qc] . · doi ↗
- 5Miao et al. (2014) H. Miao, H. Yang, R. X. Adhikari, and Y. Chen, Class. Quant. Grav. 31 , 165010 (2014) , ar Xiv:1305.3957 [gr-qc] . · doi ↗
- 6Chen et al. (2017) H.-Y. Chen, D. E. Holz, J. Miller, M. Evans, S. Vitale, and J. Creighton, (2017), ar Xiv:1709.08079 [astro-ph.CO] .
- 7Essick et al. (2017) R. Essick, S. Vitale, and M. Evans, Phys. Rev. D 96 , 084004 (2017) , ar Xiv:1708.06843 [gr-qc] . · doi ↗
- 8Searle et al. (2002) A. C. Searle, S. M. Scott, and D. E. Mc Clelland, Gravitational waves. Proceedings, 4th Edoardo Amaldi Conference, Amaldi 4, Perth, Australia, July 8-13, 2001 , Class. Quant. Grav. 19 , 1465 (2002) , ar Xiv:gr-qc/0110053 [gr-qc] . · doi ↗
