An adaptive scheduling tool to optimize measurements to reach a scientific objective: methodology and application to the measurements of stellar orbits in the Galactic Center
A. Hees, A. Dehghanfar, T. Do, A. M. Ghez, G. D. Martinez, and R. Campbell, J. R. Lu

TL;DR
This paper introduces an adaptive scheduling tool based on Fisher matrix analysis to optimize measurement planning in astronomy, demonstrated through stellar orbit measurements in the Galactic Center, accounting for practical challenges.
Contribution
It presents a novel adaptive scheduling algorithm applicable to various measurements, incorporating systematics and contingencies, with a specific application to stellar orbit observations.
Findings
Radial velocity measurements at turning points are more effective for detecting gravitational redshift.
The methodology optimizes measurement timing considering systematics and potential data loss.
Identification of key measurement points sensitive to relativistic effects and distance estimation.
Abstract
In various fields of physics and astronomy, access to experimental facilities or to telescopes is becoming more and more competitive and limited. It becomes therefore important to optimize the type of measurements and their scheduling to reach a given scientific objective and to increase the chances of success of a scientific project. In this communication, extending the work of Ford (2008) and of Loredo et al. (2012), we present an efficient adaptive scheduling tool aimed at prioritzing measurements in order to reach a scientific goal. The algorithm, based on the Fisher matrix, can be applied to a wide class of measurements. We present this algorithm in detail and discuss some practicalities such as systematic errors or measurements losses due to contigencies (such as weather, experimental failure, ...). As an illustration, we consider measurements of the short-period star S0-2 in our…
| Order of importance | Epoch | Type of observation | Stat. SNR |
|---|---|---|---|
| 1 | 2018.286 | RV | 4.50 |
| 2 | 2018.283 | RV | 4.97 |
| 3 | 2018.700 | RV | 5.47 |
| 4 | 2018.264 | RV | 6.09 |
| 5 | 2018.275 | RV | 6.33 |
| 6 | 2018.699 | RV | 6.53 |
| 7 | 2018.272 | RV | 6.71 |
| 8 | 2018.700 | astro | 6.84 |
| 9 | 2018.288 | RV | 6.98 |
| 10 | 2018.696 | RV | 7.10 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
An adaptive scheduling tool to optimize measurements to reach a scientific objective: methodology and application to the measurements of stellar orbits in the Galactic Center
A. Hees11affiliationmark: 22affiliationmark: , A. Dehghanfar22affiliationmark: 33affiliationmark: , T. Do22affiliationmark: , A. M. Ghez22affiliationmark: , G. D. Martinez22affiliationmark: , R. Campbell44affiliationmark: , J. R. Lu55affiliationmark:
1SYRTE, Observatoire de Paris, Université PSL, CNRS, Sorbonne Université, LNE, 61 avenue de l’Observatoire, 75014 Paris, France
2Department of Physics and Astronomy, University of California, Los Angeles, Los Angeles, CA 90095, USA
3Institut de Planétologie et d’Astrophysique de Grenoble (IPAG), 120 rue de la Piscine, 38041 Grenoble, France
4W. M. Keck Observatory, 65-1120 Mamalahoa Highway, Kamuela, HI 96743, USA
5Astronomy Department, University of California, Berkeley, CA 94720, USA
Abstract
In various fields of physics and astronomy, access to experimental facilities or to telescopes is becoming more and more competitive and limited. It becomes therefore important to optimize the type of measurements and their scheduling to reach a given scientific objective and to increase the chances of success of a scientific project. In this communication, extending the work of Ford (2008) and of Loredo et al. (2012), we present an efficient adaptive scheduling tool aimed at prioritzing measurements in order to reach a scientific goal. The algorithm, based on the Fisher matrix, can be applied to a wide class of measurements. We present this algorithm in detail and discuss some practicalities such as systematic errors or measurements losses due to contigencies (such as weather, experimental failure, …). As an illustration, we consider measurements of the short-period star S0-2 in our Galactic Center. We show that the radial velocity measurements at the two turning points of the radial velocity curve are more powerful for detecting the gravitational redshift than measurements at the maximal relativistic signal. We also explicitly present the methodology that was used to plan measurements in order to detect the relativistic redshift considering systematics and possible measurements losses. For the future, we identify the astrometric turning points to be highly sensitive to the relativistic advance of the periastron. Finally, we also identify measurements particularly sensitive to the distance to our Galactic Center: the radial velocities around periastron and the astrometric measurements just before closest approach and at the maximal right ascension astrometric turning point.
Subject headings: methods: data analysis – telescopes
1. Introduction
In all fields of physics, measurements and observations are costly and access to experimental facilities or telescopes becomes more and more competitive. It is therefore of prime importance to optimize measurements in order to make the most out of the time allocated to measurements or observations. Many examples of measurements that would benefit from an optimization can be found like e.g. measurements of stellar orbits in the Galactic Center (GC) in order to infer astrophysical and fundamental physics properties (see e.g. Boehle et al., 2016; Gillessen et al., 2017; Gravity Collaboration et al., 2018; Do et al., 2019), measurements of exoplanets in order to infer their orbital parameters (see e.g. Konopacky et al., 2016), measurements of asteroids in order to infer orbital parameters, detect relativistic effects or infer their internal properties (see e.g. Farnocchia et al., 2013; Hees et al., 2015; Greenberg et al., 2017; Verma et al., 2017), Lunar Laser Ranging measurements to infer the properties of the Moon and to perform tests of fundamental physics (see e.g. Nordtvedt, 1998; Bourgoin et al., 2016, 2017; Viswanathan et al., 2018), radioscience tracking of spacecraft through the Deep Space Network antennas to follow the spacecraft motion or to infer planetary properties (see e.g. Moyer, 2003), specific measurements using atomic sensors in order to probe fundamental physics (see e.g. Pihan-Le Bars et al., 2017; Safronova et al., 2018), etc. This list is not exhaustive and illustrates the wide range of projects that would benefit from an optimization of their measurements to reach a specific scientific objective.
Identifying what are the optimal measurements (type of measurements and scheduling) in order to reach a given scientific objective is not an easy task considering the large parameters space over which one needs to optimize. Loredo & Chernoff (2003); Loredo (2004); Loredo et al. (2010, 2012) developed a Bayesian adaptive scheduling algorithm aiming at seeking optimized observing times for radial velocity characterization of elliptical orbits in the context of exoplanets. Another adaptive scheduling algorithm dedicated to exoplanets searches has been proposed by Ford (2008). Both these algorithms are based on maximizing the global information entropy increase due to one additional measurement and iterate. Maximizing the global information entropy is also known as maximum entropy sampling (Loredo et al., 2012). In this communication, we present another adaptive scheduling algorithm that presents three differences with respect to previous works: (i) the optimization can be done on a specific parameter of interest, for a model comparison, instead of on the global information content only ; (ii) the algorithm can be applied to any types of measurements and (iii) the algorithm is extremely fast and efficient from a computational point of view. The algorithm presented in this communication is a simple but efficient algorithm designed to prioritize measurements based on their sensitivity to a scientific objective. This algorithm relies on the computation of the Fisher matrix (Fisher, 1935) and assumes Gaussian statistics on measurements errors. Practically, this adaptive scheduling tool takes as an input existing measurements, the model that is fitted to these measurements and the criterion on which one wants to optimize and it uses a greedy algorithm to identify a set of measurements that is optimized with respect to the given criterion. It is important to emphasize that it is possible to take into account possible observational window due to seeing of a target from Earth, maintenance time of the facility, etc. In this communication, we consider three criteria for which we can prioritize our measurements: (i) the global information content inside a dataset (similarly to the work from Ford (2008) and Loredo et al. (2012).), (ii) the uncertainty related to a scientific parameter that is estimated using the measurements or (iii) the ratio of the Bayesian evidence between two different models, a criteria used to compare two models. The set of optimized measurements will in general depend on the objective considered. The method presented in this paper is based on the computation of the Fisher matrix (Ly et al., 2017) whose inverse can be used as an estimate of the covariance matrix (see e.g. Vallisneri (2008); Heavens (2009) or the Appendix from Albrecht et al. (2006)). Instead of computing the inverse of the Fisher matrix a large number of times, the algorithm uses an efficient technique to update the inverse of the Fisher matrix.
The algorithm, fully described in section 2, can be applied to a wide class of measurements and of scientific projects. In section 3, we compare this algorithm to existing work in the literature, discuss some of its current limitations and also discuss some practicalities related to systematic effects, measurement losses due to contingencies and experimental windows. In section 4, we present an application of this tool to the case of stellar orbits measurements in the GC. It has long been anticipated that the measurements of the bright short-period star S0-2 (also named S2) orbiting the supermassive black hole (SMBH) in our GC can be used to detect and probe relativistic effects (Psaltis, 2008; Johannsen, 2016; Johannsen et al., 2016; Alexander, 2005; Jaroszynski, 1998; Rubilar & Eckart, 2001; Weinberg et al., 2005; Zucker et al., 2006; Will, 2008; Angélil & Saha, 2010; Angélil et al., 2010; Iorio, 2011; Parsa et al., 2017; Hees et al., 2017). The detection of such relativistic effects relies on the long time baseline measurements of this star that has been measured astrometrically since the early 1990’s and spectroscopically since the early 2000’s at Keck by the Galactic Center Orbit Initiative (GCOI) (Ghez et al., 1998, 2000, 2003, 2005b, 2005a, 2008; Boehle et al., 2016) and at the NTT and the VLT (Genzel et al., 1997; Eckart & Genzel, 1997; Schödel et al., 2002; Eckart et al., 2002; Eisenhauer et al., 2003, 2005; Gillessen et al., 2009, 2017) combined with new high accurate measurements. The question of interest is: given the existing measurements of the star S0-2, what are the future measurements that will increase the chances of detection of relativistic effects? First, we show how the algorithm developed in this communication has been used to plan the 2018 GCOI observations campaign in order to successfully measure the relativistic redshift (Do et al., 2019) (for another measurement, see also Gravity Collaboration et al. (2018)). Secondly, we identify clearly what are the measurements within S0-2’s orbit that are relevant to detect the relativistic advance of periastron in the near future. Third, we identified what are the measurements susceptible to improve the measurement of the distance to our Galactic Center , an important point to plan future measurements but also to search for systematic effects in existing measurements in order to understand the tension between the two estimates of from the Gravity collaboration (Gravity Collaboration et al., 2018) and the GCOI team (Do et al., 2019). Finally, our conclusions are presented in section 5.
2. Methodology
2.1. Formulation of the problem
The goal of most measurements performed is to extract some scientific information by fitting a model to the observations. We will assume the measurements errors to be normally distributed and we will note one measurement by where is the type of measurement, is the time of the measurement and is the measurement uncertainty. In addition, we will note the model used to analyze the measurements as where the vector denotes all the parameters that are included in the fit. These include the parameters that are scientifically relevant but can also include other parameters such as nuisance parameters or noise parameters. In some cases, one is interested in comparing two models and (which do not necessarily depend on the same number of parameters) in order to identify the one favored by the data.
In addition, we will denote the set of existing measurements by and the set of possible (future) measurements by . Note that the set of possible future measurements is discrete and takes into account experimental/observational constraints (observational windows, maintenance time for experimental facilities, etc…).
The question we are addressing in this communication can be formulated as follow: given a set of existing measurements, how to prioritize the measurements from a set of possible future measurements in order increase the sensitivity with respect to a given scientific objective?
Three types of scientific objective will be considered: (i) a case where one is interested by maximizing the global quantity of information (the information entropy) related to the full posterior distribution, similarly to what has been developed by Ford (2008) and Loredo et al. (2012), (ii) a case where one focuses on one scientific parameter and where one wants to minimize the uncertainty related to the estimation of that specific parameter and (iii) a case where one wants to maximize the ratio of the Bayesian evidence between two models in order to maximize the chances to discriminate one model over the second one.
2.2. The Fisher matrix as a tool to evaluate quickly the uncertainties and the information entropy
The implementation of the adaptive scheduling tool presented in this paper is based on the Fisher matrix. This implicitly assumes that the posterior probability distribution function can be approximated by a multivariate normal distribution. In particular, this is the case when the errors follow a Gaussian distribution and the model can be approximated by its linearization around the maximum of likelihood. Under this assumption, the Fisher matrix is a good estimate of the inverse of the covariance matrix for unbiased estimators. The Fisher matrix, a square matrix of dimension , can be computed by (see e.g. Vallisneri (2008); Heavens (2009) or the appendix from Albrecht et al. (2006))
[TABLE]
where is the matrix of the partial derivatives of the model with respect to the parameters
[TABLE]
where corresponds to the uncertainty of the th measurement. The matrix of the partial derivatives has dimension .
An estimation of the uncertainties on the estimated parameters resulting from a fit of the model using a set of measurements can be determined from the covariance matrix , which is the inverse of the Fisher matrix (see e.g. Vallisneri (2008); Heavens (2009) or the appendix from Albrecht et al. (2006))
[TABLE]
In particular, the marginalized uncertainties of the estimated parameters are given by the diagonal of this matrix (Heavens, 2009):
[TABLE]
It is important to note that this covariance matrix depends on (i) the measurements dataset (the timing and the type of measurements and their related uncertainties ) and (ii) on the set of fitted parameters (i.e. ).
In addition, Ford (2008) has shown that the expected difference in information entropy that is induced by an additional measurement is given by (see Eq. (25) from Ford (2008))
[TABLE]
where is the model uncertainty for the measurement obtained while not including this measurement in the fit and is the expected measurement uncertainty. The notation denotes the difference in the quantity induced by adding the measurement to the existing dataset. This relationship means that the optimal measurements in order to increase the global information entropy are the ones with large model uncertainties (or, quoting Loredo et al. (2012): “we learn the most by sampling where we know the least”), which is called maximum entropy sampling (Sebastiani & Wynn, 2000).
Finally, under the assumption that the posterior is a multivariate normal distribution, the model uncertainty can directly be related to the inverse of the Fisher matrix through
[TABLE]
where is the covariance matrix obtained when the measurement is not included in the fit and is a column vector containing the partial derivatives of the measurement with respect to the fitted parameters :
[TABLE]
One statistical tool to compare two models using a set of measurements is to use the Bayesian evidence, defined as the probability to obtain the data given a certain model (or in other words, the evidence is the normalization of the posterior) (see e.g. Jaynes, 2003; Gregory, 2010; Gelman et al., 2013; Kass & Raftery, 1995; Liddle, 2007; Knuth et al., 2014). Under the assumption that the prior knowledge about the two models are uniform, the Bayesian evidence is also directly proportional to the probability of a certain model given the dataset. Comparing the Bayesian evidences from fits using two different models is one way to compare these models and to discriminate them. Using the Laplace quadratic approximation, the Bayesian evidence for a model 1 is given by (Liddle, 2007)
[TABLE]
where is the maximum of the posterior, is the number of parameters fitted in the model 1 and is the determinant of the matrix . The difference of the log-evidence between two models is therefore given by
[TABLE]
where and are the likelihood and prior of the model for the parameters that maximize the posterior. The first two terms (the ratio of the priors and the number of fitted parameters) do not depend directly on the scheduling of the measurements. On the other hand, the third part of this equation depends on the actual measurements themselves. This dependency makes it hard to find the sequence of measurements optimal to discriminate between two models (since the actual measurements are not known). One solution is to consider a Gaussian likelihood and to simulate data using one of the two models. Let us assume that model 1 is the model we are trying to confirm from the data and let us use this model to simulate the measurements so that the last equation becomes
[TABLE]
In conclusion, in this section we have shown how the Fisher matrix can be used to characterize the different objectives that one may want to optimize on:
- •
the global information entropy: it requires to optimize over Eq. (6) which depends on the inverse of the Fisher matrix.
- •
the uncertainty of one specific parameter of interest (): it requires to optimize over one component of the diagonal of the inverse of the Fisher matrix.
- •
a criterion to optimize the chances to discriminate a model over a model : one suggestion is to optimize over an estimate of the difference of the log of the Bayesian evidence, given by Eq. (10). This quantity depends on the evaluation of the models themselves and on the determinant of their Fisher matrices.
In the next two subsections, we will discuss how to estimate efficiently the Fisher matrix, its determinant and its inverse.
2.3. Computation of the partial derivatives of the model
As can be noticed from Eq. (2), one needs to compute the partial derivatives of the model with respect to the parameters in order to compute the Fisher matrix. Three methods can be used to compute these derivatives:
analytically compute the derivatives of the model. This method has the advantage to be of quick evaluation; but it requires long calculations and code. When the model relies on the integration of a set of ordinary differential equations of motion, this method also requires integration of the equations of variations (as an example, see section 4 from Lainey et al. (2004)). 2. 2.
use automatic differentiation. The main advantage of this method is that one does not need to derive explicitly the expression of the partial derivatives. 3. 3.
use finite differences. This method has the advantage of easy implementation (especially when the model requires the integration of differential equations) but has the disadvantage that it can be numerically unstable.
Each of these methods has pros and cons and they need to be chosen on a case by case basis considering each specific situation.
2.4. Update of the Fisher matrix, of its determinant and of its inverse
One of the main step in the algorithm developed in this paper is to compute the Fisher matrix, its determinant and its inverse. Indeed, the goal of the adaptive scheduling tool is to optimize either on one diagonal component of or on its determinant . Therefore, we will need to evaluate these quantities for a huge number of different combinations of measurements and it is important to compute them using a method that is computationally efficient. Recomputing the full Fisher matrix and inverting it for all the set of measurements is computationally intensive. An alternative approach is to update the the Fisher matrix as well as its determinant and its inverse to avoid to invert the full matrix again.
In other words, if we consider the Fisher matrix obtained for a set of measurements , we can efficiently compute the Fisher matrix which is obtained from the same set of measurements with an additional measurement . The procedure is straightforward and is given by
[TABLE]
Following the same principle, the determinant of the Fisher matrix can easily be update
[TABLE]
where .
Finally, the procedure to update the covariance matrix (the inverse of the Fisher matrix) is given by
[TABLE]
where is the update matrix which depends on the measurement and is given by
[TABLE]
where the vector is given by Eq. (7).
To summarize, adding one measurement to a given dataset will lead to a change of the different criterion that we are optimizing on given by the following expressions:
[TABLE]
- •
optimization over the global information entropy (maximization):
[TABLE]
where the column vector is given by Eq. (7).
- •
optimization over one specific parameter of interest (minimization of the related uncertainty):
[TABLE]
using Eq. (14).
- •
optimization for model comparison (maximization of the ratio of the Bayesian evidence):
[TABLE]
where the subscript 1 and 2 refers to the two models to be compared.
2.5. Description of the algorithm
To start the algorithm, we need a set of existing measurements and a set of possible future measurements . We also need to know what is the scientific objective for which we are optimizing for: (i) maximize the global information entropy or (ii) minimize the uncertainty related to a given parameter or (iii) maximize the odd ratio between two models.
Our implementation of the adaptive scheduling tool is iterative. There are two different ways to control the number of iterations. The first way is to chose arbitrarily the desired number of measurements () and to find a scheduling for these measurements that is optimized with respect to a given criterion. The second way is to stop the iterations when we reach a given objective (for example when the uncertainty on an estimated parameter is below a given threshold). In this second case, the number of of measurements is free as well.
The algorithm consists in the following steps:
fit the model to the existing measurements in order to determine the optimal parameters (in the case of models comparison, the two models need to be fitted). For this step, the actual data needs to be known and not only their uncertainties. 2. 2.
compute the partial derivatives of the model (of the two models in case of models comparison)
[TABLE]
for all the existing and possible future measurements. This evaluation can be computationally intensive (depending on the model) but is required only once. 3. 3.
compute the starting covariance matrix from the set of existing measurements by inverting the Fisher matrix, see Eq. (3). 4. 4.
iteratively determine the set of optimal measurements by using the following procedure (starting with and with an empty list of optimized measurements )
- (a)
loop on all the possible future measurements and for each of them compute the quantity from Eqs. (15) depending on the scientific objective considered. 2. (b)
find the measurement that maximizes , add it to the list of optimal measurement and remove it from the list of future possible measurements . 3. (c)
update the covariance matrix by adding the measurement using Eq. (13), increase and iterate by going back to step (a) as long as the number of measurements in the optimal set of future observations is smaller than , or as long as we have not reached the given threshold.
This algorithm is very efficient and allows one to prioritize quickly measurements based on a criteria related to a scientific objective.
3. Discussion
The algorithm presented in the previous section can be applied to a wide class of measurements. Our algorithm presents several differences compared to existing ones (see as en example, Ford (2008) and Loredo et al. (2012)). First of all, our algorithm is based on the Fisher matrix and not on a full Bayesian approach. This has the drawback that our algorithm is not guaranteed to work when posteriors are extremely non Gaussian or when they present multiple modes. The main advantage of using the Fisher matrix relies in the efficiency and in the computational cost of the method. Indeed, we have shown in the previous section that there exist a method to efficiently update the Fisher matrix, its determinant and its inverse. This allows one to consider a huge number of possible future measurements without any problem (optimization over sets of possible measurements can easily be considered).
Another difference between this work and the one presented in Ford (2008) and in Loredo et al. (2012) relies in the criterion that we are optimizing for. In Ford (2008) and in Loredo et al. (2012), the scheduling is performed to optimize the expected global information entropy (maximum entropy sampling). Here, we also consider two other options. The first alternative is to prioritize measurements to minimize the uncertainty of a scientific parameter. The second alternative is to prioritize measurements to maximize the chances to discriminate a model over another one. The description of the algorithm presented in the previous section includes the three options.
The algorithm presented in section 2 is a greedy optimizer (similarly to the one used in section 3 from Loredo et al. (2012)): it chooses the measurement that leads to the best one-time increase of the metric optimized for and proceeds iteratively. While the resulting measurements sequence is optimized for a particular criterion, this algorithm is not guaranteed to converge towards the global optimal solution. Still, it has been shown in Loredo (2004); Loredo et al. (2012), using a similar optimization algorithm, the superiority of adaptive scheduling over random sampling. This superiority will also be quantified in the three examples developed in section 4 below. The global optimum of the non-linear and discrete optimization problem related to the problem presented in section 2.1 can be determined by a brute force method whose complexity grows dramatically as 111This is valid when . where is the number of possible future measurements and is the length of the optimal sequence. In practice, can be huge (as high as ), which justifies the use of a algorithm. In the future, it would be interesting to study more complex algorithms that could provide a better optimization compared to the greedy algorithm from section 2 in a reasonable computation time complexity.
One input for the algorithm from section 2 is the uncertainties related to possible future measurements. It is important to note that these uncertainties can vary from measurement to measurement to take into account practicalities such as, for examples, visibility from a telescope, different integration times at different observing epochs, confusion events, geometrical consideration (in the case of range measurements for example), etc. Such variations for the uncertainties are taken into account and automatically considered in the presented algorithm. On the other hand, as presented in section 2, the algorithm cannot take into account the more general case where the uncertainty for a given future measurement is given as a distribution instead of a given value. One can overcome this drawback by implementing a Monte Carlo algorithm on top of the adaptive scheduling tool. For each instance of the Monte Carlo, the uncertainties considered in the adaptive scheduling would be drawn randomly. More pragmatically and more efficiently, one can consider several cases of measurements uncertainties (e.g. a “realistic”, an “optimistic” and a “pessimistic”) and run the adaptive scheduling tool on those limited number of cases to assess the impact of the future uncertainties on the scheduling.
We would like to emphasize that practicality such as observational window, maintenance time, etc. can easily be taken into account in our approach. Indeed, our algorithm is based on a brute force exploration of all the possible future measurements. This set of possible measurements can therefore include gaps or other constraints.
It is important to note that the algorithm presented in this communication is purely based on statistical consideration. In practice, systematic effects usually play an important role when analyzing measurements and experimental losses (losses due to weather, to unexpected downtime of a telescope or of an experimental facilities, …) can deter the expected scientific goal. While the results that can be obtained from a scheduling tool are extremely interesting to identify optimized measurements, it is important to take into account systematics and losses when planning real measurements. When two types of measurements are considered (like the case of stellar orbits in the GC where the two types of measurements are the astrometric and the spectroscopic measurements), we suggest to plan both measurements roughly at the same epochs even though only one type of measurement is required from the adaptive scheduling tool. Using both types of measurements at the same time will help to control for systematics.
In addition, we suggest to consider measurements losses (atmospheric losses, unexpected maintenance, etc.) by using a very simple statistical model for the losses and by running a Monte Carlo and to determine the distribution of the expected scientific results given this statistical model. This can help to estimate the number of measurements required to reach a given objective (see section 4.2 below for a practical example).
Finally, in Eq. (2), we implicitly assumed that the measurement error is a white noise. This can easily be generalized to the case where the measurement errors are correlated. In that case, the measurement noise is characterized by a covariance matrix , which is assumed to be diagonal in Eq. (2). A covariance matrix is a definite positive matrix so that it is always possible to find a decomposition of the form
[TABLE]
using e.g. a Cholesky decomposition. The algorithm presented in the previous section can directly be applied by replacing Eq. (2) by
[TABLE]
4. Application to the detection of relativistic effects on stellar orbits in the Galactic Center
To illustrate the utility of the algorithm presented in the previous section, we use the case of the measurements of stellar orbits in the GC as an example. Of particular interest is the star S0-2 (also named S2) which has been followed astrometrically and spectroscopically for nearly 25 years (Eckart & Genzel, 1997; Ghez et al., 1998, 2000; Schödel et al., 2002; Eckart et al., 2002; Eisenhauer et al., 2003; Ghez et al., 2003, 2005b, 2005a; Eisenhauer et al., 2005; Ghez et al., 2008; Gillessen et al., 2009; Boehle et al., 2016; Gillessen et al., 2017). One of the long standing scientific objective pursued by measuring the motion of this star is to measure relativistic effects around the supermassive black hole and to test the gravitational theory of general relativity (Psaltis, 2008; Johannsen, 2016; Johannsen et al., 2016; Alexander, 2005; Jaroszynski, 1998; Rubilar & Eckart, 2001; Weinberg et al., 2005; Zucker et al., 2006; Will, 2008; Angélil & Saha, 2010; Angélil et al., 2010; Iorio, 2011; Parsa et al., 2017). Detection of relativistic effects using S0-2’s measurements relies on the long standing dataset available and on new high accurate measurements. The first relativistic effects measurable using S0-2’s measurements are the relativistic redshift and the relativistic advance of the periastron. Recently, the impact of the relativistic redshift on S0-2’s radial velocity has been measured successfully (Gravity Collaboration et al., 2018; Do et al., 2019). In this section, we will show how the adaptive scheduling tool presented in the previous section has helped the GCOI team in planning the 2018 observations campaign that led to a successful detection of the relativistic redshift (Do et al., 2019). In addition, we will show how the same tool can be used to identify suitable measurements in order to enhance the detection of the relativistic advance of the periastron in the near future. Finally, we will also prioritize measurements to improve the knowledge of , our distance to the GC.
4.1. Data and model
The set of existing measurements used in this analysis consists in astrometric measurements performed at the Keck observatory between 1995 and 2017 and on spectroscopic measurements performed at the Keck observatory and at the Very Large Telescope (VLT) between 2000 and 2017. These measurements are reported in the literature (Gillessen et al., 2017; Do et al., 2019) and here, we consider them as independent. The model used for this analysis relies on the integration of the first post-Newtonian equations of motion and includes the Römer time delay and the first order relativistic redshift. This model is fully described in the Appendix of Do et al. (2019). There are 15 parameters that characterize this model: the six orbital parameters for the star S0-2, the SMBH mass parameter , the distance to the GC , the 2-D position and 3-D velocity of the SMBH, a parameter that encodes a deviation from the relativistic redshift and a parameter that encodes a deviation from the first post-Newtonian order in the equations of motion (or in other words that encodes a deviation from the relativistic advance of the periastron).
4.2. An example of scheduling including systematics and weather losses to detect the relativistic redshift on S0-2
In this section, we give a full example of telescope time scheduling using the tool presented in section 2 including considerations related to systematics and to weather and instrumental losses. We focus on the measurement of the relativistic redshift of S0-2 and therefore fix the parameter that encodes a deviation from the first post-Newtonian equations of motion to its relativistic value (i.e. to unity). Therefore, the model contains 14 parameters. The objective is to find a measurement sequence in 2018 that would optimize the chances of a successful detection of the relativistic redshift given the set of existing measurements (up to 2017). The set of future possible measurements considered is made of two distinct sets: the first one consists in astrometric measurements for every day between March 1st 2018 and September 15th 2018 and the second one consists in radial velocity measurements for every day in the same time period. The uncertainty for the future measurements has been chosen as 20 km/s for the radial velocity and 0.9 mas for the astrometry.
The sequence of 10 measurements optimized in order to detect the relativistic redshift (we are optimizing on the redshift parameter) using the algorithm presented in section 2 is presented in table 1 and on figure 1. It is important to keep in mind that this result depends on the existing measurements: different existing measurements could have led to another sequence. This sequence also depends on the expected accuracy of the future possible measurements and on the parameters that are included in the fit.
Two interesting facts need to be noticed about the optimized measurements sequence obtained in this analysis. First, 9 out of the 10 measurements are spectroscopic observations. This makes sense considering that the redshift impacts directly the radial velocity but not the astrometry. A more surprising fact is related to the fact that the adaptive scheduling tool does not favor measurements at the maximum of the redshift signal. Rather, it favors measurements that are at the radial velocity turning points (i.e. the maximum and minimum of the radial velocity curve, see figure 1). This is due to correlations between the redshift parameter and the other model parameters, in particular with (the time of closest approach for S0-2) and with the SMBH mass parameter . These correlations are maximal exactly when the redshift signal is maximal, making that epoch not optimal in order to measure the redshift signal. In other words, the redshift signal can easily be absorbed by a small change in or in . To demonstrate the effect of correlations, we run a test case and search for the optimized measurements sequence if we fit for all the model parameters except for and . The orange measurements from figure 1 are the radial velocity measurements optimized for that case using the algorithm from section 2. When and are not included in the fit, these measurements are all located close to the maximal of the redshift signal, as expected.
It is interesting to quantify the impact of an optimized scheduling compared to a “naive” one. For this, let us consider the gain in the relativistic redshift signal to noise ratio (SNR) due to one RV measurement in 2018. The redshift SNR using measurements existing up to 2017 is of 1.16. A naive scheduling consisting in performing one additional measurement at the time where the redshift signal is maximal induces a SNR increase of 4.5 % (the SNR being increased up to 1.205), while an optimized scheduling consisting in performing the measurement at the time where the RV is maximal increases the SNR by 335 % (the SNR value is now 4.5). This illustrates clearly the benefit of an optimized scheduling and how such a scheduling tool can help to increase the outcome of a project.
At this stage, the important point learned from our adaptive scheduling tool is that, from a statistical point of view, nearly all the power to measure the relativistic redshift comes from spectroscopic measurements around the two RV turning points. This result relies purely on statistical considerations and practicalities and systematics need to be considered as well. First of all, other scientific longer-term objectives will be pursued with S0-2’s measurements. It is important to realize that measurements around closest approach, while not of prime importance for the redshift, are important to measure other scientific parameters like e.g. (see section 4.4) and the relativistic advance of the periastron (see section 4.3). Therefore, we included measurements at closest approach, an important event as well. In addition, two strategies have been used to control systematics for the relativistic redshift measurement: (i) we decided to use different telescopes (Keck and Gemini) to measure the radial velocities in order to assess hypothetical biases related to one instrument and (ii) we decided to take also astrometric measurements at the same epochs as radial velocities and vice versa (see the discussion in section 3). Finally, the results from Table 1 have been obtained assuming a perfect knowledge of S0-2’s orbital parameter. In practice, there is an uncertainty associated to these parameters. In particular, at the end of 2017, the uncertainty related to the 2018 time of closest approach was of the order of 5 days. In order to control for a possible bias in the estimate, we included measurements very early in 2018.
In addition, it is important to consider instrumental losses and weather losses. Using different telescopes (Keck and Gemini) for the capture of the important measurements is one way to mitigate the risk of missing important measurements because of instrumental losses (in addition to allow us to check for systematics). In addition, we increased the number of requested measurements in 2018 in order to take into account for possible measurements losses. A careful analysis of the Keck Observatory statistics shows that roughly 50 % of GC measurements nights are lost because of weather or instrumental problems. This has been included in our scheduling by making a large number () of simulations where each measurement has a probability of 50% to be lost and by assessing the distribution of the relativistic redshift signal to noise ratio at the end of 2018.
To summarize, the scheduling of the 2018 GC measurements campaign for the GCOI team was based on: (i) radial velocities measurements very early in 2018 to control our estimate of ; (ii) radial velocities at the two radial velocities turning points, the most important measurements to detect the redshift, using at least two different instruments to control for systematics ; (iii) measurements at closest approach important for other scientific objectives ; (iv) a dual spectroscopic-astrometric measurements at the important epochs to control for systematics ; (v) the total number of measurements has been determined by requiring that the relativistic redshift SNR at the end of 2018 has a probability of at least 99.9 % to be above 5 considering that each night has a 50% chance to be lost and (vi) the exact scheduling has been determined by considering a 3 days weather pattern. The scheduling resulting from this analysis is presented on the top of figure 2 and consists in 36 spectroscopic measurements (for Keck and Gemini) and 10 astrometric measurements. For the sake of comparison, the actual measurements performed in 2018 are presented in the lower panel from figure 2. They consist in 19 radial velocity measurements and 5 astrometric measurements. The main sources of losses for 2018 are related to bad weather and to eruptions from the volcano in Hawaii, one of them being accompanied by an earthquake that damaged the instruments for a couple of days. The distribution of the relativistic redshift SNR at the end of 2018 using the scheduling described in this section and assuming that each measurement has a 50% chance of being lost is presented on the right panel from figure 3. The probability to have a SNR below 5 is 0.03 %. The SNR presented on this figure is a statistical SNR. No systematics is considered in this analysis and systematics will deter the SNR calculated here. The analysis of the real data has shown that including systematics such as offsets between instruments, correlations in the astrometric positional uncertainty or additional systematics uncertainties will produce a systematic uncertainty of the same order of magnitude as the statistical uncertainty (see Do et al. (2019) for the analysis of the measurements and a detailed study of systematics). The right panel of figure 3 shows the evolution of the SNR with time in 2018. The blue curve corresponds to the evolution expected from the original scheduling while the orange curve corresponds to the evolution corresponding to the real measurements. Although nearly half of the measurements in 2018 have been lost, the final SNR obtained in the analysis remains above 5. This is due to the careful planning and to the number of measurements scheduled but also to the fact that some of the measurements were actually better than anticipated.
This first example shows the utility of the adaptive scheduling tool presented in section 2 and how it can lead to surprising results. As a result of this analysis, the GCOI team decided to put more effort in measuring the radial velocity of S0-2 around the two turning points in order to maximize the chances of a successful detection of the relativistic redshfit (Do et al., 2019). We also developed into details how the GCOI has decided to include systematics and possible weather/instrumental losses in the scheduling of the 2018 measurement campaign.
4.3. Measurements on S0-2’s orbit optimized to detect the relativistic advance of periastron
The relativistic redshift is only the first relativistic effect measurable using stellar orbits in the GC. The next effect expected to be measured is the well-known advance of periastron which is due to the first post-Newtonian relativistic correction to the equations of motion. In this section, we are interested in prioritizing measurements in order to measure this effect. For this reason, we fix the redshift parameter to its relativistic value (i.e. to unity) but we let the parameter that encodes a deviation from the first post-Newtonian equation of motion to be free. As a result, the model contains again 14 parameters. The set of “existing measurements” consists in astrometric and radial velocity measurements of S0-2 up to 2018. The set of future possible measurements consist in astrometric and spectroscopic measurements for each year between March and September (the yearly window for which GC measurements are feasible from Earth) on a daily basis. The uncertainty considered for the future measurements is of 0.2 mas for the astrometry and of 20 km/s for the radial velocity.
As a first step, we run the adaptive scheduling tool considering possible measurements between 2018 and 2058. The distribution the measurements from an optimized sequence of 30 observations is presented in figure 4. While of limited interest, this figure shows three interesting features. First, it shows that, as anticipated, the detection of the advance of the periastron relies mainly on the astrometry. Secondly, it shows that the optimal strategy to measure the advance of the periastron given a certain measurement window is to take measurements as late as possible. This is easily explained considering that this effect is a secular effect that increases with time. Finally, we can notice that measurements around periastron (2050 correspond to a time of closest approach) are of prime importance. This last point will be developed further below. Changing the possible measurements window does not change qualitatively these three results.
Figure 4 is somehow of limited interest because “late” measurements are always favored by the adaptive scheduling tool. A more interesting result consists in identifying the regions within one orbit that are particularly sensitive to the advance of the periastron. One way to derive such a result is to run the adaptive scheduling tool multiple times by increasing the measurements window one year at a time and to save all the optimized measurements sequence after removing the measurements that correspond to the last year of the observational window (to discard the “late” measurements effect due to the secular effect). The result from such an analysis is presented in figure 5 and is particularly interesting. The most sensitive measurements are astrometric measurements around closest approach. It is interesting to note that the closest approach itself is not favored by the adaptive scheduling tool (see the bottom left panel from figure 5) because of correlations with other fitted parameters. Rather, it is important to have measurements bracketing the closest approach. The second important set of measurements consists of measurements around apoastron. This can easily be interpreted since this is the location where a precession of the orbit will have the largest impact. The next set of important measurements consists in the right ascension turning point (astrometric measurements around the maximum of the right ascension). In addition, while radial velocity measurement are less crucial for the detection of the relativistic advance of periastron, the important spectroscopic measurements are all located around the closest approach and at the two turning points in the radial velocity curve.
In order to quantify the impact of an optimized scheduling obtained using the algorithm from section 2, we compare the gain in the relativistic (advance of periastron) SNR after one orbital period for two different scheduling strategies. More precisely, we use the current existing measurements and compare two schedulings of 32 astrometric measurements for the next 16 years. The first scheduling consists in a “naive” sampling for which 2 measurements are taken every year and the second scheduling results from our optimization such that the measurements are timely distributed following the results presented in the top left panel from figure 5. The SNR resulting from a regular sampling is of 4.1 while the SNR resulting from the optimized scheduling is of 5.9, a relative increase of 45%. This shows that, although the scheduling resulting from the algorithm presented in section 2 is not guaranteed to be the global optimum (see the discussion in section 3), it is nevertheless superior than standard non-optimized scheduling and it can be helpful for scientists to prioritize their measurements for a given scientific objective.
As a conclusion, the relativistic advance of the periastron is a secular effect such that long time baseline increases the chance for its detection. On top of that, measurements at the astrometric turning points (closest approach, apoastron and right ascension turning point) are important. In the mid-term, this means that 2019 will be an important year since it corresponds to the right ascension turning point and the years between 2022 and 2026 will be important to measure precisely the motion of S0-2 around the apoastron.
4.4. Measurements on S0-2’s orbit optimized to improve our knowledge about the distance to the Galactic Center
In this section, we will focus on the distance to our GC (), and prioritize measurements to improve our knowledge of using the algorithm from section 2. For this reason, we fix both GR parameters to their relativistic value (i.e. to unity) so that the model contains 13 free parameters. The set of “existing measurements” is exactly the same as the one used in the previous section, i.e. astrometric and radial velocity measurements of S0-2 up to 2018. The set of future possible measurements consist in astrometric and spectroscopic measurements for each year between March and September (the yearly window for which GC measurements are feasible from Earth) on a daily basis. The uncertainty considered for the future measurements is of 0.2 mas for the astrometry and of 20 km/s for the radial velocity.
First, a similar behavior as for the advance of the periastron is observed: late measurements are always favored by the adaptive scheduling tool and long time baseline will increase our knowledge on . Therefore, we use the same strategy as the one described in the previous section: we run the adaptive scheduling tool by increasing the time baseline one year at a time, record the optimized measurements sequences but discard the observations related to the last year of the observational window. The result from such an analysis is presented in figure 6. The estimation of relies on both astrometry and radial velocity. The important RV measurements are located at closest approach and are bracketing closest approach. The important astrometric measurements are located at the right ascension turning point (maximum of the right ascension curve) and before closest approach. Of particular interest, this analysis shows that the 2019 astrometric measurements will be of prime importance to improve our knowledge about .
In order to assess the impact of the optimized scheduling, we compare the SNR between two schedulings extending over one orbital period: one regular sampling and one optimized scheduling using the algorithm from section 2. More precisely, we use the current existing measurements and compare two schedulings of 32 astrometric measurements and 16 RV for the next 16 years. The first scheduling consists in a “naive” scheduling for which 2 astrometric measurements and one radial velocity measurement are performed every year and the second scheduling results from our optimization such that the measurements are timely distributed following the results presented in the top left panel from figure 6. The SNR resulting from a regular sampling is of 287 while the SNR resulting from the optimized scheduling is of 380, a relative increase of 30%. Similarly to what has been discussed in sections 4.2 and 4.3, this shows the superiority of the scheduling obtained from the algorithm presented in section 2 over standard non-optimized and uniform scheduling and it can help scientists to prioritize measurements for a given scientific objective.
5. Conclusion
In this paper, we have presented a simple but efficient algorithm aiming at prioritizing measurements in order to reach a scientific objective. Identifying an optimized scheduling and the best type of measurements will become more and more important considering the increasing costs of measurements and considering the increasing pressure to obtain telescope time or time in different experimental facilities. The algorithm developed and presented in section 2 relies on the assumption that the measurements errors and the posteriors are Gaussian but can be applied to any type of measurements or observations. This adaptive scheduling tool is based on the computation of the Fisher matrix, on a very efficient technique to update it and on a greedy algorithm. As discussed in section 3, a greedy algorithm is not guaranteed to converge to the global optimal solution but it has the advantage to have a reasonable time complexity, which makes it relatively fast and which allows the user to consider very large set of possible future measurements. In addition, the three examples discussed in section 4 clearly shows the superiority of the optimized scheduling over non-optimized standard samplings (see also the discussion in Loredo et al. (2012)).
We have used the case of measurements of the short-period star S0-2 in our GC as an example to illustrate the type of analysis and results that can be obtained with this adaptive scheduling tool. In particular, we have shown that the measurements that increase the the relativistic redshift SNR are radial velocity measurements at the turning points of the radial velocity curve and not at the maximum of the signal, a surprising result that is explained by correlations. We also suggest a method to plan measurements to control systematics and to account for possible weather/instrumental losses. This detailed analysis led to the scheduling for the GCOI 2018 measurement campaign detailed in section 4.2, which resulted in a successful detection of the relativistic redshift in 2018 (see Do et al. (2019)).
In addition, we have identified sets of measurements that are particularly important to detect the relativistic advance of periastron on S0-2’s orbit. They consist of astrometric measurements at the astrometric turning points, in particular around periastron and apoastron. Finally, we have shown that both astrometry and radial velocity are important to improve our knowledge about , the distance to our GC. The important astrometric measurements are the ones corresponding to the right ascension turning point and before closest approach while the radial velocities adapted to measure are at closest approach and bracketing this epoch.
Similar analysis can be performed in various fields of physics and astrophysics ranging from Solar System measurements, exoplanets, gravitational waves and even laboratory measurements in order to help scientist to prioritize the type of measurements and their scheduling to increase their desired SNR in order to enhance the scientific return of different instruments and experiments.
6. Acknowledgements
Support for this work was provided by the the W.M. Keck Foundation (20170668), by the Heising Simons Foundation (2017-282) andby the National Science Foundation (AST-1412615). The authors warmly thank the anonymous referee whose comments helped to improve the manuscript.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Albrecht et al. (2006) Albrecht, A., Bernstein, G., Cahn, R., Freedman, W. L., Hewitt, J., Hu, W., Huth, J., Kamionkowski, M., Kolb, E. W., Knox, L., Mather, J. C., Staggs, S., & Suntzeff, N. B. 2006, ar Xiv Astrophysics e-prints
- 2Alexander (2005) Alexander, T. 2005, Phys. Rep., 419, 65
- 3Angélil & Saha (2010) Angélil, R. & Saha, P. 2010, Ap J, 711, 157
- 4Angélil et al. (2010) Angélil, R., Saha, P., & Merritt, D. 2010, Ap J, 720, 1303
- 5Boehle et al. (2016) Boehle, A., Ghez, A. M., Schödel, R., Meyer, L., Yelda, S., Albers, S., Martinez, G. D., Becklin, E. E., Do, T., Lu, J. R., Matthews, K., Morris, M. R., Sitarski, B., & Witzel, G. 2016, Ap J, 830, 17
- 6Bourgoin et al. (2016) Bourgoin, A., Hees, A., Bouquillon, S., Le Poncin-Lafitte, C., Francou, G., & Angonin, M.-C. 2016, Physical Review Letters, 117, 241301
- 7Bourgoin et al. (2017) Bourgoin, A., Le Poncin-Lafitte, C., Hees, A., Bouquillon, S., Francou, G., & Angonin, M.-C. 2017, Physical Review Letters, 119, 201102
- 8Do et al. (2019) Do, T., Hees., A., Ghez, A. M., Martinez, G., Chu, D., & et. al. 2019, submitted to Science
