Background modelling for $\gamma$-ray spectroscopy with INTEGRAL/SPI
Thomas Siegert, Roland Diehl, Christoph Weinberger, Moritz M. M., Pleintinger, Jochen Greiner, Xiaoling Zhang

TL;DR
This paper presents a new spectral background modelling method for INTEGRAL/SPI gamma-ray data, enabling robust analysis across various science cases by accounting for instrumental background variability.
Contribution
The authors develop a self-consistent background model based on long-term detector data, improving the accuracy of gamma-ray source analysis with INTEGRAL/SPI.
Findings
The background model adapts to different emission types and spectral ranges.
The number of parameters needed depends on the source and energy range.
The method yields nearly systematics-free results.
Abstract
The coded-mask spectrometer-telescope SPI on board the INTEGRAL observatory records photons in the energy range between 20 and 8000 keV. A robust and versatile method to model the dominating instrumental background (BG) radiation is difficult to establish for such a telescope in the rapidly changing space environment. From long-term monitoring of SPI's Germanium detectors, we built up a spectral parameter data base, which characterises the instrument response as well as the BG behaviour. We aim to build a self-consistent and broadly applicable BG model for typical science cases of INTEGRAL/SPI, based on this data base. The general analysis method for SPI relies on distinguishing between illumination patterns on the 19-element Germanium detector array from BG and sky in a maximum likelihood framework. We illustrate how the complete set of measurements, even including the exposures of the…
| Energy | Bin size | Type | BGRDB | Comments | |||
| 508–514 | 6.0 | 1 | 12587 | 24.3 | diffuse/line | o | inner Galaxy / also weak point sources |
| 1795–1820 | 0.5 | 50 | 92867 | 201.0 | diffuse/line | a | full sky |
| 2500–3500 | 1000.0 | 1 | 11029 | 21.6 | diffuse/continuum | a | inner Galaxy |
| 50–100 | 2.0 | 25 | 3771 | 7.2 | point-like/continuum | o | |
| 508–514 | 6.0 | 1 | 3771 | 7.2 | point-like/continuum | o | annihilation emission possible |
| 1790–1840 | 0.5 | 100 | 3771 | 7.2 | point-like/continuum | o | no expected |
| 2500–3500 | 1000.0 | 1 | 3185 | 6.0 | point-like/continuum | a |
| Scaling | Orbits | Pointings | Days | Hours |
|---|---|---|---|---|
Const |
||||
| DetFail | at 140 | at 5521 | at 1435.42 | - |
| at 214 | at 10251 | at 1659.46 | - | |
| at 775 | at 42443 | at 3337.50 | - | |
| at 930 | at 52239 | at 3799.67 | - | |
Anneal |
60–70 | 1600–4500 | 170–210 | 4000–5000 |
30n |
30 | 1450–2150 | 90 | 2160 |
20n |
20 | 950–1450 | 60 | 1440 |
10n |
10 | 400–800 | 30 | 720 |
5n |
5 | 190–410 | 15 | 360 |
3n |
3 | 110–250 | 9 | 216 |
2n |
2 | 70–170 | 6 | 144 |
1n |
1 | 50–90 | 3 | 52–72 |
24p |
1/3 | 24 | 3/4–1 | 16–24 |
12p |
1/6 | 12 | 3/8–1/2 | 8–12 |
6p |
1/12 | 6 | 3/16–1/4 | 4–6 |
3p |
1/24 | 3 | 3/32–1/8 | 2–3 |
2p |
1/36 | 2 | 3/48–1/12 | 1–2 |
1p |
1/72 | 1 | 3/96–1/16 | 0.5–1 |
| Case | |||||
|---|---|---|---|---|---|
| Energy range | [keV] | ||
|---|---|---|---|
| 50–100 keV | |||
| 1790–1840 keV | |||
| Full range |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
11institutetext: Max-Planck-Institut für extraterrestrische Physik, Gießenbachstraße, D-85741 Garching, Germany
22institutetext: Excellence Cluster Universe, Boltzmannstraße 2, D-85748, Garching, Germany
Background modelling for -ray spectroscopy with INTEGRAL/SPI
Thomas Siegert E-mail: [email protected] modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPI
Roland Diehl Background modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPI
Christoph Weinberger Background modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPI
Moritz M. M. Pleintinger Background modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPI
Jochen Greiner Background modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPI
Xiaoling Zhang Background modelling for -ray spectroscopy with INTEGRAL/SPIBackground modelling for -ray spectroscopy with INTEGRAL/SPI
(Received December 19, 2018; accepted February 25, 2019)
Abstract
*Context. *The coded-mask spectrometer-telescope SPI on board the INTErnational Gamma-Ray Astrophysics Laboratory (INTEGRAL) records photons in the energy range between 20 and 8000 keV. A robust and versatile method to model the dominating instrumental background (BG) radiation is difficult to establish for such a telescope in the rapidly changing space environment.
*Aims. *From long-term monitoring of SPI’s Germanium detectors, we built up a spectral parameter data base (Diehl et al., 2018), which characterises the instrument response as well as the BG behaviour. We aim to build a self-consistent and broadly applicable BG model for typical science cases of INTEGRAL/SPI, based on this data base.
*Methods. *The general analysis method for SPI relies on distinguishing between illumination patterns on the 19-element Germanium detector array from BG and sky in a maximum likelihood framework. We illustrate how the complete set of measurements, even including the exposures of the sources of interest, can be used to define a BG model. The observation strategy of INTEGRAL makes it possible to determine individual BG components, originating from continuum and -ray line emission. We apply our method to different science cases, including point-like, diffuse, continuum, and line emission, and evaluate the adequacy in each case.
*Results. *From likelihood values and the number of fitted parameters, we determine how strong the impact of the unknown BG variability is. We find that the number of fitted parameters, i.e. how often the BG has to be re-normalised, depends on the emission type (diffuse with many observations over a large sky region, or point-like with concentrated exposure around one source), and the spectral energy range and bandwidth. A unique time scale, valid for all analysis issues, is not applicable for INTEGRAL/SPI, but must and can be inferred from the chosen data set.
*Conclusions. *We conclude that our BG modelling method is usable in a large variety of INTEGRAL/SPI science cases, and provides nearly systematics-free and robust results.
Key Words.:
Gamma rays: general; Methods: data analysis; Techniques: spectroscopic
1 Introduction
Since 2002, the spectrometer SPI (Vedrenne et al., 2003) on board the INTErnational Gamma-Ray Astrophysics Laboratory (INTEGRAL satellite Winkler et al., 2003) has been observing astrophysical high-energy phenomena in the hard X-ray and soft -ray range between 20 keV and 8 MeV. In this energy range, measured -ray spectra are dominated by instrumental background (BG) photons. These originate mainly from nuclear de-excitation reactions and continuum processes, such as bremsstrahlung, of the instrument and satellite material, being exposed to cosmic-rays (CRs). There is no stand-alone BG model available to deal with SPI observations in an extensive, consistent, and elaborate way. In this paper, we show how to construct a self-consistent BG model for SPI data analysis from a data base of spectral parameters over the INTEGRAL mission years.
The spectra in each of the 19 high-purity Ge detectors of SPI can be characterised by a large number of instrumental -ray lines on top of a broken power-law shaped continuum. The main features of the spectra among detectors, energy, and time stay constant and change gradually according to solar activity and detector degradation. Diehl et al. (2018) illustrated how the SPI spectral BackGround Response Data Base (BGRDB) is created, maintained, and checked for consistency over the entire mission time. In general, the BGRDB contains fitted spectral parameters per detector and time. The time integration is chosen as either one INTEGRAL orbit of three111In early 2015, the INTEGRAL spacecraft performed several orbit adjustment manoeuvre, to safely bring the satellite back to Earth in 2029. For this reason, the INTEGRAL orbit is now merely 2.7 days long, and will decrease with ongoing mission time. days, or the time between two detector restoration222About twice a year, the lattice structure of the SPI Germanium detectors is repaired from cosmic-ray radiation by heating up the camera for several days to . This is called an annealing. periods, which is typically half a year. These different time scales are a key issue for the understanding and the construction of a -ray telescope BG model - to record sufficient BG data for reliable fits to the spectral, and to be able to trace gradual changes in the spectral response.
This paper is structured as follows: In Sec. 2, we describe, how SPI data analysis is generally treated (Sec. 2.1), and describe the parts of the SPI BGRDB which are required for BG modelling in more detail. Based on this, we explain how we construct a self-consistent BG model in Sec. 3, with full algorithm detail (Sec. 3.1) and a discussion about the underlying foundation (Sec. 3.3). In Sec. 4, we show how to evaluate the BG model fit adequacy for different test cases for point-like and diffuse emission (Sec. 4.1), which are then further discussed considering their temporal BG behaviour (Sec. 4.2) in Sec. 5.
2 SPI data analysis
2.1 General method - fitting time series
SPI data analysis and source flux extraction is performed for individual energy bins. Each energy bin333The smallest public available bin size in SPI data analysis is 0.5 keV. The chosen bin width for the actual data set, however, is arbitrary. To increase the statistics and to reduce the time for the analysis at the same time (number of bins), for example, the bin width can be increased. Note, that spectral information is lost when large energy bins are used. is treated separately, even though the bins are connected, e.g. by instrumental resolution. This dependence is taken into account when the BG model is built.
The counting of photons into one energy bin over time obeys the Poisson statistics. From this, the likelihood, , can be calculated, given the data set, , and the model parameters . The data in SPI per energy bin is structured as a vector of magnitude , where would be equal to 19 if all individual detectors444SPI also records events which occur on multiple detectors within a short time, classified as double, triple, and high-order detector hits, which are individually saved in SPI data sets. of SPI are used, and equals the number of observations, , in a specific data set.
A pointing, , is an observation unit for a specific amount of time, , typically 0.5 to 1.0 h, and a specific region in the sky, marked by the galactic longitude and latitude . The satellite is staring during this time at this position, and is recording data. Depending on the type of observation (diffuse/point-like; persistent/transient), pointings are observed over larger regions in the sky, or only around the target source. Sub-pointing time intervals, e.g. for gamma-ray bursts, can also be analysed. The likelihood is given by
[TABLE]
where is the model to describe (to be fitted to) the data per pointing . The log-likelihood (C-stat) is then given by
[TABLE]
We will use the likelihood and related values (such as C-stat or -values) and the degrees of freedom (dof: number of data points minus the number of fitted parameters) in a maximum likelihood fit as a goodness-of-fit criterion for our BG modelling method in Sec. 4.
SPI data are typically dominated by instrumental BG555Only in a few cases during the INTEGRAL mission, transient sources appeared which out-shined the BG. One example would be the microquasar V404 Cygni (e.g. Rodriguez et al., 2015; Roques et al., 2015; Siegert et al., 2016a; Jourdain et al., 2017) during revolutions 1554–1557., so that a subtraction in neither spectral, nor in the -dimension would provide reasonable results. Furthermore, there is no absolute BG model for SPI as there are variations in all dimensions which change according to the unpredictable solar activity. The goal is hence to obtain a description of the BG, from the data itself, but independent from the celestial source of interest. The key to such a BG model is found in the way, the data is modelled, and how the different model terms (can) influence each other:
[TABLE]
Here, is the k-th of sky images (celestial emission models) to which the instrumental response function (IRF, coded-mask shadowing), , is applied for each pointing and image element (e.g. pixel), . The BG models are independent of the IRF in each observation, which means the background is assumed completely independent from the shadowing of the mask and therefore independent of spacecraft repointings. Both model parts, sky and BG, can depend on time, and on different scales, for celestial sources, and for the BG. While the BG timing depends on the instrument materials, being activated and decaying on various time-scales as a consequence of solar activity, the variability of the sky emission is only subject to the physics of the sources themselves. This means that the change of the BG amplitude, , depends on the process which resulted in the BG -ray photon. This may change on very short time scales, e.g. prompt emission after an intense dose of CR particles from a solar flare, or very long scales, e.g. from a radioactive build-up, when the creation of radioactive material happens on shorter time-scales than the decay time.
2.2 Spectral background and response data base
The SPI BGRDB by Diehl et al. (2018) spans the energy range between 20 and 2000 keV per INTEGRAL orbit, and consists of parameters for 383 -ray lines, each modelled with four parameters, and two continuum parameters, depending on the energy666This includes both data formats, the ”single events” data (SE) from 20 to 1392 keV and from 1745 to 2000 keV, as well as the ”pulse-shape-discriminated” data (PSD) from 490 to 2000 keV.. In the energy range between 1 and 8 MeV777Above 2 MeV, the public available data are termed ”high-energy events” (HE), for which the smallest energy binning is set to 1 keV, instead of 0.5 keV for the lower energies., the count statistics drops rapidly towards larger energies, so that longer integration times are required with increasing energy to obtain robust fitting results. For the BGRDB above 2 MeV, the time between two annealing periods is used to fit 614 -ray lines on top of a broken power-law-shaped continuum (Weinberger et al. 2019, in prep.). The low- and high-energy bands were subdivided into smaller energy regions, , to limit the number of fitted parameters per fit.
As a function of energy, , the SPI spectra, , per detector , orbit , and energy range , are described by
[TABLE]
Here, and describe the shape of the -ray continuum and lines with the parameter tuples and , respectively, where is the index of the i-th -ray line in the band . The parameters and are the amplitude and the spectral index of the power-law function , normalised at the pivot energy . This shape is motivated from the superposition of many power-law-like continuum processes in the satellite:
[TABLE]
The parameters , , and represent the amplitude, centroid, and width of a symmetric Gaussian function, with accounting for the detector degradation. The line shape that results from cosmic-ray bombardment is physically motivated (Kretschmer, 2011): As the regular lattice structure is deteriorated, the electronic structure changes, and charge carriers may be temporarily trapped. This leads to a time delay in the charge collection process, and thus in the charge pulse of the read-out electronics. The non-trapped portion of a charge cloud is proportional to (Debertin & Helmer, 1988), where is the distance to Ge detector electrode, and is the absorption coefficient. Since these parameters cannot be determined independently, a combined degradation parameter is used. The convolution of a symmetric Gaussian, , with an exponential tail function, , leads to an asymmetric line shape, (see also Eqs. (3)–(6) in Diehl et al., 2018):
[TABLE]
Diehl et al. (2018) finds that there are families of -ray lines in the instrumental background of SPI, which are characterised by their detector patterns. The authors define several groups of lines, of which we recap the most prominent: The ”Ge-like” lines show higher count rates for inner detectors (–) with respect to outer detectors (–). These include excitation of nuclei in the SPI camera, such as Ge and Al, but also related and produced isotopes, such as Ga, Zn, and Mg. ”Bi-like” lines show the opposite detector pattern, as their main source are in the materials of the SPI anticoincidence shield, made of bismuth-germanate (BGO). Related isotopes in the vicinity of Bi (e.g. Pb, Ra, Tl) and also isotopes from the actinide alpha-decay chains (e.g. Ac, Th, U) show this behaviour. To a lesser extent, also material from mountings (Ti, V), wires (Cu, Co, Fe), and other instruments aboard INTEGRAL (Cd, Te, Cs) show this pattern.
The most important finding from monitoring the long term trends in this spectral response data base is, that the patterns of individual isotopes stay constant on all time scales. This also means that isotopes which produces different line energies show the same pattern. Only detector failures change this behaviour - but then, the patterns are again constant. This is the prime assumption of how a self-consistent background model is constructed, and is further explained in Sec. 3.1. Since the spectral shape changes with time (see above), the detectors may respond differently to this degradation, and the prompt and delayed background emission is different for different energies, the SPI BGRDB includes the information to reconstruct the expected spectral response of the SPI camera and predict the instrumental background as a whole at any point in time in high spectral resolution.
3 Construction of the background model
3.1 General approach
At each specific energy, , we model the BG by two components: photons from continuum processes, and from one (or more) -ray line-producing processes. In narrow energy bins, i.e. less than a few times the instrumental resolution888Between 20 and 8000 keV, the instrumental resolution is determined from narrow background lines, and ranges from 1.7 to 8.5 keV., we construct the BG by a combination of the continuum and the lines, i.e. combining all continuum-like processes and line-like processes to two individual BG models. Depending on the energy, either of the BG components may dominate, Sec. 5.2 (Fig. 6). Per each pointing, , the factor changes for each sky component , i.e. the detector pattern - which detectors are illuminated and how strong - changes with time / pointing. This is illustrated in Fig. 1a for the standard INTEGRAL observation scheme of a point-like source.
On the other hand, for a specific physical process inside the satellite, the detector patterns from the BG, , do not change. For our two BG model components, for the continuum, and for the lines, hence follows: and . What might change as a function pointing (=time), , are the amplitudes and . Therefore only these are determined in the maximum likelihood fit, following Eqs. (2) and (3), see also Secs. 3.2 and 4.2.
Following these considerations (see also Diehl et al., 2018, Sec. 2.4), the SPI BG model for a particular detector, , at a specific pointing, , and energy bin, , is written as
[TABLE]
In Eq. (3.1), the BG is factorised into the fitted amplitudes for continuum and lines, and , respectively, the spectral parts, and a temporal part. Here, individual pointings may be scaled by the same amplitude parameter (see Sec. 4.2). For the lines component, we sum over the individual lines , which contribute to one energy bin, . In principle, each line amplitude could be fitted individually, but this is only robust for strong BG lines. The spectral parameters per pointing, and , depend on in which revolution the pointing occurred, hence 999This is true because the SPI BGRDB was created on an orbit time scale. This can be generalised to arbitrary time bins in spectral parameter data bases, e.g. above 1 MeV where the parameters are being determined on ”annealing” time scale, i.e. the average over half a year., and are fixed to the values from the SPI BGRDB in the maximum likelihood fit.
The detector pattern per energy and process is included as each detector is assigned a specific value according to the spectral response at that time and energy. Thus, and completely determine the spectral as well as detector ratio information required for BG modelling. The tracer functions and then allow for a relative weighting between pointings for continuum and line processes, respectively. These are parts of the specific BG modelling per science case, and thus are independent from the SPI BGRDB, hence do not follow . In general, each process can have its own variability, being prompt from cosmic-ray excitation and instantaneous de-excitation, or delayed when there is a longer lifetime of the produced isotopes included. The latter can lead to long-lasting radioactive decays, e.g. after a strong solar flare which created a lot of radioactive material (e.g. with a half-life time of 16 days), or to radioactive build-up when the decay-time is considerably larger than the production rate (e.g. with a half-life time of 5.27 years, cf. Diehl et al., 2018). The long-term trends are traced already by the SPI BGRDB as the amplitudes are determined on three-day time scales (Sec. 3.2), so that the remaining prompt BG emission processes at each energy can be traced by one temporal function, i.e. .
3.2 Temporal behaviour
At individual energies, i.e. for small energy bins - not integrating over the energy range of the physical process, the instrumental -ray line BG patterns, , become a function of energy because of detector degradation. This does not mean that the physical processes change - the pattern per process is constant - but the pattern at a particular energy can change. This is caused by the degradation of individual detectors on different time scales due to their individual constitutions, behaviours, and reactions to particle irradiation. These changes in detector patterns versus energy and time are taken into account by using the SPI BGRDB. This data base traces the small but important changes, as it has been created on a three-day time scale for energies below 2 MeV (Diehl et al., 2018). For higher energies, i.e. lower BG rate, the BG count statistics is rarely sufficient to determine the gradual change in degradation, so that a mean value over half a year is determined.
The SPI BG amplitude of a certain instrumental process is in general unpredictable. In SPI data analysis, these BG variations are approximated at first order by ”tracers”, i.e. rates of onboard radiation monitors, such as the SPI anti-coincidence shield count rate, or the rate of saturating Ge detector events ( MeV; GeDSat). These rates trace the cosmic-ray particle flux that leads to instrumental -ray BG. The variations of these BG time series during one orbit are of the order of 1%, i.e. from pointing to pointing.
These approaches work well for all energy bins, i.e. as small as 0.5 keV or even 1 MeV continuum bands. But it is important to note that the described tracing relies on the prompt effect of cosmic-ray irradiation, which is true for many, but not all BG-generating processes in the satellite. Any BG process on longer scales is not represented by such a tracer, but instead is already implemented in the SPI BGRDB. In the special case of mid-term radioactivities, i.e. longer than several pointings, solar-flare-induced background lines or radioactive build-ups may be traced directly by the respective exponential decay law with the isotope’s characteristic decay time (cf. Fig. 15 of Diehl et al., 2018, showing the examples and ). Thus instead of relying on independent rates, the natural time scale of the process can also be used.
This tracing provides a good first-order description of the BG model variation on shorter time scales below one orbit and down to pointing-by-pointing. When the BG is fitted to the data per energy bin, however, the appropriate BG re-scaling has to be determined. This depends on energy, because the average BG count rate changes with energy due to different contributions, and strengths of continuum and lines, and the different intrinsic time-scales of the processes, be they prompt or delayed. This is discussed in Sec. 4.2.
With the experience of hundreds of analysed X- and -ray sources during 16 years of the INTEGRAL mission, in different sky regions and -ray energies, it has become evident that for a large energy range between and keV, the saturating Ge detector events (GeDSat) are sufficient to trace the inter-pointing variations. The 511 keV BG line, for example, follows strongly the rate of the side-shield assembly of the SPI anticoincidence shield (SSATOTRATE Skinner et al., 2014; Siegert et al., 2016b). Below 200 keV, the BG rate is very high and can often be determined on a pointing time-scale, so that no tracer is required at all if the source is also strong. Note that GeDSat does not necessarily explain the full variation from one pointing to the next, and requires re-scaling in our method. Likewise, at the steps of the SPI BGRDB, the BG must be carefully investigated and re-scaled (Sec. 4.2).
3.3 Discussion - why does this work?
In a single pointing, the mask patterns for sources in the field of view can be very strong, so that a determination of detector patterns from the BG (”BG response”) is not possible on this time scale. Off-observations, for example at high latitudes, can be used if broad energy bins are analysed (continuum sources). For detailed spectroscopy in small energy bins, however, the BG patterns change smoothly with time due to detector degradation so that the actual spectra should be used to determine the BG. If many pointings of the same observation are combined, the source imprints due to the mask smear out, so that the resulting spectra per detector for a longer time scale appear as due to BG only. In Fig. 2, we show how smears out for a point source, observed with a -dithering strategy of INTEGRAL. For a source contribution of 10% (90%) to the total recorded spectrum, the cumulative pattern of BG plus source is varying by less than 1% (10%) after only 20 pointings. During one INTEGRAL orbit, 50–90 pointings are performed, so that the contribution of a residual mask pattern from even strong sources would be less than 1%. This means that the SPI BGRDB (Diehl et al., 2018) is mostly free of source contributions of any type, and that in this way, a BG model for single pointings can be constructed.
The detector patterns from instrumental BG continuum, , are nearly flat around a value of 1.0 (Diehl et al., 2018). The limit, , also evolves to a constant value of 1.0 across all detectors, so that any additional source contribution appears in the time-integrated spectra as an increased BG continuum. For individual pointings however, as analysed in a data set, thus, the amplitude of any source contribution can be determined in the fit because the expected patterns from BG and sky are now clearly different. The maximum likelihood fit including BG continuum, BG lines, and sources, will, as a consequence of this procedure, scale down the continuum to ”give way” to possible source counts.
As a result, the SPI BGRDB, fitted to the data, provides already a good first-order BG model for any INTEGRAL/SPI observation. For detailed and high-resolution spectroscopy of both continuum as well as -ray line sources - especially at fine energy binning - the characteristics of the parameter values in the data base must be investigated further to ensure consistency of the spectral description, in order to build a stable and robust BG model (see also Secs. 3.2 and 5):
In the case of partial coding of sources during one INTEGRAL orbit, the relative counts of detectors at the rim of the SPI camera (parts of the outer detectors) may be increased on a per cent level (see above), so that the derived background patterns may be slightly skewed. This can also happen with strong sources in the fully coded field of view, leading to residual ”average patterns” from these sources. The additional pattern, which is mixed with the true background pattern in the SPI BGRDB, is the sum of all detector responses from sources seen by SPI in a particular orbit (e.g. the sum of all patterns in Fig. 1a). This can introduce weak coding in the partially coded field of view, but has marginal effects on the known sources. It affects mainly the energy range below , and can be accounted for by adding the expected ”average pattern” as a third background model component.
4 Evaluation of the background model performance
If the fluxes of the analysed sources are known, for example from previous analysis, or can be expected from theoretical considerations, it is possible to estimate the contribution of source counts in the data set. Consequently, the fit adequacy can be investigated, if the sources would be ignored. In general, the covariance between BG and source amplitude parameters is nearly zero, however will turn to negative values if the source contribution is weak or absent. This is shown and explained in Appendix A, Figs. 7–8c, for the case of a line signal at 1809 keV (see also Sec. 5.1.2). As a consequence, the BG (time) scaling and the source contributions (also their time scaling if needed) should be optimised simultaneously.
As an absolute figure-of-merit to determine a fit’s quality, we use the ”modified -statistics” (Mighell, 1999) as defined by
[TABLE]
As an alternative to and to distinguish between models, we use the Akaike Information Criterion (AIC, Akaike, 1974; Cavanaugh, 1997). The AIC is defined as
[TABLE]
This figure-of-merit is similar to the reduced -value by penalising models with more parameters. Using relative likelihoods (absolute AIC differences, ), we identify on which time basis the BG has to be re-adjusted in the different cases of Sec. 5. Note, that its absolute value has no proper meaning (Burnham & Anderson, 2004). In general, the smaller the AIC, the more preferred a model is. It accounts for the appropriate likelihood of the data generating process, so that testing different assumptions on the BG (and source) scaling provides the most adequate number of BG (and source) parameters (Sec. 4.2). The and AIC values are calculated from the best-fit -value for reference. In general, one should avoid to rely on (reduced) estimations when dealing with Poisson-distributed data, however the -statistics is specifically adapted for this case (Mighell, 1999), and can provide a general measure.
4.1 Data sets
In order to illustrate different applications of the BG modelling method, we choose four energy regions to perform our analysis towards point-like and diffuse emission. We demonstrate the performance of our method on line-like emission at 511 keV, binned into one energy bin between 508 and 514 keV, as would be used for imaging. The second-strongest -ray line at 1809 keV of decaying is analysed between 1795 and 1820 keV in 0.5 keV bins to also show a fine-binned but low-statistics case. Analysis of diffuse emission in a continuum band is presented between 2.5 and 3.5 MeV, summed into one bin of 1 MeV. For point-like emission, we also use a low-energy band, from 50 to 100 keV in 2 keV bins. We make use of Crab observations, performing tests in the same energy bands as for diffuse emission. A summary of the data sets and on which time basis the BGRDB is used, is found in Tab. 1.
4.2 Background changes with time
As described above, the relative weighting of background amplitudes between individual pointings can be determined by a tracer, but which may not fully cover the actual variations. In the following, we use pre-defined time steps to re-scale the BG model, as shown in Tab. 2, and judge which scaling, i.e. how many fit parameters, are preferred.
In the case of diffuse emission, for instance, the expected detector pattern changes more smoothly than that for a point source. In addition, INTEGRAL/SPI data sets as they are typically analysed for diffuse emission cover very large areas in the sky, of which many are expected to include little or no source flux but only instrumental BG. For example, is distributed dominantly in the plane of the Galaxy, but high-latitude observations are included in the data sets to get a better leverage on the absolute BG model. The data sets for large-scale diffuse emission thus include as many data/pointings as possible. In return this means, that the BG has to be re-scaled whenever emission from the sky - in certain regions, given by the emission models - is expected to change its contribution to the total by a significant amount. This could also be, when observing the same sky region at different mission times with different background levels. As a consequence, the sensitivity for diffuse emission with respect to analyses of point sources is reduced.
If a particular point source is investigated, the data sets are typically constrained to the region in which the source is located. When this same source is monitored from time to time, the BG level should be adjusted accordingly - not because the source flux might have changed101010Of course, the source flux can also change, e.g. in the case of X-ray binaries. This just means that the fit is not stabilised by a constant source and varying BG any more, but rather independent for individual times., but because the BG level varies on longer time scales (e.g. solar cycle).
5 Test cases
In the following Sec. 5.1, we will present the different diffuse emission cases and explain the choice of parameters. In Sec. 5.2, we illustrate the findings for one point source, and highlight the differences as mentioned above.
5.1 Diffuse emission - large data sets
5.1.1 Positron annihilation emission in the bulge
The 511 keV emission in the Milky Way from the annihilation of electrons with positrons is concentrated in the bulge region (see Prantzos et al., 2011, for a review). Here we choose only INTEGRAL observations in a central area around with an extent of . Due to the large field of view of SPI, also emission regions out to and are included, and have to be modelled. We use the best-fitting emission model from Siegert et al. (2016b) to characterise the 511 keV morphology in the bulge. This model consists of four 2D-Gaussian-shaped templates to represent the bulge and the superimposed disk. Here, we combine the individual components into one map.
Clearly, a minimum is found at a BG time-scale of three days, i.e. one INTEGRAL orbit (Fig. 3a). Between a time scale of 8 h and the annealing time scale (12p–Anneal), the measured flux shows only slight variations, indicating adequate and stable fits (Fig. 3b). Very long (DetFail / Const) and very short ( h; 1p–6p) time scales, on the other hand, provide bad fits or over-fit the data, respectively, which leads to false flux values. For the optimum AIC at 3 d ( fitted BG parameters; 1n), the 511 keV flux is determined to , which is consistent with previous measurements considering a narrow and a broad bulge component, together with a disk in this region (e.g. Skinner et al., 2014; Siegert et al., 2016b).
The reduced -value at the optimal AIC is 0.9927 for 203184 dof. This is from the canonically desired value of 1.0, and thus not ”over-fitting” the data. In general in the 511 keV case, the fit, considering the -statistics, is always acceptable: Between the Const BG model with 2 parameters and a fit per each individual pointing with 25172 parameters, the reduced changes between 1.0111 and 0.9881, i.e. and from an optimal -fit value of 1.0. The BG scaling closest to is found at 2n, i.e. one BG parameter every second INTEGRAL orbit (approximately six days). The minimum reduced- value is found for a fit per pointing, 1p (0.5–1.0 h), but which is largely penalised by the AIC, and thus represents an over-parametrised fit.
5.1.2 Full-sky emission of radioactive
The radioactive isotope is produced in massive stars and ejected by winds and core-collapse supernovae. With a half-life time of 717 kyr, traces the ongoing nucleosynthesis in the Milky Way. These nuclei decay via -decay to an excited state of , which is short-lived (476 fs) and de-excites to the stable ground state of by the emission of a keV -ray photon (cf. Oberlack et al., 1996; Plüschke et al., 2001; Diehl et al., 2006, 2010; Kretschmer et al., 2013; Bouchet et al., 2015; Siegert, 2017, for example). In this work, we use the Maximum Entropy Map from COMPTEL (Plüschke et al., 2001) for our sky model to determine the BG scaling.
Here, we use a 0.5 keV binning to resolve the 1809 keV -ray line between 1795 and 1820 keV. This allows us to show that the optimal BG scaling depends on energy, as the BG intensity rises when strong instrumental BG lines have to be taken into account. In Fig. 4, we show the result of our BG analysis in the case for diffuse emission. While the energy bins containing the -line and a complex of strong BG lines (green), require a BG scaling between 30 d and half a year (30n–Anneal), the neighbouring continuum is sufficiently well fitted by only 1 to 29 parameters per BG component (Const–Anneal). Note, how also the weak instrumental BG line around 1797 keV is captured by our BG modelling method. In Appendix B, we show the same analysis, based on the minimal reduced -value in each energy bin. The major difference between the AIC and the profiles is in the erratic behaviour among bins, and the large spread in the choice of the optimal number of parameters. The line contribution also favours larger number of parameters - similar to the AIC analysis. The number of parameters in the line is very large (5396), for which reason the errors bars on the derived fluxes are much larger in the optimal -case than in the optimal AIC case. We compare the spectral parameters of the region in Tab. 3. All parameter are consistent, except for the line width, being as small as the instrumental resolution in the optimal -case, and kinematically broadened for the optimal AIC case. Especially the 1.8 MeV line flux is consistent with previous studies of the full sky (e.g. Bouchet et al., 2015; Siegert, 2017).
5.1.3 Diffuse soft -ray continuum
Above keV and up to several tens of MeV, the diffuse continuum emission in the Milky Way is dominated by inverse Compton scattering (Strong et al., 2005), with contributions from bremsstrahlung and positron annihilation (Beacom & Yüksel, 2006; Knödlseder et al., 2007; Strong et al., 2010; Lacki et al., 2014). Here, we use a continuum band between 2500 and 3500 keV to illustrate how our BG modelling method performs for diffuse continuum emission. We combine all photon events into one large energy bin of 1 MeV binwidth to increase the photon number statistics, and to illustrate the capabilities of our method in a broad band analysis.
For our analysis, we use the central part of the Milky Way, similar to Sec. 5.1.1. Because the ”high-energy” data sets of SPI, between 2 and 8 MeV, are very sensitive to solar activity, we removed periods in which solar flares strongly imprint and enhance BG intensities of nuclear de-excitation lines, such as at 2.2 MeV, at 4.4 MeV, or at 6.1 MeV. Consequently, the high-energy data sets are smaller than in the 511 keV case. We use an inverse Compton template from GALPROP (Strong et al., 2011) to fit the morphology in the 2.5–3.5 MeV band.
The optimal AIC solution is found at a re-scaling time of h (12p), corresponding to 2630 BG fit parameters for 176537 dof (Fig. 5). The reduced value at this point is 1.0107, away from the desired -solution. Based on the reduced -value only, the optimal solution would be found at a 1 h BG scaling (1p), but which is clearly disfavoured by the AIC, penalising too many fit parameters. The fitted flux values at MeV for acceptable fits range between and . From these fits, a detection significance between and is found. At the optimum AIC, the diffuse continuum flux, based on the used inverse Compton template, is determined to . This is consistent with measurements from COMPTEL (Strong et al., 1999), considering the different solid angles covered in the analyses.
5.2 Point source emission - small and partitioned data sets
We use the Crab observations of INTEGRAL/SPI from 14 mission years. As a comprehensive example to be compared to the above cases, we show the energy regions from 50–100 keV in 2 keV bins, the positron annihilation line region between 508 and 514 keV in one 6 keV bin, the region between 1790 and 1840 keV in 0.5 keV binning, and one 1 MeV bin between 2.5 and 3.5 MeV. Unlike in the diffuse emission examples, the Crab should only show continuum emission, and no -ray lines111111Emission lines from decaying could be expected but are below the sensitivity limit for SPI.. Similar to the high-energy case for diffuse emission, we apply special selection criteria to the Crab observations to avoid strong solar activity, for which reason also the Crab data set above 2 MeV is smaller than below. The choice of energy regions allows for direct comparison of the diffuse and line emission cases.
In Fig. 6,we show the results of our BG modelling method for the Crab and the time scaling analysis in four different bands with varying energy bin size. In the energy range, between 50 and 100 keV, it is evident that there is a large difference in BG re-scaling with time, as the BG flux changes rapidly with energy. Especially below and above keV, the instrumental BG line contributions dominate over the instrumental BG continuum, so that the BG scaling time is optimal between 4 and 8 h (6p–12p). In the continuum-dominated region, 3 d (1n) scaling and thus fewer parameters are enough for an optimal BG scaling. The resulting spectrum in this energy band is smooth and power-law like. Similar to the diffuse emission in the centre of the Galaxy, the BG amplitude in the 511 keV line band should be fitted every three days. There is no enhancement of a possible narrow 511 keV line in the spectrum, as determined from the spectral fit over all analysed energy bands (see below). Likewise, the energy band between 1790 and 1840, in which emission from at 1809 keV could be expected, is sufficiently scaled once between each detector failure or annealing period (DetFail–Anneal) in the continuum, and every 60–90 d (20n–30n) in the comparably strong instrumental line complex between 1805 and 1813 keV. No line is detected, which reinforces our method, being able to systematically suppress instrumental BG lines, and only reveals the source spectrum. At higher energies, between 2.5 and 3.5 MeV, the Crab flux is about the same order of magnitude as the diffuse emission in the galactic centre. However, the emission is concentrated into one point, so that the re-scaling must be less often than in the galactic diffuse case - here only once per three days (1n). Also here, the difference in the uncertainties is evident between diffuse emission and point-like emission. While the galactic centre shows a flux uncertainty of , the Crab emission is uncertain by , even though the effective exposure times are about three times larger in the galactic centre (25 Ms) with respect to the Crab (7.5 Ms).
We check for consistency in our analysis by fitting the Crab spectrum, using three different energy bands: (a) between 50 and 100 keV, (b) between 1790 and 1840 keV, or (c) all data points between 50 and 3500 keV. The fitting results are shown in Tab. 4, demonstrating consistency between the analyses, and further substantiating our BG modelling method. The Crab spectrum is determined to be between 50 and 3500 keV with a reduced value of 1.32 for 165 dof. This is consistent with previous SPI results reported for the Crab (Jourdain & Roques, 2009, with 1.2 Ms of observation time), and reduces the uncertainties by the increased amount of exposure.
6 Conclusion
In this work, we showed, how to construct BG models for INTEGRAL/SPI data from a data base of spectral parameters which characterise the response as well as the BG properties of individual components of the instrument. In particular we illustrate our method on a few characteristic examples, including point-like, diffuse, continuum, and -ray line emission.
Our method is based on long-term monitoring and understanding of instrumental BG, which is separated into continuum BGs and -ray line BGs. These physical components show individual, but characteristic detector patterns on the SPI detector array, and are constant over time. For individual energies (energy bins, not integrating over a -ray line, for example), the detector patterns change smoothly with time due to the change in their responses, mainly caused by detector degradation and the influence of solar activity. This is taken into account by our detailed spectral fits of INTEGRAL/SPI data per unit time (Diehl et al., 2018).
The time basis for these spectral fits must be chosen:
long enough to smear out residual celestial patterns which are imprinted by the coded mask, 2. 2.
short enough to trace gradual variations of detector responses, and 3. 3.
long enough to accumulate enough statistics for reliable fits.
Point 1 is fulfilled after 15–25 INTEGRAL pointings, depending on the strength of the source (Fig. 2). We note that a correlation of our BG model with celestial emission is to be minimised by the way the BG is re-normalised in time, and want to point out that the more pointings are added for determining the BG response, the smaller the correlation will be. For example, a source contribution of 10% (50%) is smeared out to less than 1% (5%) after a standard -dithering pattern. This means one INTEGRAL orbit (50–90 pointings) would smear out the expected detector pattern of a single point source almost completely ( for any source strength; for typical source contributions of less than 10%), so that the correlation is minimal. At the same time scale, we find significant changes in the detector response (-ray line shape parameters) for the strongest -ray lines, so that point 2 is also fulfilled using one INTEGRAL orbit, or three days. Depending on the energy, i.e. BG count rate, the spectral fits are adequate (point 3) on a time scale of one orbit below energies of about 1–2 MeV. Above this energy, a spectral fit per detector requires more statistics to provide robust results. We choose the time span between SPI detector annealings to determine spectral parameters above 1–2 MeV. This reduces the accuracy for the line broadenings within half a year, but this can be accounted for by correcting for a linear trend using the orbit-by-orbit fits if required. We find, however, that fits per half-year period provide accurate results, as shown for the -case in (diffuse) line emission. If continuum sources are investigated and broader spectral bands are analysed, a spectral data base per orbit is also useful.
We show that our BG modelling method is reproducible and also results in values for celestial sources which are in concordance with literature values. The optimal choice of BG re-scaling as a function of time, i.e. the number of parameters at each energy, depends on the emission type (diffuse or point-like), the spectral regime (low-energy or high-energy; dominated by instrumental continuum or lines), and the width of the energy bins analysed. We provide examples of how to determine adequate fits for INTEGRAL/SPI analysis, based on the Akaike Information Criterion. This penalises too many and disfavours too few parameters, and we compare the results to reduced -values, which are commonly used in such analyses, but should be taken with care when using Poisson-distributed data with a low mean count rate. This BG method can be applied to a multitude of analysis cases, from short-term (e.g. gamma-ray bursts, state-variable X-ray binaries) to long-term (e.g. persistent sources, diffuse emission), continuum emission (e.g. synchrotron, bremsstrahlung, inverse Compton) and -ray line searches (e.g. decay, excitation, annihilation), as well as a combination of all of these at the same time. We show that our method eliminates most of the systematic uncertainties considering BG modelling with INTEGRAL/SPI.
Acknowledgements.
This research was supported by the German DFG cluster of excellence ’Origin and Structure of the Universe’. The INTEGRAL/SPI project has been completed under the responsibility and leadership of CNES; we are grateful to ASI, CEA, CNES, DLR, ESA, INTA, NASA and OSTC for support of this ESA space science mission.
Appendix A Covariance matrix of SPI maximum likelihood fits
As an example to illustrate the covariance matrix of a SPI maximum likelihood fit, we choose the case of at 1809 keV (Sec. 5.1.2). Here, 29 BG fit parameters each for instrumental continuum and lines were used in addition to one fit parameter for the source. In Fig. 7, the covariance matrix of the fit in a single energy bin is shown. The correlations for consecutive time intervals (neighbouring fit parameters in one component) are in general around zero because these are only coupled by the one persistent source model in the fit. There are slightly higher correlations between neighbouring continuum parameters than between continuum and lines, and lines and lines, because the BG detector patterns for the continuum have less structure than line patterns in general (see also Siegert 2017; Diehl et al. 2018). We show the correlations as a function of energy between source and continuum (entries 1–29 in the top line in Fig. 7), source and lines (entries 30–58 in the top line), and continuum and lines (diagonal in top right block of the matrix) in Figs. 8a–8c. In each figure, the y-axis shows when the BG has been re-scaled in units of INTEGRAL Julian Days (IJD). The colour scheme shows the correlation coefficients as a function of energy and time.
It can be seen that the shorter times between detector failures and annealing periods (cf. Tab. 2) lead to changes in the covariance structure for all components. The largest correlation between components is found between continuum and line BGs as expected from the data structure in such time intervals: between two time nodes, continuum and line BGs can be interchanged if the patterns are not different enough. Similar to a fit of a straight line, one component can replace the other to a certain extent, which leads to a strong anti-correlation. As described in Sec. 3.1, the source shows the strongest correlations with the continuum BGs because the detector patterns of smeared-out sources are closest to those of the BG continuum (Fig. 8b). The correlations between line BGs and the source is flat and around zero because their share no similarities in detector space, even though they are centred at the same energies (Fig. 8c).
Appendix B Background rescaling as a function of time using
In general, -distributions have expectation values equal to the number of degrees of freedom (dof), (Mighell 2000). Their second moments (standard deviation) are . Likewise, the reduced -distribution, i.e. , shows an expectation value of 1.0 with a standard deviation of . Note that these values are only correct for large Poisson mean values and over- or under-estimate the true value in the low-count regime. Andrae et al. (2010) discussed that the variance in means that even if the target value of is not reached, the fit may still be considered ”good”. As the variance depends on the number of data points and the number of fitted parameters, allowed regions around the optimal value can be identified: for a data set with data points and dof, a fit may be considered ”good” if the reduced value falls in the range . Here, should be chosen according to the size of the data set, i.e. the number of data points . For large data sets, the chance of having individual outliers is larger than for small data sets, so that larger are adequate for larger data sets. This means that a reduced value of 1.2 can still be considered ”good” for a data set with only 1000 dof, for instance, within 5, however extremely ”bad” for a data set with dof, then being off by .
In Sec. 4.2, we show the resulting fluxes for different data sets as derived from the optimal AIC solution. For our chosen examples with one broad energy bin each, Figs. 3b and 5 already contain the respective -values for the different number of BG scaling parameters. It is worth to note that there is no trend for in- of decreased flux values when choosing the optimal AIC or optimal solution: In the 511 keV case, the fluxes are and for 25172 and 952 BG fit parameters, respectively. For the broad bin between 2.5 and 3.5 MeV, the solutions yield and for 11470 and 2630 parameters, respectively.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Akaike (1974) Akaike, H. 1974, IEEE Transactions on Automatic Control, 19, 716
- 2Andrae et al. (2010) Andrae, R., Schulze-Hartung, T., & Melchior, P. 2010, Ar Xiv e-prints, 1012.3754
- 3Beacom & Yüksel (2006) Beacom, J. F. & Yüksel, H. 2006, Physical Review Letters, 97, 071102
- 4Bouchet et al. (2015) Bouchet, L., Jourdain, E., & Roques, J.-P. 2015, Ap J, 801, 142
- 5Burnham & Anderson (2004) Burnham, K. P. & Anderson, D. R. 2004, Sociological Methods & Research, 33, 261
- 6Cavanaugh (1997) Cavanaugh, J. E. 1997, Statistics & Probability Letters, 33, 201
- 7Debertin & Helmer (1988) Debertin, K. & Helmer, R. 1988, Gamma- and x-ray spectrometry with semiconductor detectors (North-Holland)
- 8Diehl et al. (2006) Diehl, R., Halloin, H., Kretschmer, K., et al. 2006, Nature, 439, 45
