Kernel phase imaging with VLT/NACO: high-contrast detection of new candidate low-mass stellar companions at the diffraction limit
Jens Kammerer, Michael J. Ireland, Frantz Martinache, Julien H., Girard

TL;DR
This paper demonstrates that kernel phase imaging can detect low-mass stellar companions at the diffraction limit, achieving resolutions below classical limits, and identifies five new companions using VLT/NACO data.
Contribution
The study applies kernel phase combined with principal component calibration to high-cadence L' band data, revealing new companions and showcasing super-resolution capabilities.
Findings
Detected eight low-mass companions, five previously unknown.
Achieved resolution below the classical diffraction limit (~80-110 mas).
Reached a 5σ contrast limit of approximately 1/100 at small angular separations.
Abstract
Directly imaging exoplanets is challenging because quasi-static phase aberrations in the pupil plane (speckles) can mimic the signal of a companion at small angular separations. Kernel phase, which is a generalization of closure phase (known from sparse aperture masking), is independent of pupil plane phase noise to second order and allows for a robust calibration of full pupil, extreme adaptive optics observations. We applied kernel phase combined with a principal component based calibration process to a suitable but not optimal, high cadence, pupil stabilized L' band () data set from the ESO archive. We detect eight low-mass companions, five of which were previously unknown, and two have angular separations of - (i.e. -), demonstrating that kernel phase achieves a resolution below the classical diffraction limit of a…
| OB | Target | SpT | [pc] | K [mag] | [s] | Vis | Can | Det | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | HIP 68994 | F3/5V | 71.7 | 6.715 | 395.8 | N | – | 46.6 | Y | 4.7 | N |
| HIP 63734 | F7/8V | 54.1 | 6.436 | 389.2 | N | – | 44.3 | N | – | N | |
| HIP 55052 | K7V | 23.7 | 6.808 | 389.2 | N | – | 45.1 | N | – | N | |
| 2 | HIP 44722 | K7V | 14.5 | 5.757 | 395.6 | N | – | 22.1 | N | – | N |
| HD 108767 B | K0V | 26.7 | 6.235 | 310.2 | N | – | 21.4 | N | – | N | |
| HIP 47425 | M3V | 9.6 | 6.056 | 388.8 | N | – | 32.4 | Y | 1.0 | N | |
| HIP 50156 | M0.7V | 23.4 | 6.261 | 395.2 | N | – | 292.7 | Y | 33.2 | Y | |
| HD 102982 | G3V | 53.2 | 6.605 | 316.6 | N | – | 23.9 | N | – | N | |
| HIP 58029 | G7V | 42.2 | 6.78 | 395.8 | N | – | 32.9 | Y | 1.4 | N | |
| HIP 61804 | G3V | 59.2 | 6.869 | 395.8 | N | – | 27.1 | N | – | N | |
| HD 110058 | A0V | 130.0 | 7.583 | 383.4 | N | – | 30.3 | N | – | N | |
| HIP 72053 | G3V | 59.7 | 6.994 | 382.4 | N | – | 29.4 | N | – | N | |
| 3 | HIP 58241 | G4V | 35.5 | 6.24 | 256.0 | N | – | 16.7 | N | – | N |
| TYC 8312 0298 1 | K0II | 804.5 | 6.475 | 162.0 | N | – | 18.0 | N | – | N | |
| HIP 78747 | F5V | 41.1 | 4.859 | 280.8 | N | – | 22.8 | Y | 2.0 | N | |
| 4 | HIP 37918 | K0IV-V | 34.4 | 6.275 | 389.2 | N | – | 336.5 | Y | 20.3 | Y |
| HIP 36985 | M1.0V | 14.1 | 5.934 | 334.4 | Y | 182.2 | 31.1 | N | – | N | |
| TYC 7401 2446 1 | K0V | 42.2 | 6.778 | 117.4 | Y | 195.0 | 14.4 | N | – | N | |
| 5 | TYC 6849 1795 1 | K5V | 27.6 | 6.911 | 305.4 | Y | 250.1 | 13.3 | N | – | N |
| HIP 92403 | M3.5V | 3.0 | 5.370 | 750.8 | N | – | 39.1 | Y | 2.5 | N | |
| HIP 94020 B | K5V | 29.1 | 6.999 | 657.0 | N | – | 23.9 | N | – | N | |
| 6 | BDp19 3532 | K0 | 240.2 | 5.842 | 1361.2 | N | – | – | N | – | N |
| HIP 108085 | B8IV-V | 64.7 | 3.45 | 401.8 | N | – | – | N | – | N | |
| 7 | HIP 116231 | B9.5III | 53.4 | 4.611 | 285.2 | Y | 4.3 | – | N | – | N |
| HIP 116258 | K2V | 34.0 | 6.685 | 367.0 | N | – | – | N | – | N | |
| 8 | HIP 11484 | B9III | 60.4 | 4.392 | 279.6 | N | – | – | N | – | N |
| 9 | HIP 3203 B | K5V | 26.5 | 6.834 | 181.6 | N | – | – | N | – | N |
| 10 | TYC 5835 0469 1 | G8V | 60.9 | 6.997 | 465.0 | Y | 95.8 | – | N | – | N |
| TYC 9339 2158 1 | K3V | 30.3 | 6.712 | 461.2 | N | – | – | N | – | N | |
| 11 | HIP 7554 | M0V | 22.2 | 6.621 | 637.4 | N | – | – | N | – | N |
| HIP 13754 | K2V | 38.6 | 6.883 | 503.8 | N | – | – | N | – | N | |
| 12 | HIP 116384 | K7V | 20.8 | 6.044 | 739.4 | Y | 99.7 | 26.2 | N | – | N |
| HIP 12925 | F8 | 57.1 | 6.52 | 595.4 | N | – | 24.0 | N | – | N | |
| HIP 13008 | F5V | 39.1 | 5.442 | 617.8 | N | – | 127.4 | Y | N/A | N | |
| 13 | HIP 14555 | M1V | 19.6 | 6.367 | 609.6 | N | – | 35.1 | N | – | N |
| HIP 20737 | G9.5V | 35.6 | 6.742 | 626.6 | N | – | 31.1 | N | – | N | |
| HIP 22506 | G9V | 50.8 | 6.876 | 620.0 | N | – | 35.8 | Y | 4.3 | N | |
| HIP 23362 | B9V | 60.5 | 4.974 | 311.8 | N | – | 28.7 | N | – | N | |
| Notes. OBs 6–11 contain only one or two targets and cannot be analyzed with the kernel phase technique due to a lack of calibrators. Spectral types (SpT), distances () and apparent K band magnitudes (K) are taken from Simbad (Wenger et al., 2000). | |||||||||||
| Target | CC | L’ [mag] | [pri/sec] | [mas] | [deg] | New | Ref | |||
| HIP 36985 | B | 21.66 | 61.6 | 6311.3 | Y | – | ||||
| TYC 7401 2446 1 | B | 8.10 | 7.1 | 1238.9 | Y | – | ||||
| TYC 6849 1795 1 | B | 14.29 | 13.0 | 6090.0 | N | G16 | ||||
| HIP 116231 | B | 58.19 | 667.5 | 696.7 | N | S10 | ||||
| TYC 5835 0469 1 | B | 23.56 | 17.1 | 1883.7 | Y | – | ||||
| HIP 116384 | C | 9.18 | 40.6 | 186.4 | N | M03 | ||||
| HIP 50156 | B | 19.75 | 22.1 | 1195.7 | N | B15 | ||||
| HIP 37918 | B | 46.55 | 17.3 | 1104.6 | Y | – | ||||
| Notes. G16 = Galicher et al. (2016), S10 = Schöller et al. (2010), M03 = Martín (2003), B15 = Bowler et al. (2015). | ||||||||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Kernel phase imaging with VLT/NACO: high-contrast detection of new candidate low-mass stellar companions at the diffraction limit
Jens Kammerer,1 Michael J. Ireland,1 Frantz Martinache2 and Julien H. Girard3,4
1Research School of Astronomy & Astrophysics, Australian National University, ACT 2611, Australia
2Laboratoire Lagrange, Université Côte d’Azur, Observatoire de la Côte d’Azur, CNRS, Parc Valrose, Bât. H. FIZEAU, 06108 Nice, France
3Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
4Université Grenoble Alpes, CNRS, IPAG, 38000 Grenoble, France E-mail: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract
Directly imaging exoplanets is challenging because quasi-static phase aberrations in the pupil plane (speckles) can mimic the signal of a companion at small angular separations. Kernel phase, which is a generalization of closure phase (known from sparse aperture masking), is independent of pupil plane phase noise to second order and allows for a robust calibration of full pupil, extreme adaptive optics observations. We applied kernel phase combined with a principal component based calibration process to a suitable but not optimal, high cadence, pupil stabilized L’ band () data set from the ESO archive. We detect eight low-mass companions, five of which were previously unknown, and two have angular separations of – (i.e. –), demonstrating that kernel phase achieves a resolution below the classical diffraction limit of a telescope. While we reach a contrast limit of at such angular separations, we demonstrate that an optimized observing strategy with more diversity of PSF references (e.g. star-hopping sequences) would have led to a better calibration and even better performance. As such, kernel phase is a promising technique for achieving the best possible resolution with future space-based telescopes (e.g. JWST), which are limited by the mirror size rather than atmospheric turbulence, and with a dedicated calibration process also for extreme adaptive optics facilities from the ground.
keywords:
planets and satellites: detection – planets and satellites: formation – techniques: high angular resolution – techniques: image processing – techniques: interferometric – binaries: close
††pubyear: 2018††pagerange: Kernel phase imaging with VLT/NACO: high-contrast detection of new candidate low-mass stellar companions at the diffraction limit–B
1 Introduction
Direct imaging is vital for studying the outer regions of extrasolar systems which are inaccessible to transit observations and can only be revealed by decades-long, time consuming radial velocity surveys (e.g. Fischer et al., 2014). It has proven particularly successful in probing our understanding of the formation of gas giant planets (e.g. D’Angelo et al., 2010), being able to estimate their mass from their luminosity and age (e.g. Spiegel & Burrows, 2012) and resolve their orbit. Although the majority of detected companion candidates are arguably consistent with being emission or scattering from disk material (e.g. LkCa 15, Kraus & Ireland 2012, HD 100546, Quanz et al. 2013, HD 169142, Biller et al. 2014), the recent example of PDS 70 (Keppler et al., 2018) demonstrates that direct imaging of wide-separation but still solar-system scale planets is possible at relatively moderate contrasts in the vicinity of young stars. This is spurring an ongoing discussion about the nature of planet formation and the commonness of gas giant planets with large orbital distances (e.g. Bowler & Nielsen, 2018).
However, direct imaging operates at the resolution and sensitivity limit of the most powerful instruments today (e.g. Pepe et al., 2014), placing demanding requirements on the observing and the post-processing techniques which are used to uncover faint companions at high contrasts (e.g. angular differential imaging, Marois et al. 2006, point spread function subtraction, Lafrenière et al. 2007a, principal component analysis, Amara & Quanz 2012, Soummer et al. 2012). Detecting exoplanets from the ground using these techniques has only been made possible by the recent development of extreme adaptive optics systems (e.g. Milli et al., 2016) and is mainly limited by non-common path aberrations which are not sensed by the wavefront control system (e.g. Sauvage et al., 2007). These aberrations manifest themselves as quasi-static speckles on the detector images which can mimic the signal of a companion and place a strong constraint on the achievable contrast at small angular separations (e.g. Fitzgerald & Graham, 2006). Hence, directly imaging and studying the formation of gas giant planets on solar-system scales has been extremely challenging so far (e.g. Bowler, 2016) because the nearest star forming regions lie away (e.g. Loinard et al., 2007) where such orbital distances correspond to angular separations of only .
In this paper, we explore the capabilities of the kernel phase technique (Martinache, 2010) for high-contrast imaging at the diffraction limit from the ground. This post-processing technique can be seen as refinement of sparse aperture masking and the closure phase technique (Tuthill et al., 2000). By probing only certain linear combinations of the phase of the Fourier transformed detector images, kernel phase and sparse aperture masking allow for a robust calibration of the time-varying optical transfer function of the system and a significant mitigation of quasi-static speckles and achieve an angular resolution of in the near-infrared (i.e. the L’ band, Cheetham et al., 2016). This gives access to objects on solar-system scales in the nearest star forming regions (i.e. projected separations of for a Jupiter analog in the Scorpius Centaurus OB association, Preibisch & Mamajek 2008) and has proven successful in directly imaging young exoplanets/disk features (e.g. Kraus & Ireland, 2012). The caveat of sparse aperture masking is that the mask blocks of the light (for VLT/NACO, Tuthill et al., 2010) and therefore significantly decreases the sensitivity and hence the contrast limit of the observations for relatively faint targets. However, kernel phase uses the light collected by the entire pupil and should perform better in the high Strehl regime and the bright limit (e.g. Pope et al., 2016; Sallum & Skemer, 2019).
For sparse aperture masking, a mask is placed at the Lyot stop of an instrument in order to split the primary mirror into a discrete interferometric array of real sub-apertures (e.g. Readhead et al., 1988). In the Fourier transform of the detector image (hereafter referred to as Fourier plane), these sub-apertures map onto their auto-correlation (i.e. their spatial frequencies, Ireland 2016). The phase of each spatial frequency can be extracted and linearly combined in a way such that the resulting closure phase is independent of the pupil plane (or instrumental) phase to second order (i.e. terms of first and second order in are vanishing), where the matrix encodes this special linear combination (e.g. Ireland, 2016). For observations from the ground, the pupil plane phase is affected by noise from atmospheric seeing and non-common path aberrations which ultimately cause quasi-static speckles. Being more robust with respect to these systematic effects, sparse aperture masking achieves a superior angular resolution.
For full pupil kernel phase imaging, there is no mask and the entire primary mirror is discretized into an interferometric array of virtual sub-apertures. According to Martinache (2010), it is then convenient to define a transfer matrix which maps the baselines between each pair of virtual sub-apertures onto their corresponding spatial frequency. In the high Strehl regime, where the pupil plane phase can be linearized, we obtain the relationship
[TABLE]
where is a diagonal matrix encoding the redundancy of the spatial frequencies (i.e. the baselines of the interferometric array) and is the phase intrinsic to the observed object. Multiplication with the left kernel of yields
[TABLE]
hence the kernel of the measured Fourier plane phase directly represents the kernel of the phase intrinsic to the observed object , at least in the high Strehl regime (where is negligible). This is why frame selection based on the Strehl ratio is essential. Note that the kernel phase is a generalization of the closure phase to the case of redundant apertures.
For observations from space, which do not suffer from atmospheric seeing, kernel phase has proven to be successful in resolving close companions at the diffraction limit (Martinache, 2010; Pope et al., 2013). It is our goal to determine if, under good observing conditions, kernel phase also is a competitive alternative to sparse aperture masking from the ground.
2 Methods
2.1 Data reduction
A basic direct imaging data reduction (such as dark, flat, background subtraction and bad pixel correction) is also essential for the kernel phase technique (e.g. Sallum & Eisner, 2017). For this purpose, we developed our own data reduction pipeline111https://github.com/kammerje/PyConica which can be fed the raw data with their associated raw calibrators from the VLT/NACO archive222http://archive.eso.org/wdb/wdb/eso/naco/form. Our data reduction pipeline performs the following steps which are described in more detail in the following sections:
Linearize the raw frames. 2. 2.
Compute master darks and their bad pixel maps. 3. 3.
Compute master flats and their bad pixel maps. 4. 4.
Flag saturated pixels. 5. 5.
Apply dark, flat, background and bad pixel corrections. 6. 6.
Perform a dither subtraction. 7. 7.
Reconstruct saturated pixels. 8. 8.
Select frames with sufficient Strehl ratio.
2.1.1 Detector linearization correction
Like most photon counting devices, NACO’s infrared detector CONICA suffers from a non-linear response when the pixel counts approach the saturation threshold (16400 counts for uncorrelated high well depth mode333This is the standard imaging mode in the L’ band () and all data cubes which we analyze have been taken in this mode. according to the NACO Quality Control and Data Processing444https://www.eso.org/observing/dfo/quality/NACO/qc/detmon_qc1.html, with a more conservative 16000 counts used in our analysis). As kernel phase is an interferometric technique for which the fringes are coded spatially on the detector, it is very important to characterize the pixel to pixel response. Moreover, many of the data cubes which we analyze in Section 3 feature saturated point spread functions (PSFs) which we want to correct for non-linearity before reconstructing their core (cf. Section 2.1.6).
In order to compute the detector linearization correction we download all frames of type “FLAT, LAMP, DETCHECK” and uncorrelated high well depth mode from 2016 March 23 and 2016 September 24 (which are closest in time to the observation of the earliest and the latest data cube which we analyze) from the VLT/NACO archive. We sort them by integration time and compute the median pixel count over all frames for each individual integration time (masking out the broken stripes in the lower left quadrant of CONICA). Then, we plot the median pixel count in dependence of the integration time , fit a linear polynomial to all data points with less than 8500 counts (end of the linear regime for uncorrelated high well depth mode) and a cubic polynomial to all data points with less than 16000 counts (saturation threshold, cf. left panel of Figure 1). We linearize the detector using a continuously differentiable piecewise polynomial approach to the correction curve with a linear function up to 8500 counts and a cubic polynomial between 8500 and 16000 counts (cf. right panel of Figure 1).
2.1.2 Master darks and master flats
For each observation block (OB) we compute master darks from the associated dark frames as the median of all dark frames with a unique set of size and exposure time. Then we compute a bad pixel map for each master dark based on the frame by frame median and variance of each pixel’s count. Therefore, we first compute two frames:
The absolute difference between the master dark and the median filtered master dark. 2. 2.
The absolute of the median subtracted variance dark.
Then, we identify bad pixels in each of these frames based on their difference to the median of these frames. For frame (i) we classify pixels which are above 10 times the median as bad, for frame (ii) pixels which are above 75 times the median. Note that these thresholds were identified empirically. From the median subtracted dark frames, we estimate the readout noise as the mean over each frame’s pixel count standard deviation.
We proceed similar for the flat frames, but also group them by filter as well as size and exposure time, subtract a master dark with matching properties (i.e. similar size and exposure time) from each master flat and normalize it by its median pixel count.
2.1.3 Saturated pixels
The data cubes which we analyze in Section 3 consist of 100 frames of exposure. For each data cube, we reject the first frame (which we find to consistently suffer from a bias), so that there are 99 frames left. Note that NACO appends the median of all 100 frames at the end of each data cube which is also rejected here. Before proceeding, we also flag the saturated pixels in each frame which are all pixels with more than counts.
2.1.4 Dark, flat, background and bad pixel correction
We clean each frame of a data cube individually by subtracting a master dark with matching properties (i.e. similar size and exposure time), dividing it by a master flat with matching properties (i.e. similar size, exposure time and filter), correcting bad pixels (which are bad pixels from the master dark or the master flat) with a median filter of size five pixels and performing a simple background subtraction by subtracting the median pixel count of the frame from each pixel. A typical result is shown in the left panel of Figure 2, where residual systematic noise (mainly from the detector) is still clearly visible.
2.1.5 Dither subtraction
In order to mitigate the residual systematic noise from the detector and the sky background we perform a dither subtraction. After cleaning all data cubes within one OB, we find for each data cube (which we will here call data cube A) the data cube B with the target furthest away (on the detector) and subtract its median frame from each frame of data cube A. The new bad and saturated pixel maps are then the logical sums of those from both involved data cubes. After this step the residual noise appears like Gaussian random noise as is shown in the right panel of Figure 2.
Our typical performance is a pixel count standard deviation of outside of from the center of the PSF in exposure, where is the detector readout noise, is the observing wavelength ( for the L’ band) and is the diameter of the primary mirror ( for the VLT).
2.1.6 Reconstruction of saturated pixels
Our reconstruction of saturated pixels is based on an algorithm described in Section 2.5 of Ireland (2013). This technique also identifies and corrects residual bad pixels, with no more than 10 additional bad pixels corrected in a typical frame. First, we crop all frames to a size of 96 by 96 pixels () centered on the target. Then, we correct bad and saturated pixels for each frame separately by minimizing the Fourier plane power outside the region of support permitted by the pupil geometry. Let be the matrix which maps the bad and saturated pixel values onto the Fourier plane domain , then
[TABLE]
where are the corrections to the bad and saturated pixel values (i.e. the corrected pixel values are ) and is remaining Fourier plane noise. We solve for using the Moore-Penrose pseudo-inverse of , i.e.
[TABLE]
Since a broad-band filter was used for the observations, but we use a monochromatic central filter wavelength in our analysis and also blur the edge of the pupil through the use of a windowing function, we use a sightly larger pupil diameter to define this region of here. In fact, the only important thing for recovering the Fourier plane phase is that the Fourier plane power outside the region of support permitted by the pupil geometry is minimized, so using a larger pupil diameter just assures this in case of low quality data and is a conservative choice, especially in the case of our data which is far from the Nyquist sampling criterion.
Sometimes, the remaining Fourier plane noise can be significant, which is why we repeat the entire correction process up to 15 times for each frame. After each iteration, we look for remaining bad pixels by:
Computing the Fourier transform of the corrected frame from the previous iteration. 2. 2.
Windowing this frame by the Fourier domain . 3. 3.
Computing the inverse Fourier transform of this frame. 4. 4.
Identifying remaining bad pixels in this frame based on their difference to the median filtered frame.
If no remaining bad pixels are identified, we terminate the iteration.
A cross-section of a saturated PSF before and after performing the reconstruction is shown in Figure 3. Obviously, this reconstruction cannot reveal any structure or companions hidden behind saturated pixels, but it allows us to perform our kernel phase analysis on saturated data cubes which would otherwise suffer from high Fourier plane phase noise (cf. Figure 4). Please note that a method from the class of least squares spectral analysis techniques (i.e. image plane fringe fitting) may be more robust in dealing with bad pixels, but would require the simultaneous fitting of all Fourier plane phases and amplitudes and is therefore beyond the scope of this paper, although it is a promising approach for future work.
2.1.7 Frame selection
As explained in the Introduction, a high Strehl ratio is essential for the kernel phase technique in order for the mathematical framework (i.e. the linearization of the Fourier plane phase, cf. Equation 1) to be valid. Therefore, we select frames with sufficient Strehl ratio based on their peak pixel count. For each data cube, we first compute the median peak count of the 10% best frames. Then, we reject all frames with a peak count below 75% of this value. Using this dynamic threshold is better than simply rejecting a fixed fraction of the frames (e.g. Law et al., 2006) because it can correctly account for a sudden drop in the Strehl ratio like shown in Figure 5. Note that we consider the peak pixel count after performing the PSF reconstruction (cf Section 2.1.6) here.
2.2 Kernel phase extraction
2.2.1 VLT pupil model
In order to extract the kernel phase from VLT/NACO data we first need to construct a model for the VLT pupil (i.e. split the primary mirror into an interferometric array of virtual sub-apertures). We sample 140 virtual sub-apertures on a square grid with a pupil plane spacing of , which is approximately half the Nyquist sampling of , where is the observing wavelength and is the image size (96 pixels). Our VLT pupil model is shown in the left panel of Figure 6 and based on an primary mirror with a central obscuration. Another advantage of kernel phase over sparse aperture masking is the dense Fourier plane coverage which is shown in the right panel of Figure 6.
2.2.2 XARA
The extraction of the Fourier plane phase and the computation of the kernel phase relies on a python package called XARA555https://github.com/fmartinache/xara (eXtreme Angular Resolution Astronomy, Martinache, 2010, 2013). XARA has been designed to process data produced by multiple instruments assuming that the images comply to the kernel phase analysis requirements of proper sampling, high-Strehl (boosted by our frame selection procedure described in Section 2.1.7), and non-saturation (restored by the procedure described in Section 2.1.6). The discrete achromatic representation of the VLT aperture (i.e. our pupil model) is used by XARA to compute the phase transfer matrix and the associated left kernel operator via a singular value decomposition of .
With the added knowledge of the detector pixel scale and the observing wavelength, the discrete model is scaled so that the Fourier plane phase at the expected coordinates can be extracted by a discrete Fourier transform. For the small aberration hypothesis to remain valid, the data must be properly centered prior to the Fourier transform. Failure to do so will leave a residual Fourier plane phase ramp that can wrap and lead to meaningless kernel phases (cf. left panels of Figure 7). XARA offers several centering algorithms. It is crucial to carefully choose from the available options depending on the requirements coming from the data. For our extensive ground-based data set for example, we find that minimizing directly the Fourier plane phase which is extracted by XARA is most robust and the offered sub-pixel re-centering is very valuable (cf. right panels of Figure 7) due to an increased level of pupil plane phase noise from the atmosphere and the bright background (if compared to space-borne data).
Moreover, virtual baselines near the outer edge of the Fourier coverage suffer from low power as they are only supported by very few baselines, i.e. have small redundancy. The phase measured for these baselines is systematically noisier and needs to be excluded from the model to prevent the noise to propagate into the estimation of all kernel phases. This can be achieved using the baseline filtering option implemented in XARA. In our case, baselines of length greater than and the corresponding rows of are eliminated prior to the computation of . Some of the theoretically available kernel phases are lost but the remaining kernel phases can nevertheless be used just like for the complete model.
Finally, to limit the impact of readout noise in regions of the image where little signal is present, frames are windowed by a super-Gaussian () with a radius pixels, effectively limiting our field of view to . Note that Section 3.4 will further comment on the effect of this window and how it can affect contrast estimates for detections at large separations.
2.2.3 Kernel phase uncertainties
For estimating the uncertainties, we compute the kernel phase covariance for each frame from its photon count variance in units of (photo-electrons)2, where is the detector gain ( for uncorrelated high well depth mode), is the super-Gaussian window, is the cleaned and re-centered frame and is its background (from the simple background subtraction, cf. Section 2.1.4). Therefore, we first need to find a linear operator which maps each frame in units of photo-electrons to its kernel phase . The linear discrete Fourier transform and the kernel of the pupil model are already linear operators, and the Fourier plane phase (of a complex number ) can be approximated as for small angles. Hence, we compute
[TABLE]
Note that would be a small-angle approximation for the kernel phase. Then, we obtain an estimate for the kernel phase covariance by propagating the photon count variance according to
[TABLE]
Now, we have a kernel phase and a kernel phase covariance for each frame. In order to save computation time for the model fitting (cf. Section 2.4) we compute a weighted mean of the kernel phase for each data cube. Therefore, we first compute the average kernel phase covariance over all frames of a data cube via
[TABLE]
and then the weighted mean of the kernel phase (cf. Figure 8) via
[TABLE]
For the rest of this paper, we omit the bar for better readability, i.e.
[TABLE]
Note that this kernel phase covariance model includes the contribution of shot noise only. Any residual calibration errors not taken into account in the following section are therefore expected to increase the reduced of any model fitting, potentially to much more than 1.0 in the case of high signal-to-noise data with highly imperfect calibration.
2.3 Kernel phase calibration
Under perfect conditions the closure phase of a point-symmetric source, such as an unresolved star, is zero (e.g. Monnier, 2007). The same holds for the kernel phase, which is a generalization of the closure phase (e.g. Ireland, 2016). Practically however, one is limited by systematic errors caused by third order phase residuals (e.g. Ireland, 2013) and even point-symmetric sources have non-zero kernel phase.
For this reason, calibration is of fundamental importance when analyzing interferometric measurables (like closure or kernel phase). The systematic errors are expected to be quasi-static (e.g. Ireland, 2013), i.e. slowly varying with time, so that the kernel phase of a well-known point source measured close in time to that of the science target can serve as a calibrator. The simplest calibration technique would be to subtract the kernel phase of a well-known point source from that of the science target. This technique was for example used successfully in Martinache (2010), but here we want to go beyond this approach.
We use principal component analysis in the framework of a Karhunen-Loève decomposition (Soummer et al., 2012; Pueyo, 2016) in order to subtract the statistically most significant phase residuals of the calibrator kernel phase from that of the science target. Note that the following technique is new, but very similar to the POISE observables in Ireland (2013). We start by computing the covariance matrix of the kernel phase of all calibrator data cubes via
[TABLE]
Then, we compute an eigendecomposition of this matrix in order to obtain its sorted (in descending order) eigenvalues and eigenvectors . Finally, we compute the Karhunen-Loève transform of shape (number of kernel phases, number of calibrator data cubes) via
[TABLE]
where is the p-th component of the k-th eigenvector of and is the n-th kernel phase of the p-th calibrator data cube.
From the Karhunen-Loève transform we obtain a projection matrix via
[TABLE]
where is the identity matrix and is obtained from the first columns of . is an integer representing the order of the correction, i.e. how many eigencomponents of the calibrator kernel phase should be corrected for. The projection matrix is of shape (number of kernel phases, number of kernel phases), but it has zero eigenvalues by construction. In order to properly represent the dimensions we compute another eigendecomposition of and obtain a new projection matrix , whose columns are those eigenvectors of which correspond to non-zero eigenvalues. The projection matrix is of shape (number of “good” kernel phases, number of kernel phases), where “good” means statistically independent of systematic errors, and can be used to project the measured kernel phase and its covariance into a sub-space of dimension (number of “good” kernel phases), which is more robust with respect to quasi-static errors, via
[TABLE]
For the rest of this paper, we omit the prime for better readability, i.e.
[TABLE]
2.4 Model fitting
From Equations 2–4 it becomes clear that the measured kernel phase directly represents the kernel phase intrinsic to the observed object . Hence, we can infer information about the spatial structure of the observed object by fitting models for to .
2.4.1 Binary model
In order to search for companion candidates we use the binary model
[TABLE]
where is the contrast ratio between secondary and primary, and are the coordinates of the sampled Fourier plane positions (i.e. the spatial frequencies of the pupil model), is the observing wavelength and
[TABLE]
where is the angular separation between primary and secondary, is the position angle of the secondary with respect to the primary and is the detector position angle during the observation. Figure 8 shows the best fit binary model for the measured kernel phase of TYC 6849 1795 1 (resolved and bright binary).
2.4.2 Uncertainties from photon noise
Using the kernel phase covariance estimated from photon noise according to Section 2.2.3 we compute the best fit contrast and its uncertainty for the binary model on each position of a discrete square grid with spacing (which is half the detector pixel scale of CONICA). In some cases, where we suspect a companion candidate at a larger angular separation, we also extend the grid to .
In the high-contrast regime (where ), the phase is approximately proportional to the contrast of the binary model, so is its kernel phase (because is a linear operator). Hence, the of the binary model can be approximated as
[TABLE]
where and are vertical stacks of the kernel phase and the reference binary model of each data cube and is a block-diagonal matrix whose diagonal elements are the inverse kernel phase covariances of each data cube , i.e.
[TABLE]
The reference binary model is the binary model evaluated for and normalized by a reference contrast , i.e.
[TABLE]
Finally, we obtain the log-likelihood for the binary model as
[TABLE]
The best fit contrast for the binary model is then obtained by maximizing for each grid position, i.e.
[TABLE]
and its uncertainty is the square root of its variance, i.e.
[TABLE]
Finally, the detection significance based on photon noise is computed for each grid position as
[TABLE]
where we scale the uncertainty of the best fit contrast by the square root of the minimal reduced of the binary model of the entire grid (). Assuming that kernel phase is proportional to contrast, this is equivalent to scaling the kernel phase covariance so that the minimal reduced is 1.0. This step is necessary because kernel phase is still affected by third (or higher) order pupil plane phase noise (cf. Equations 2–4), so that the uncertainties from photon noise significantly underestimate the true errors. Note that there can be various sources of higher order phase noise (e.g. Ireland, 2013), but studying those in detail is beyond the scope of this paper.
The final parameters for the best fit binary model are then obtained from a least squares search which maximizes the log-likelihood of the binary model under varying angular separation, position angle and contrast simultaneously. For the least squares search, we use the grid position with the maximal log-likelihood as prior and restrict the search box for the angular separation to .
The uncertainties of the best fit parameters follow from the likelihood function for Gaussian errors (which are applicable to high confidence detections)
[TABLE]
where represents the three-dimensional parameter space of angular separation, position angle and contrast. Differentiating twice and neglecting terms containing second order derivatives of a single parameter yields
[TABLE]
where and are the Jacobian and the Hessian matrix of the binary model . Hence, the covariance matrix of the model parameters can be obtained via
[TABLE]
and the uncertainties of the model parameters for the best fit binary model are
[TABLE]
We also compute the correlation of the best fit model parameters as
[TABLE]
where denotes element-wise division.
2.4.3 Empirical uncertainties
Using only the uncertainties from photon noise, it is still difficult to distinguish between residual speckle noise (i.e. third order phase noise in the pupil plane) and real detections at small angular separations. This is the case because the data set which we analyze in Section 3 is very limited in terms of diversity of calibrator PSFs. For this reason, we use an empirical approach as the primary method to determine whether a detection is real or not.
First, we split our targets into candidate detections and calibrators based on their detection significance from photon noise (cf. Section 3.2.2). For each of the calibrators, we then compute two contrast curves:
The azimuthal average of the root mean square (RMS) best fit contrast . 2. 2.
The azimuthal average of the RMS best fit contrast after subtracting the best fit binary model from the measured kernel phase .
Here, the assumption is that the calibrators are single stars, so that the ratio of the two RMS contrast curves computed above, i.e.
[TABLE]
is a correction factor for the relative contrast of the residual speckle noise. This is illustrated in the left panel of Figure 9.
For each of the candidate detections, we only compute the azimuthal average of the RMS best fit contrast after subtracting the best fit binary model (which might or might not be a real detection) from the measured kernel phase . Then, we multiply this RMS contrast curve with the mean of the relative speckle contrast of all calibrators, i.e.
[TABLE]
where the bar denotes the mean, in order to obtain an empirical contrast uncertainty as a function of the angular separation for each candidate detection (cf. right panel of Figure 9). We classify a candidate detection as real if its empirical detection significance is above the threshold, i.e.
[TABLE]
Furthermore, we obtain empirically motivated uncertainties on the best fit parameters by multiplying the uncertainties from photon noise with the ratio of the empirical contrast uncertainty to the contrast uncertainty from photon noise (at the position of the best fit binary model ).
The kernel phase analysis tools described in Sections 2.2, 2.3 and 2.4 are available on GitHub666https://github.com/kammerje/PyKernel. We put a strong focus on applicability to other instruments and an exchangeable kernel phase fits file format.
3 Results and discussion
3.1 Target list
We test our methods on an archival data set because the kernel phase technique is optimized for detecting companions at much smaller angular separations to their host star than conventional high-contrast imaging techniques (such as ADI and reference star differential imaging, i.e. RDI). Hence, the parameter space that we are looking at is still unexplored. We search the VLT/NACO archive for L’ band RDI surveys and decide to analyze program 097.C-0972(A), PI J. Girard, due to a large number of observed targets and therefore potential calibrators. A target list together with our detections is reported in Table 1.
3.2 Detected companion candidates
Before we search the targets in Table 1 for close companion candidates, we perform a basic vetting procedure by visually inspecting the cleaned data for wide companion candidates (cf. Section 3.2.1). In the field of view, which is limited to due to the windowing, we find six wide companion candidates (cf. upper section of Table 2). Three of them are already known and we classify our detections as confirmed, whereas the other three have not been reported before and therefore are new detections. Note that we correct the contrast of the wide companion candidates for the windowing (cf. Section 3.4).
After detecting and subtracting off the signal induced by the wide companion candidates, we use the kernel phase technique in order to search for closer and fainter objects (cf. Section 3.2.2). We find two companion candidates with an empirical detection significance above the threshold, i.e. (cf. lower section of Table 2). One of them is already known and we classify our detection as confirmed, whereas the other one has not been reported before and therefore is a new detection. For HIP 13008 we note that the empirical detection significance is when using only HIP 116384 as calibrator, but only when using HIP 12925 due to high residuals and a very large correction. Therefore, HIP 12925 seems to be a bad calibrator and we do not report any best fit parameters for HIP 13008 due to a lack of credibility. Follow-up observations are required to confirm the true nature of this object. Also note that OBs 6–11 contain only one or two targets and are not analyzed with the kernel phase technique because the diversity of kernel phase amongst calibrators is essential for our empirical detection method. As there are systematic differences between the individual nights in the measured kernel phase, for this paper we are only analyzing OBs which contain at least two PSF calibrators (observed in the same night). Although this choice was made for simplicity and it might be possible to calibrate targets over longer timescales, this adds significant additional complexity which is beyond the scope of this paper.
From the targets for which we detect neither a wide nor a close companion candidate, we compute a contrast curve (i.e. the detection limit as a function of the angular separation) for the kernel phase technique (cf. Section 3.3).
3.2.1 Wide companion candidates
The wide companion candidates reported in the upper section of Table 2 are all detected by visually inspecting the cleaned data. When we find a companion candidate, we use a grid search followed by a least squares search in order to find its best fit binary model for the measured kernel phase . Then, we compute the empirical detection significance for the best fit binary model (cf. right panels of Figures 10 and 11). This is achieved using a simplification of the empirical detection method (cf. Section 2.4.3). Since the wide companion candidates all have a sufficiently large angular separation (i.e. ) and are sufficiently bright (otherwise we could not detect them by eye), we can skip the use of any calibrators and compute the empirical detection significance as the best fit contrast divided by the azimuthal average of the RMS best fit contrast after subtracting the best fit binary model from the measured kernel phase . Note that we do not use any Karhunen-Loève calibration for this step either, i.e. (cf. Section 2.3).
Before we search for closer and fainter objects, we subtract the signal induced by the wide companion candidates from the measured kernel phase, i.e.
[TABLE]
so that the measured kernel phase of all targets is free of wide detections. The detected wide companion candidates are shown in the left panels of Figures 10 and 11 and are described in more detail in the following paragraphs.
HIP 36985 B, TYC 7401 2446 1 B, TYC 5835 0469 1 B. These objects are new companion candidates which were not reported before. They have L’ band contrasts of , and respectively, and therefore are candidates for stellar mass companions.
TYC 6849 1795 1 B. This object was already detected in 2005 by Galicher et al. (2016) at an angular separation of , a position angle of and a H band contrast of . We find a L’ band contrast of and an angular separation () and a position angle () which are in agreement with Galicher et al. (2016), i.e. we can confirm the bound nature of the object.
HIP 116231 B. This object was already detected in 2004 by Schöller et al. (2010) at an angular separation of , a position angle of and a K band contrast of . We find a L’ band contrast of , a slightly larger angular separation of and a slightly different position angle of , but (allowing for orbital motion) we can confirm the bound nature of the object. Note that there is a huge disagreement in the contrast, but a brief look at the raw data from Schöller et al. (2010) shows a significant PSF halo and confirms our result of .
HIP 116384 C. This object was first detected in 2002 by Martín (2003) who found HIP 116384 (GJ 900) to be a triple system with a (HIP 116384 B, ) and a (HIP 116384 C, ) component. Lafrenière et al. (2007b) resolved the system again in 2004 and 2005, finding HIP 116384 B at an angular separation of and respectively, and HIP 116384 C at an angular separation of and respectively. In the cleaned data, we only find HIP 116384 C at a slightly larger angular separation of , but a position angle () and a L’ band contrast () which are in agreement with Martín (2003) and Lafrenière et al. (2007b), so that we can confirm the bound nature of the object. Looking at the raw data, we also find HIP 116384 B (which is the brighter of the two companions), noticing that it has moved to an angular separation of being too far away in order to be visible in our cleaned data (due to the windowing).
3.2.2 Close companion candidates
The close companion candidates reported in the lower section of Table 2 are all detected only by the kernel phase technique. For each target in Table 1, we use a grid search followed by a least squares search in order to find the best fit binary model for the measured kernel phase . Then, we compute the detection significance from photon noise (cf. Section 2.4.2) at the position of the best fit binary model from the least squares search. For this step, we always use all other targets which were observed in the same OB as calibrators for the Karhunen-Loève calibration, fixing 777For simplicity, we fix for all targets and regardless of the number of calibrators. Various testing has shown that subtracting off the four statistically most significant eigencomponents of the kernel phase of the calibrators mostly yields the smallest amount of significant detections, i.e. calibrates the data best. A more rigorous investigation of this relationship is foreseen for a future publication.. Knowing that the majority of VLT/NACO targets do not have any close companions, we then classify the of the targets with the highest in each OB as candidate detections (cf. column “Can” of Table 1) for the next step and the remaining targets as calibrators.
For the next step, we compute the empirical detection significance (cf. Section 2.4.3) for each of the candidate detections from the previous step. For this step, we always use all remaining targets which were classified as calibrators in the previous step for the Karhunen-Loève calibration, again fixing . If the empirical detection significance is above the threshold, i.e. , we classify the candidate detection as real. If not, we add the candidate detection to the list of calibrators and redo the computation of the empirical detection significance (this time with one calibrator more than before). We repeat this process until all candidate detections are real. The detected close companion candidates are shown in Figure 12 and are described in more detail in the following paragraphs. Please note that we report the correlation of the best fit parameters in Appendix A and present model-data correlation plots in Appendix B.
HIP 50156 B. This object was already detected in 2011 by Bowler et al. (2015) at an angular separation of and a K band contrast of . Just nine month later, Brandt et al. (2014) cannot resolve this companion and report an upper limit of for its angular separation. We find HIP 50156 B at an angular separation of and an L’ band contrast of , confirming the detection and notable orbital motion.
HIP 37918 B. This object is a new companion candidate which was not reported before. It has a L’ band contrast of , and therefore is a candidate for a stellar mass companion. Furthermore, HIP 37918 () is known to have a companion of almost equal mass (HIP 37923, , Desidera et al. 2006). Together with our companion candidate, this would make the system triple.
3.3 Detection limits
In Section 2.4.3, we present our empirical approach to find meaningful detection limits for the data analyzed in this paper. Based on this approach, we compute the contrast limit of the kernel phase technique as a function of the angular separation as the azimuthal mean of the RMS best fit contrast of all targets for which we do not detect any companions with the kernel phase technique (i.e. all non-detections, cf. column “Det” of Table 1). Note that we already subtracted off the signal induced by the wide companion candidates. The mean, the best and the worst contrast limit are shown in the left panel of Figure 13.
At the small angular separations which are inaccessible by classical high-contrast imaging techniques (i.e. within in the L’ band), the kernel phase technique achieves contrast limits of . This is not yet deep enough to detect companions in the planetary-mass regime, which would start between and for young () gas giants (e.g. Bowler, 2016). However, our closest detections prove that the resolution which is required to resolve solar-system scales in the nearest star forming regions can be achieved with the kernel phase technique. At larger angular separations, our best contrast limit is comparable with the limits achieved by RDI (e.g. Cantalloube et al., 2015). The large spread in the contrast limit comes from the fact that the amplitude of the background noise is nearly the same for all data cubes, whereas the peak value of the PSF varies heavily due to the PSF reconstruction (cf. Section 2.1.6).
3.4 Windowing correction
As mentioned in Section 2.2.2, we window all frames by a super-Gaussian (with a FWHM of ) in order to minimize edge effects when computing their Fourier transform. Due to this windowing, the brightness of companions at angular separations deviates by more than 1% from the true value. In order to correct for this effect, we again assume that kernel phase is proportional to contrast in the high-contrast regime, so that we can obtain the true contrast of a companion by dividing its measured contrast (i.e. the best fit contrast from the binary model) by the value of the super-Gaussian windowing function. We are aware that this method has its limits, as each PSF has a spatial extent on the detector and assuming that the entire PSF is multiplied by the same value is an over-simplification of the problem. Nevertheless, this method agrees fairly well with the contrasts which we measure in the cleaned fits files and we use it to correct the contrast of all wide companion candidates (cf. right panel of Figure 13). We add an additional contrast correction error in quadrature based on injection-recovery tests to companions wider than to account for limitations in this technique.
4 Conclusions
We use the kernel phase technique in order to search for close companions at the diffraction limit in an archival VLT/NACO RDI L’ band data set. Therefore, we develop our own data reduction pipeline for VLT/NACO data, which performs a basic dark, flat, bad pixel and background (i.e. dither) subtraction, but also reconstructs saturated PSFs in order to reduce their Fourier plane noise. Furthermore, we select frames with sufficiently high Strehl ratio, which is essential for the kernel phase technique as it relies on a linearization of the Fourier plane phase. Then, we use XARA for extracting the kernel phase and improve its re-centering algorithm in the case of resolved and bright companions. Furthermore, we apply a principal component analysis based calibration to the data (i.e. Karhunen-Loève decomposition, Soummer et al., 2012) and develop a suite of analytic model fitting algorithms in order to search for point source companions with the kernel phase technique888https://github.com/kammerje/PyKernel.
For the archival data set which we analyze in Section 3, we find that our kernel phase covariance model (which only takes into account shot noise) is not sufficient and significantly underestimates the true errors. This is still the case after calibrating the data, because the diversity of calibrator PSFs is not sufficient. Hence, we develop an empirical method for estimating the relative contrast of the residual speckle noise and finding meaningful detection limits for the data. With this empirical approach, we detect six wide companion candidates by visually inspecting the cleaned data and two close (–) companion candidates which are detected only by the kernel phase technique. All eight companion candidates lie in the stellar-mass regime and five of them were previously unknown.
In order to reach the planetary-mass regime, a better library of calibrator PSFs is required. Therefore, it is extremely important that the targets and their calibrators are observed as close in time as possible. This becomes very clear from the archival data set which we analyze, where there are in fact multiple calibrators observed in one night, but not close enough in time, so that the kernel phase calibration does not reduce the quasi-static errors satisfyingly. In order to make better use of our prinipal component analysis based calibration, we propose star-hopping sequences of targets, and to revisit each target at least twice. Star-hopping is an observing strategy for which the instrument (and in particular the AO system) acquisition is only performed once at the beginning of each sequence. Then, one slews (“hops”) from target to target without interrupting the AO system. Furthermore, we aim to examine more extensive Keck data sets where we are hopeful that the significant investment of telescope resources gives adequate calibrator diversity to characterize the systematic errors and possibly use Bayesian Monte-Carlo techniques.
In this paper, we have shown that kernel phase is able to achieve a resolution below the classical diffraction limit of a telescope under good observing conditions (i.e. sufficiently high Strehl ratio). This is of particular interest for future space-based observatories, such as the JWST, as it gives access to an exciting parameter space which could otherwise not be explored due to the limited mirror size (and therefore resolution). Space-based telescopes do not suffer from atmospheric turbulence, what makes the calibration much less challenging than for the ground-based VLT/NACO data (e.g. Martinache, 2010). Nevertheless, with an optimized observing strategy, kernel phase is also a competitive high-contrast imaging technique from the ground.
The application of kernel phase is of course not limited to imaging telescopes. One concept which aims to push the kernel phase technique towards higher contrasts is the VIKiNG instrument (Martinache & Ireland, 2018), which proposes kernel phase nulling interferometry with the VLTI. By combining kernel phase with a high-contrast booster (i.e. a nulling interferometer) it would allow for self-calibrating the observables and achieving a better robustness with respect to residual wavefront errors. This would in turn also be an option to reduce the demanding stability requirements on space-based nulling interferometers, such as the LIFE concept (Kammerer & Quanz, 2018; Quanz et al., 2018), which aims to detect dozens of Earth-like exoplanets in the solar neighborhood.
Acknowledgements
MJI was supported by the Australian Research Council Future Fellowship (FT130100235). This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement CoG #683029). JHG gratefully acknowledges support from the Director’s Research Funds at the Space Telescope Science Institute. The manuscript was also substantially improved following helpful comments from an anonymous referee.
Appendix A Parameter correlation
Appendix B Correlation plots
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Amara & Quanz (2012) Amara A., Quanz S. P., 2012, MNRAS , 427, 948 · doi ↗
- 2Biller et al. (2014) Biller B. A., et al., 2014, Ap J , 792, L 22 · doi ↗
- 3Bowler (2016) Bowler B. P., 2016, PASP , 128, 102001 · doi ↗
- 4Bowler & Nielsen (2018) Bowler B. P., Nielsen E. L., 2018, preprint, ( ar Xiv:1802.10132 )
- 5Bowler et al. (2015) Bowler B. P., Liu M. C., Shkolnik E. L., Tamura M., 2015, Ap JS , 216, 7 · doi ↗
- 6Brandt et al. (2014) Brandt T. D., et al., 2014, Ap J , 786, 1 · doi ↗
- 7Cantalloube et al. (2015) Cantalloube F., et al., 2015, A&A , 582, A 89 · doi ↗
- 8Cheetham et al. (2016) Cheetham A. C., Girard J., Lacour S., Schworer G., Haubois X., Beuzit J.-L., 2016, in Optical and Infrared Interferometry and Imaging V. p. 99072 T, doi:10.1117/12.2231983 · doi ↗
