Kernel phase imaging with VLT/NACO: high-contrast detection of new   candidate low-mass stellar companions at the diffraction limit

Jens Kammerer; Michael J. Ireland; Frantz Martinache; Julien H.; Girard

arXiv:1903.11252·astro-ph.EP·April 10, 2019

Kernel phase imaging with VLT/NACO: high-contrast detection of new candidate low-mass stellar companions at the diffraction limit

Jens Kammerer, Michael J. Ireland, Frantz Martinache, Julien H., Girard

PDF

TL;DR

This paper demonstrates that kernel phase imaging can detect low-mass stellar companions at the diffraction limit, achieving resolutions below classical limits, and identifies five new companions using VLT/NACO data.

Contribution

The study applies kernel phase combined with principal component calibration to high-cadence L' band data, revealing new companions and showcasing super-resolution capabilities.

Findings

01

Detected eight low-mass companions, five previously unknown.

02

Achieved resolution below the classical diffraction limit (~80-110 mas).

03

Reached a 5σ contrast limit of approximately 1/100 at small angular separations.

Abstract

Directly imaging exoplanets is challenging because quasi-static phase aberrations in the pupil plane (speckles) can mimic the signal of a companion at small angular separations. Kernel phase, which is a generalization of closure phase (known from sparse aperture masking), is independent of pupil plane phase noise to second order and allows for a robust calibration of full pupil, extreme adaptive optics observations. We applied kernel phase combined with a principal component based calibration process to a suitable but not optimal, high cadence, pupil stabilized L' band ( $3.8 μ m$ ) data set from the ESO archive. We detect eight low-mass companions, five of which were previously unknown, and two have angular separations of $\sim 0.8$ - $1.2 λ / D$ (i.e. $\sim 80$ - $110 mas$ ), demonstrating that kernel phase achieves a resolution below the classical diffraction limit of a…

Tables2

Table 1. Table 1: Target list grouped by OB for the VLT/NACO program 097.C-0972(A), PI J. Girard. For each target, we report the spectral type (SpT), the distance ( d 𝑑 d ), the apparent K band magnitude (K) and the total integration time after frame selection ( T int subscript 𝑇 int T_{\text{int}} ). Whether we find any wide (visual) companion candidates, close (kernel phase) candidate detections and real detections is highlighted in columns “Vis”, “Can” and “Det”. We further report the empirical detection significance for the wide (visual) companion candidates ( SNR emp vis superscript subscript SNR emp vis \text{SNR}_{\text{emp}}^{\text{vis}} ), the detection significance from photon noise for all targets ( SNR ph subscript SNR ph \text{SNR}_{\text{ph}} ) and the empirical detection significance for the close (kernel phase) candidate detections ( SNR emp can superscript subscript SNR emp can \text{SNR}_{\text{emp}}^{\text{can}} ).

Notes. OBs 6–11 contain only one or two targets and cannot be analyzed with the kernel phase technique due to a lack of calibrators. Spectral types (SpT), distances ( $d$ ) and apparent K band magnitudes (K) are taken from Simbad (Wenger et al., 2000).
OB	Target	SpT	$d$ [pc]	K [mag]	$T_{int}$ [s]	Vis	${SNR}_{emp}^{vis}$	${SNR}_{ph}$	Can	${SNR}_{emp}^{can}$	Det
1	HIP 68994	F3/5V	71.7	6.715	395.8	N	–	46.6	Y	4.7	N
	HIP 63734	F7/8V	54.1	6.436	389.2	N	–	44.3	N	–	N
	HIP 55052	K7V	23.7	6.808	389.2	N	–	45.1	N	–	N
2	HIP 44722	K7V	14.5	5.757	395.6	N	–	22.1	N	–	N
	HD 108767 B	K0V	26.7	6.235	310.2	N	–	21.4	N	–	N
	HIP 47425	M3V	9.6	6.056	388.8	N	–	32.4	Y	1.0	N
	HIP 50156	M0.7V	23.4	6.261	395.2	N	–	292.7	Y	33.2	Y
	HD 102982	G3V	53.2	6.605	316.6	N	–	23.9	N	–	N
	HIP 58029	G7V	42.2	6.78	395.8	N	–	32.9	Y	1.4	N
	HIP 61804	G3V	59.2	6.869	395.8	N	–	27.1	N	–	N
	HD 110058	A0V	130.0	7.583	383.4	N	–	30.3	N	–	N
	HIP 72053	G3V	59.7	6.994	382.4	N	–	29.4	N	–	N
3	HIP 58241	G4V	35.5	6.24	256.0	N	–	16.7	N	–	N
	TYC 8312 0298 1	K0II	804.5	6.475	162.0	N	–	18.0	N	–	N
	HIP 78747	F5V	41.1	4.859	280.8	N	–	22.8	Y	2.0	N
4	HIP 37918	K0IV-V	34.4	6.275	389.2	N	–	336.5	Y	20.3	Y
	HIP 36985	M1.0V	14.1	5.934	334.4	Y	182.2	31.1	N	–	N
	TYC 7401 2446 1	K0V	42.2	6.778	117.4	Y	195.0	14.4	N	–	N
5	TYC 6849 1795 1	K5V	27.6	6.911	305.4	Y	250.1	13.3	N	–	N
	HIP 92403	M3.5V	3.0	5.370	750.8	N	–	39.1	Y	2.5	N
	HIP 94020 B	K5V	29.1	6.999	657.0	N	–	23.9	N	–	N
6	BDp19 3532	K0	240.2	5.842	1361.2	N	–	–	N	–	N
6	HIP 108085	B8IV-V	64.7	3.45	401.8	N	–	–	N	–	N
7	HIP 116231	B9.5III	53.4	4.611	285.2	Y	4.3	–	N	–	N
7	HIP 116258	K2V	34.0	6.685	367.0	N	–	–	N	–	N
8	HIP 11484	B9III	60.4	4.392	279.6	N	–	–	N	–	N
9	HIP 3203 B	K5V	26.5	6.834	181.6	N	–	–	N	–	N
10	TYC 5835 0469 1	G8V	60.9	6.997	465.0	Y	95.8	–	N	–	N
10	TYC 9339 2158 1	K3V	30.3	6.712	461.2	N	–	–	N	–	N
11	HIP 7554	M0V	22.2	6.621	637.4	N	–	–	N	–	N
11	HIP 13754	K2V	38.6	6.883	503.8	N	–	–	N	–	N
12	HIP 116384	K7V	20.8	6.044	739.4	Y	99.7	26.2	N	–	N
	HIP 12925	F8	57.1	6.52	595.4	N	–	24.0	N	–	N
	HIP 13008	F5V	39.1	5.442	617.8	N	–	127.4	Y	N/A	N
13	HIP 14555	M1V	19.6	6.367	609.6	N	–	35.1	N	–	N
	HIP 20737	G9.5V	35.6	6.742	626.6	N	–	31.1	N	–	N
	HIP 22506	G9V	50.8	6.876	620.0	N	–	35.8	Y	4.3	N
	HIP 23362	B9V	60.5	4.974	311.8	N	–	28.7	N	–	N

Table 2. Table 2: Wide companion candidates (CC) detected by visually inspecting the cleaned data (upper section) and close companion candidates detected only by the kernel phase technique (lower section). We estimate the apparent L’ band magnitude (L’) by adding the contrast ( c 𝑐 c ) to the apparent K band magnitude of the host star (K, cf. Table 1 ). We report the angular separation ( ρ 𝜌 \rho ) and the position angle ( ϑ italic-ϑ \vartheta ) of our best fit binary model θ bin subscript 𝜃 bin \theta_{\text{bin}} , the ratio of the empirical errors (which are reported here) to the errors from photon noise ( f err subscript 𝑓 err f_{\text{err}} ) and the reduced χ 2 superscript 𝜒 2 \chi^{2} of our best fit binary model ( χ r,bin 2 superscript subscript 𝜒 r,bin 2 \chi_{\text{r,bin}}^{2} ) and the raw kernel phase ( χ r,raw 2 superscript subscript 𝜒 r,raw 2 \chi_{\text{r,raw}}^{2} ). Whether a detection is new or known is highlighted in column “New” and a reference for known detections can be found in column “Ref”.

Target	CC	L’ [mag]	$c$ [pri/sec]	$ρ$ [mas]	$ϑ$ [deg]	$f_{err}$	$χ_{r,bin}^{2}$	$χ_{r,raw}^{2}$	New	Ref
HIP 36985	B	$8.553 \pm 0.005$	$(8.96 \pm 0.04) e - 2$	$441.5 \pm 0.2$	$133.77 \pm 0.02$	21.66	61.6	6311.3	Y	–
TYC 7401 2446 1	B	$8.096 \pm 0.005$	$(2.97 \pm 0.01) e - 1$	$425.8 \pm 0.3$	$89.23 \pm 0.03$	8.10	7.1	1238.9	Y	–
TYC 6849 1795 1	B	$8.363 \pm 0.004$	$(2.63 \pm 0.01) e - 1$	$223.5 \pm 0.4$	$203.29 \pm 0.05$	14.29	13.0	6090.0	N	G16
HIP 116231	B	$9.04 \pm 0.02$	$(1.69 \pm 0.03) e - 2$	$874.6 \pm 0.8$	$254.70 \pm 0.05$	58.19	667.5	696.7	N	S10
TYC 5835 0469 1	B	$9.396 \pm 0.003$	$(1.097 \pm 0.003) e - 1$	$717.9 \pm 0.2$	$37.62 \pm 0.01$	23.56	17.1	1883.7	Y	–
HIP 116384	C	$8.732 \pm 0.001$	$(8.412 \pm 0.008) e - 2$	$842.90 \pm 0.07$	$346.614 \pm 0.004$	9.18	40.6	186.4	N	M03
HIP 50156	B	$8.17 \pm 0.03$	$(1.72 \pm 0.05) e - 1$	$77.3 \pm 0.8$	$338.7 \pm 0.2$	19.75	22.1	1195.7	N	B15
HIP 37918	B	$9.56 \pm 0.05$	$(4.9 \pm 0.2) e - 2$	$122 \pm 5$	$9.4 \pm 0.8$	46.55	17.3	1104.6	Y	–
Notes. G16 = Galicher et al. (2016), S10 = Schöller et al. (2010), M03 = Martín (2003), B15 = Bowler et al. (2015).

Equations74

ϕ = R^{- 1} \cdot A \cdot φ + ϕ_{obj} + O (φ^{3}),

ϕ = R^{- 1} \cdot A \cdot φ + ϕ_{obj} + O (φ^{3}),

θ

θ

= = 0 K \cdot R^{- 1} \cdot A \cdot φ + K \cdot ϕ_{obj} + O (φ^{3})

= θ_{obj} + \approx 0 O (φ^{3}),

f_{Z} = B_{Z} \cdot b + ϵ_{Z},

f_{Z} = B_{Z} \cdot b + ϵ_{Z},

b = B_{Z}^{+} \cdot f_{Z} = (B_{Z}^{*} \cdot B_{Z})^{- 1} \cdot B_{Z}^{*} \cdot f_{Z} .

b = B_{Z}^{+} \cdot f_{Z} = (B_{Z}^{*} \cdot B_{Z})^{- 1} \cdot B_{Z}^{*} \cdot f_{Z} .

B = K \cdot \frac{Im ( F )}{∣ F \cdot g \cdot w \cdot d ∣} .

B = K \cdot \frac{Im ( F )}{∣ F \cdot g \cdot w \cdot d ∣} .

Σ_{θ} = B \cdot Σ_{d} \cdot B^{T} .

Σ_{θ} = B \cdot Σ_{d} \cdot B^{T} .

\overset{ˉ}{Σ}_{θ} = (i \sum Σ_{θ, i}^{- 1})^{- 1},

\overset{ˉ}{Σ}_{θ} = (i \sum Σ_{θ, i}^{- 1})^{- 1},

\overset{ˉ}{θ} = \overset{ˉ}{Σ}_{θ} \cdot i \sum Σ_{θ, i}^{- 1} \cdot θ_{i} .

\overset{ˉ}{θ} = \overset{ˉ}{Σ}_{θ} \cdot i \sum Σ_{θ, i}^{- 1} \cdot θ_{i} .

\overset{ˉ}{θ} \to θ,

\overset{ˉ}{θ} \to θ,

\overset{ˉ}{Σ}_{θ} \to Σ_{θ} .

E_{RR, (i, j)} = θ_{cal, i}^{T} \cdot θ_{cal, j} .

E_{RR, (i, j)} = θ_{cal, i}^{T} \cdot θ_{cal, j} .

Z_{(n, k)} = \frac{1}{w _{k}} p \sum v_{k}^{p} \cdot θ_{cal, p}^{n},

Z_{(n, k)} = \frac{1}{w _{k}} p \sum v_{k}^{p} \cdot θ_{cal, p}^{n},

P = I - Z^{'} \cdot Z^{' T},

P = I - Z^{'} \cdot Z^{' T},

θ^{'} = P^{'} \cdot θ,

θ^{'} = P^{'} \cdot θ,

Σ_{θ}^{'} = P^{'} \cdot Σ_{θ} \cdot P^{' T} .

θ^{'} \to θ,

θ^{'} \to θ,

Σ_{θ}^{'} \to Σ_{θ} .

r_{bin} \cdot e^{i ϕ_{bin}} = 1 + c \cdot exp (- 2 π i \cdot (\frac{Δ _{RA} \cdot u}{λ} + \frac{Δ _{DEC} \cdot v}{λ})),

r_{bin} \cdot e^{i ϕ_{bin}} = 1 + c \cdot exp (- 2 π i \cdot (\frac{Δ _{RA} \cdot u}{λ} + \frac{Δ _{DEC} \cdot v}{λ})),

Δ_{RA} = - ρ \cdot sin (ϑ - ϑ_{0}),

Δ_{RA} = - ρ \cdot sin (ϑ - ϑ_{0}),

Δ_{DEC} = ρ \cdot cos (ϑ - ϑ_{0}),

χ_{bin}^{2} = (Θ - c \cdot Θ_{bin,ref})^{T} \cdot Σ_{Θ}^{- 1} \cdot (Θ - c \cdot Θ_{bin,ref}),

χ_{bin}^{2} = (Θ - c \cdot Θ_{bin,ref})^{T} \cdot Σ_{Θ}^{- 1} \cdot (Θ - c \cdot Θ_{bin,ref}),

Θ = θ_{1} θ_{2} ⋮, Σ_{Θ}^{- 1} = Σ_{θ, 1}^{- 1} 0 ⋮ 0 Σ_{θ, 2}^{- 1} ⋮ \dots \dots ⋱ .

Θ = θ_{1} θ_{2} ⋮, Σ_{Θ}^{- 1} = Σ_{θ, 1}^{- 1} 0 ⋮ 0 Σ_{θ, 2}^{- 1} ⋮ \dots \dots ⋱ .

θ_{bin,ref} = \frac{θ _{bin} ( c = c _{ref} )}{c _{ref}} .

θ_{bin,ref} = \frac{θ _{bin} ( c = c _{ref} )}{c _{ref}} .

ln L = - \frac{1}{2} χ_{bin}^{2} .

ln L = - \frac{1}{2} χ_{bin}^{2} .

\frac{\partial}{\partial c} ln L_{c_{fit}}

\frac{\partial}{\partial c} ln L_{c_{fit}}

\Rightarrow c_{fit}

σ_{c_{fit}} = \frac{1}{Θ _{bin,ref}^{T} \cdot Σ _{Θ}^{- 1} \cdot Θ _{bin,ref}} .

σ_{c_{fit}} = \frac{1}{Θ _{bin,ref}^{T} \cdot Σ _{Θ}^{- 1} \cdot Θ _{bin,ref}} .

SNR_{ph} = \frac{c _{fit}}{σ _{ph}} = \frac{c _{fit}}{σ _{c_{fit}} \cdot χ _{r,bin,min}^{2}},

SNR_{ph} = \frac{c _{fit}}{σ _{ph}} = \frac{c _{fit}}{σ _{c_{fit}} \cdot χ _{r,bin,min}^{2}},

ln L (p ∣ x)

ln L (p ∣ x)

= - \frac{1}{2} (Θ - Θ_{bin} (p))^{T} \cdot Σ_{Θ}^{- 1} \cdot (Θ - Θ_{bin} (p)),

H_{(i, j)}

H_{(i, j)}

\approx \frac{\partial Θ _{bin} ( p )}{\partial p _{i}} \cdot Σ_{Θ}^{- 1} \cdot \frac{\partial Θ _{bin} ( p )}{\partial p _{j}}

= - (J \cdot Σ_{Θ}^{- 1} \cdot J^{T})_{(i, j)},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Kernel phase imaging with VLT/NACO: high-contrast detection of new candidate low-mass stellar companions at the diffraction limit

Jens Kammerer,1 Michael J. Ireland,1 Frantz Martinache2 and Julien H. Girard3,4

1Research School of Astronomy & Astrophysics, Australian National University, ACT 2611, Australia

2Laboratoire Lagrange, Université Côte d’Azur, Observatoire de la Côte d’Azur, CNRS, Parc Valrose, Bât. H. FIZEAU, 06108 Nice, France

3Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA

4Université Grenoble Alpes, CNRS, IPAG, 38000 Grenoble, France E-mail: [email protected]

(Accepted XXX. Received YYY; in original form ZZZ)

Abstract

Directly imaging exoplanets is challenging because quasi-static phase aberrations in the pupil plane (speckles) can mimic the signal of a companion at small angular separations. Kernel phase, which is a generalization of closure phase (known from sparse aperture masking), is independent of pupil plane phase noise to second order and allows for a robust calibration of full pupil, extreme adaptive optics observations. We applied kernel phase combined with a principal component based calibration process to a suitable but not optimal, high cadence, pupil stabilized L’ band ( $3.8~{}\text{\textmu m}$ ) data set from the ESO archive. We detect eight low-mass companions, five of which were previously unknown, and two have angular separations of $\sim 0.8$ – $1.2~{}\lambda/D$ (i.e. $\sim 80$ – $110~{}\text{mas}$ ), demonstrating that kernel phase achieves a resolution below the classical diffraction limit of a telescope. While we reach a $5\sigma$ contrast limit of $\sim 1/100$ at such angular separations, we demonstrate that an optimized observing strategy with more diversity of PSF references (e.g. star-hopping sequences) would have led to a better calibration and even better performance. As such, kernel phase is a promising technique for achieving the best possible resolution with future space-based telescopes (e.g. JWST), which are limited by the mirror size rather than atmospheric turbulence, and with a dedicated calibration process also for extreme adaptive optics facilities from the ground.

keywords:

planets and satellites: detection – planets and satellites: formation – techniques: high angular resolution – techniques: image processing – techniques: interferometric – binaries: close

††pubyear: 2018††pagerange: Kernel phase imaging with VLT/NACO: high-contrast detection of new candidate low-mass stellar companions at the diffraction limit–B

1 Introduction

Direct imaging is vital for studying the outer regions of extrasolar systems which are inaccessible to transit observations and can only be revealed by decades-long, time consuming radial velocity surveys (e.g. Fischer et al., 2014). It has proven particularly successful in probing our understanding of the formation of gas giant planets (e.g. D’Angelo et al., 2010), being able to estimate their mass from their luminosity and age (e.g. Spiegel & Burrows, 2012) and resolve their orbit. Although the majority of detected companion candidates are arguably consistent with being emission or scattering from disk material (e.g. LkCa 15, Kraus & Ireland 2012, HD 100546, Quanz et al. 2013, HD 169142, Biller et al. 2014), the recent example of PDS 70 (Keppler et al., 2018) demonstrates that direct imaging of wide-separation but still solar-system scale planets is possible at relatively moderate contrasts in the vicinity of young stars. This is spurring an ongoing discussion about the nature of planet formation and the commonness of gas giant planets with large orbital distances (e.g. Bowler & Nielsen, 2018).

However, direct imaging operates at the resolution and sensitivity limit of the most powerful instruments today (e.g. Pepe et al., 2014), placing demanding requirements on the observing and the post-processing techniques which are used to uncover faint companions at high contrasts (e.g. angular differential imaging, Marois et al. 2006, point spread function subtraction, Lafrenière et al. 2007a, principal component analysis, Amara & Quanz 2012, Soummer et al. 2012). Detecting exoplanets from the ground using these techniques has only been made possible by the recent development of extreme adaptive optics systems (e.g. Milli et al., 2016) and is mainly limited by non-common path aberrations which are not sensed by the wavefront control system (e.g. Sauvage et al., 2007). These aberrations manifest themselves as quasi-static speckles on the detector images which can mimic the signal of a companion and place a strong constraint on the achievable contrast at small angular separations (e.g. Fitzgerald & Graham, 2006). Hence, directly imaging and studying the formation of gas giant planets on solar-system scales has been extremely challenging so far (e.g. Bowler, 2016) because the nearest star forming regions lie $\gtrsim 100~{}\text{pc}$ away (e.g. Loinard et al., 2007) where such orbital distances correspond to angular separations of only $\lesssim 200~{}\text{mas}$ .

In this paper, we explore the capabilities of the kernel phase technique (Martinache, 2010) for high-contrast imaging at the diffraction limit from the ground. This post-processing technique can be seen as refinement of sparse aperture masking and the closure phase technique (Tuthill et al., 2000). By probing only certain linear combinations of the phase of the Fourier transformed detector images, kernel phase and sparse aperture masking allow for a robust calibration of the time-varying optical transfer function of the system and a significant mitigation of quasi-static speckles and achieve an angular resolution of $\lesssim 50~{}\text{mas}$ in the near-infrared (i.e. the L’ band, Cheetham et al., 2016). This gives access to objects on solar-system scales in the nearest star forming regions (i.e. projected separations of $\sim 40~{}\text{mas}$ for a Jupiter analog in the Scorpius Centaurus OB association, Preibisch & Mamajek 2008) and has proven successful in directly imaging young exoplanets/disk features (e.g. Kraus & Ireland, 2012). The caveat of sparse aperture masking is that the mask blocks $\gtrsim 85~{}\%$ of the light (for VLT/NACO, Tuthill et al., 2010) and therefore significantly decreases the sensitivity and hence the contrast limit of the observations for relatively faint targets. However, kernel phase uses the light collected by the entire pupil and should perform better in the high Strehl regime and the bright limit (e.g. Pope et al., 2016; Sallum & Skemer, 2019).

For sparse aperture masking, a mask is placed at the Lyot stop of an instrument in order to split the primary mirror into a discrete interferometric array of real sub-apertures (e.g. Readhead et al., 1988). In the Fourier transform of the detector image (hereafter referred to as Fourier plane), these sub-apertures map onto their auto-correlation (i.e. their spatial frequencies, Ireland 2016). The phase $\phi$ of each spatial frequency can be extracted and linearly combined in a way such that the resulting closure phase $\theta=\bm{K}\cdot\phi$ is independent of the pupil plane (or instrumental) phase $\varphi$ to second order (i.e. terms of first and second order in $\varphi$ are vanishing), where the matrix $\bm{K}$ encodes this special linear combination (e.g. Ireland, 2016). For observations from the ground, the pupil plane phase $\varphi$ is affected by noise from atmospheric seeing and non-common path aberrations which ultimately cause quasi-static speckles. Being more robust with respect to these systematic effects, sparse aperture masking achieves a superior angular resolution.

For full pupil kernel phase imaging, there is no mask and the entire primary mirror is discretized into an interferometric array of virtual sub-apertures. According to Martinache (2010), it is then convenient to define a transfer matrix $\bm{A}$ which maps the baselines between each pair of virtual sub-apertures onto their corresponding spatial frequency. In the high Strehl regime, where the pupil plane phase $\varphi$ can be linearized, we obtain the relationship

[TABLE]

where $\bm{R}$ is a diagonal matrix encoding the redundancy of the spatial frequencies (i.e. the baselines of the interferometric array) and $\phi_{\text{obj}}$ is the phase intrinsic to the observed object. Multiplication with the left kernel $\bm{K}$ of $\bm{R}^{-1}\cdot\bm{A}$ yields

[TABLE]

hence the kernel $\theta$ of the measured Fourier plane phase $\phi$ directly represents the kernel $\theta_{\text{obj}}$ of the phase intrinsic to the observed object $\phi_{\text{obj}}$ , at least in the high Strehl regime (where $\mathcal{O}(\varphi^{3})$ is negligible). This is why frame selection based on the Strehl ratio is essential. Note that the kernel phase is a generalization of the closure phase to the case of redundant apertures.

For observations from space, which do not suffer from atmospheric seeing, kernel phase has proven to be successful in resolving close companions at the diffraction limit (Martinache, 2010; Pope et al., 2013). It is our goal to determine if, under good observing conditions, kernel phase also is a competitive alternative to sparse aperture masking from the ground.

2 Methods

2.1 Data reduction

A basic direct imaging data reduction (such as dark, flat, background subtraction and bad pixel correction) is also essential for the kernel phase technique (e.g. Sallum & Eisner, 2017). For this purpose, we developed our own data reduction pipeline111https://github.com/kammerje/PyConica which can be fed the raw data with their associated raw calibrators from the VLT/NACO archive222http://archive.eso.org/wdb/wdb/eso/naco/form. Our data reduction pipeline performs the following steps which are described in more detail in the following sections:

Linearize the raw frames. 2. 2.

Compute master darks and their bad pixel maps. 3. 3.

Compute master flats and their bad pixel maps. 4. 4.

Flag saturated pixels. 5. 5.

Apply dark, flat, background and bad pixel corrections. 6. 6.

Perform a dither subtraction. 7. 7.

Reconstruct saturated pixels. 8. 8.

Select frames with sufficient Strehl ratio.

2.1.1 Detector linearization correction

Like most photon counting devices, NACO’s infrared detector CONICA suffers from a non-linear response when the pixel counts approach the saturation threshold (16400 counts for uncorrelated high well depth mode333This is the standard imaging mode in the L’ band ( $3.8~{}\text{\textmu m}$ ) and all data cubes which we analyze have been taken in this mode. according to the NACO Quality Control and Data Processing444https://www.eso.org/observing/dfo/quality/NACO/qc/detmon_qc1.html, with a more conservative 16000 counts used in our analysis). As kernel phase is an interferometric technique for which the fringes are coded spatially on the detector, it is very important to characterize the pixel to pixel response. Moreover, many of the data cubes which we analyze in Section 3 feature saturated point spread functions (PSFs) which we want to correct for non-linearity before reconstructing their core (cf. Section 2.1.6).

In order to compute the detector linearization correction we download all frames of type “FLAT, LAMP, DETCHECK” and uncorrelated high well depth mode from 2016 March 23 and 2016 September 24 (which are closest in time to the observation of the earliest and the latest data cube which we analyze) from the VLT/NACO archive. We sort them by integration time and compute the median pixel count over all frames for each individual integration time (masking out the broken stripes in the lower left quadrant of CONICA). Then, we plot the median pixel count in dependence of the integration time $t$ , fit a linear polynomial $f(t)$ to all data points with less than 8500 counts (end of the linear regime for uncorrelated high well depth mode) and a cubic polynomial $g(t)$ to all data points with less than 16000 counts (saturation threshold, cf. left panel of Figure 1). We linearize the detector using a continuously differentiable piecewise polynomial approach $h$ to the correction curve $f(g)$ with a linear function up to 8500 counts and a cubic polynomial between 8500 and 16000 counts (cf. right panel of Figure 1).

2.1.2 Master darks and master flats

For each observation block (OB) we compute master darks from the associated dark frames as the median of all dark frames with a unique set of size and exposure time. Then we compute a bad pixel map for each master dark based on the frame by frame median and variance of each pixel’s count. Therefore, we first compute two frames:

The absolute difference between the master dark and the median filtered master dark. 2. 2.

The absolute of the median subtracted variance dark.

Then, we identify bad pixels in each of these frames based on their difference to the median of these frames. For frame (i) we classify pixels which are above 10 times the median as bad, for frame (ii) pixels which are above 75 times the median. Note that these thresholds were identified empirically. From the median subtracted dark frames, we estimate the readout noise as the mean over each frame’s pixel count standard deviation.

We proceed similar for the flat frames, but also group them by filter as well as size and exposure time, subtract a master dark with matching properties (i.e. similar size and exposure time) from each master flat and normalize it by its median pixel count.

2.1.3 Saturated pixels

The data cubes which we analyze in Section 3 consist of 100 frames of $0.2~{}\text{s}$ exposure. For each data cube, we reject the first frame (which we find to consistently suffer from a bias), so that there are 99 frames left. Note that NACO appends the median of all 100 frames at the end of each data cube which is also rejected here. Before proceeding, we also flag the saturated pixels in each frame which are all pixels with more than $h(16000)$ counts.

2.1.4 Dark, flat, background and bad pixel correction

We clean each frame of a data cube individually by subtracting a master dark with matching properties (i.e. similar size and exposure time), dividing it by a master flat with matching properties (i.e. similar size, exposure time and filter), correcting bad pixels (which are bad pixels from the master dark or the master flat) with a median filter of size five pixels and performing a simple background subtraction by subtracting the median pixel count of the frame from each pixel. A typical result is shown in the left panel of Figure 2, where residual systematic noise (mainly from the detector) is still clearly visible.

2.1.5 Dither subtraction

In order to mitigate the residual systematic noise from the detector and the sky background we perform a dither subtraction. After cleaning all data cubes within one OB, we find for each data cube (which we will here call data cube A) the data cube B with the target furthest away (on the detector) and subtract its median frame from each frame of data cube A. The new bad and saturated pixel maps are then the logical sums of those from both involved data cubes. After this step the residual noise appears like Gaussian random noise as is shown in the right panel of Figure 2.

Our typical performance is a pixel count standard deviation of $\sim 36=4.4+(158/\text{s}\cdot 0.2~{}\text{s})$ outside of $10~{}\lambda/D$ from the center of the PSF in $0.2~{}\text{s}$ exposure, where $4.4$ is the detector readout noise, $\lambda$ is the observing wavelength ( $3.8~{}\text{\textmu m}$ for the L’ band) and $D$ is the diameter of the primary mirror ( $8.2~{}\text{m}$ for the VLT).

2.1.6 Reconstruction of saturated pixels

Our reconstruction of saturated pixels is based on an algorithm described in Section 2.5 of Ireland (2013). This technique also identifies and corrects residual bad pixels, with no more than 10 additional bad pixels corrected in a typical frame. First, we crop all frames to a size of 96 by 96 pixels ( $\sim 2.6~{}\text{arcsec}^{2}$ ) centered on the target. Then, we correct bad and saturated pixels for each frame separately by minimizing the Fourier plane power $\left|f_{Z}\right|$ outside the region of support $Z$ permitted by the pupil geometry. Let $\bm{B}_{Z}$ be the matrix which maps the bad and saturated pixel values $x$ onto the Fourier plane domain $Z$ , then

[TABLE]

where $b$ are the corrections to the bad and saturated pixel values $x$ (i.e. the corrected pixel values are $x-b$ ) and $\epsilon_{Z}$ is remaining Fourier plane noise. We solve for $b$ using the Moore-Penrose pseudo-inverse of $\bm{B}_{Z}$ , i.e.

[TABLE]

Since a broad-band filter was used for the observations, but we use a monochromatic central filter wavelength in our analysis and also blur the edge of the pupil through the use of a windowing function, we use a sightly larger pupil diameter to define this region $Z$ of $10~{}\text{m}$ here. In fact, the only important thing for recovering the Fourier plane phase is that the Fourier plane power outside the region of support permitted by the pupil geometry is minimized, so using a larger pupil diameter just assures this in case of low quality data and is a conservative choice, especially in the case of our data which is far from the Nyquist sampling criterion.

Sometimes, the remaining Fourier plane noise $\epsilon_{Z}$ can be significant, which is why we repeat the entire correction process up to 15 times for each frame. After each iteration, we look for remaining bad pixels by:

Computing the Fourier transform of the corrected frame from the previous iteration. 2. 2.

Windowing this frame by the Fourier domain $Z$ . 3. 3.

Computing the inverse Fourier transform of this frame. 4. 4.

Identifying remaining bad pixels in this frame based on their difference to the median filtered frame.

If no remaining bad pixels are identified, we terminate the iteration.

A cross-section of a saturated PSF before and after performing the reconstruction is shown in Figure 3. Obviously, this reconstruction cannot reveal any structure or companions hidden behind saturated pixels, but it allows us to perform our kernel phase analysis on saturated data cubes which would otherwise suffer from high Fourier plane phase noise (cf. Figure 4). Please note that a method from the class of least squares spectral analysis techniques (i.e. image plane fringe fitting) may be more robust in dealing with bad pixels, but would require the simultaneous fitting of all Fourier plane phases and amplitudes and is therefore beyond the scope of this paper, although it is a promising approach for future work.

2.1.7 Frame selection

As explained in the Introduction, a high Strehl ratio is essential for the kernel phase technique in order for the mathematical framework (i.e. the linearization of the Fourier plane phase, cf. Equation 1) to be valid. Therefore, we select frames with sufficient Strehl ratio based on their peak pixel count. For each data cube, we first compute the median peak count of the 10% best frames. Then, we reject all frames with a peak count below 75% of this value. Using this dynamic threshold is better than simply rejecting a fixed fraction of the frames (e.g. Law et al., 2006) because it can correctly account for a sudden drop in the Strehl ratio like shown in Figure 5. Note that we consider the peak pixel count after performing the PSF reconstruction (cf Section 2.1.6) here.

2.2 Kernel phase extraction

2.2.1 VLT pupil model

In order to extract the kernel phase from VLT/NACO data we first need to construct a model for the VLT pupil (i.e. split the primary mirror into an interferometric array of virtual sub-apertures). We sample 140 virtual sub-apertures on a square grid with a pupil plane spacing of $0.6~{}\text{m}$ , which is approximately half the Nyquist sampling of $\lambda/\alpha\approx 0.3~{}\text{m}$ , where $\lambda=3.8~{}\text{\textmu m}$ is the observing wavelength and $\alpha=2.610~{}\text{arcsec}$ is the image size (96 pixels). Our VLT pupil model is shown in the left panel of Figure 6 and based on an $8.2~{}\text{m}$ primary mirror with a $1.2~{}\text{m}$ central obscuration. Another advantage of kernel phase over sparse aperture masking is the dense Fourier plane coverage which is shown in the right panel of Figure 6.

2.2.2 XARA

The extraction of the Fourier plane phase and the computation of the kernel phase relies on a python package called XARA555https://github.com/fmartinache/xara (eXtreme Angular Resolution Astronomy, Martinache, 2010, 2013). XARA has been designed to process data produced by multiple instruments assuming that the images comply to the kernel phase analysis requirements of proper sampling, high-Strehl (boosted by our frame selection procedure described in Section 2.1.7), and non-saturation (restored by the procedure described in Section 2.1.6). The discrete achromatic representation of the VLT aperture (i.e. our pupil model) is used by XARA to compute the phase transfer matrix $\bm{A}$ and the associated left kernel operator $\bm{K}$ via a singular value decomposition of $\bm{A}$ .

With the added knowledge of the detector pixel scale and the observing wavelength, the discrete model is scaled so that the Fourier plane phase at the expected $(u,v)$ coordinates can be extracted by a discrete Fourier transform. For the small aberration hypothesis to remain valid, the data must be properly centered prior to the Fourier transform. Failure to do so will leave a residual Fourier plane phase ramp that can wrap and lead to meaningless kernel phases (cf. left panels of Figure 7). XARA offers several centering algorithms. It is crucial to carefully choose from the available options depending on the requirements coming from the data. For our extensive ground-based data set for example, we find that minimizing directly the Fourier plane phase which is extracted by XARA is most robust and the offered sub-pixel re-centering is very valuable (cf. right panels of Figure 7) due to an increased level of pupil plane phase noise from the atmosphere and the bright background (if compared to space-borne data).

Moreover, virtual baselines near the outer edge of the Fourier coverage suffer from low power as they are only supported by very few baselines, i.e. have small redundancy. The phase measured for these baselines is systematically noisier and needs to be excluded from the model to prevent the noise to propagate into the estimation of all kernel phases. This can be achieved using the baseline filtering option implemented in XARA. In our case, baselines of length greater than $7.0~{}\text{m}$ and the corresponding rows of $\bm{A}$ are eliminated prior to the computation of $\bm{K}$ . Some of the theoretically available kernel phases are lost but the remaining kernel phases can nevertheless be used just like for the complete model.

Finally, to limit the impact of readout noise in regions of the image where little signal is present, frames are windowed by a super-Gaussian ( $g(r)=\exp{-(r/r_{0})^{4}}$ ) with a radius $r_{0}=25$ pixels, effectively limiting our field of view to $\sim 1000~{}\text{mas}$ . Note that Section 3.4 will further comment on the effect of this window and how it can affect contrast estimates for detections at large separations.

2.2.3 Kernel phase uncertainties

For estimating the uncertainties, we compute the kernel phase covariance $\bm{\Sigma}_{\theta}$ for each frame $\bm{d}$ from its photon count variance $\bm{\Sigma}_{d}=g\cdot\bm{w}^{2}\cdot(\bm{d}+\bm{b})$ in units of (photo-electrons)2, where $g$ is the detector gain ( $g=9.8$ for uncorrelated high well depth mode), $\bm{w}$ is the super-Gaussian window, $\bm{d}$ is the cleaned and re-centered frame and $\bm{b}$ is its background (from the simple background subtraction, cf. Section 2.1.4). Therefore, we first need to find a linear operator $\bm{B}$ which maps each frame $g\cdot\bm{w}\cdot\bm{d}$ in units of photo-electrons to its kernel phase $\theta$ . The linear discrete Fourier transform $\bm{F}$ and the kernel $\bm{K}$ of the pupil model $\bm{R}^{-1}\cdot\bm{A}$ are already linear operators, and the Fourier plane phase $\phi(z)$ (of a complex number $z$ ) can be approximated as $\mathrm{Im}(z)/|z|$ for small angles. Hence, we compute

[TABLE]

Note that $\bm{B}\cdot{g}\cdot\bm{w}\cdot\bm{d}$ would be a small-angle approximation for the kernel phase. Then, we obtain an estimate for the kernel phase covariance by propagating the photon count variance according to

[TABLE]

Now, we have a kernel phase $\theta$ and a kernel phase covariance $\bm{\Sigma}_{\theta}$ for each frame. In order to save computation time for the model fitting (cf. Section 2.4) we compute a weighted mean $\bar{\theta}$ of the kernel phase for each data cube. Therefore, we first compute the average kernel phase covariance $\bar{\bm{\Sigma}}_{\theta}$ over all frames $\bm{d}_{i}$ of a data cube via

[TABLE]

and then the weighted mean $\bar{\theta}$ of the kernel phase (cf. Figure 8) via

[TABLE]

For the rest of this paper, we omit the bar for better readability, i.e.

[TABLE]

Note that this kernel phase covariance model includes the contribution of shot noise only. Any residual calibration errors not taken into account in the following section are therefore expected to increase the reduced $\chi^{2}$ of any model fitting, potentially to much more than 1.0 in the case of high signal-to-noise data with highly imperfect calibration.

2.3 Kernel phase calibration

Under perfect conditions the closure phase of a point-symmetric source, such as an unresolved star, is zero (e.g. Monnier, 2007). The same holds for the kernel phase, which is a generalization of the closure phase (e.g. Ireland, 2016). Practically however, one is limited by systematic errors caused by third order phase residuals (e.g. Ireland, 2013) and even point-symmetric sources have non-zero kernel phase.

For this reason, calibration is of fundamental importance when analyzing interferometric measurables (like closure or kernel phase). The systematic errors are expected to be quasi-static (e.g. Ireland, 2013), i.e. slowly varying with time, so that the kernel phase of a well-known point source measured close in time to that of the science target can serve as a calibrator. The simplest calibration technique would be to subtract the kernel phase of a well-known point source from that of the science target. This technique was for example used successfully in Martinache (2010), but here we want to go beyond this approach.

We use principal component analysis in the framework of a Karhunen-Loève decomposition (Soummer et al., 2012; Pueyo, 2016) in order to subtract the statistically most significant phase residuals of the calibrator kernel phase from that of the science target. Note that the following technique is new, but very similar to the POISE observables in Ireland (2013). We start by computing the covariance matrix $\bm{E}_{\text{RR}}$ of the kernel phase $\theta_{\text{cal},i}$ of all calibrator data cubes $i$ via

[TABLE]

Then, we compute an eigendecomposition of this matrix in order to obtain its sorted (in descending order) eigenvalues $w_{k}$ and eigenvectors $v_{k}$ . Finally, we compute the Karhunen-Loève transform $\bm{Z}$ of shape (number of kernel phases, number of calibrator data cubes) via

[TABLE]

where $v_{k}^{p}$ is the p-th component of the k-th eigenvector of $\bm{E}_{\text{RR}}$ and $\theta_{\text{cal},p}^{n}$ is the n-th kernel phase of the p-th calibrator data cube.

From the Karhunen-Loève transform $\bm{Z}$ we obtain a projection matrix $\bm{P}$ via

[TABLE]

where $\bm{I}$ is the identity matrix and $\bm{Z}^{\prime}$ is obtained from the first $K_{\text{klip}}$ columns of $\bm{Z}$ . $K_{\text{klip}}$ is an integer representing the order of the correction, i.e. how many eigencomponents of the calibrator kernel phase should be corrected for. The projection matrix $\bm{P}$ is of shape (number of kernel phases, number of kernel phases), but it has $K_{\text{klip}}$ zero eigenvalues by construction. In order to properly represent the dimensions we compute another eigendecomposition of $\bm{P}$ and obtain a new projection matrix $\bm{P}^{\prime}$ , whose columns are those eigenvectors of $\bm{P}$ which correspond to non-zero eigenvalues. The projection matrix $\bm{P}^{\prime}$ is of shape (number of “good” kernel phases, number of kernel phases), where “good” means statistically independent of systematic errors, and can be used to project the measured kernel phase $\theta$ and its covariance $\bm{\Sigma}_{\theta}$ into a sub-space of dimension (number of “good” kernel phases), which is more robust with respect to quasi-static errors, via

[TABLE]

For the rest of this paper, we omit the prime for better readability, i.e.

[TABLE]

2.4 Model fitting

From Equations 2–4 it becomes clear that the measured kernel phase $\theta$ directly represents the kernel phase intrinsic to the observed object $\theta_{\text{obj}}$ . Hence, we can infer information about the spatial structure of the observed object by fitting models for $\theta_{\text{obj}}=\bm{K}\cdot\phi_{\text{obj}}$ to $\theta$ .

2.4.1 Binary model

In order to search for companion candidates we use the binary model

[TABLE]

where $c$ is the contrast ratio between secondary and primary, $u$ and $v$ are the coordinates of the sampled Fourier plane positions (i.e. the spatial frequencies of the pupil model), $\lambda$ is the observing wavelength and

[TABLE]

where $\rho$ is the angular separation between primary and secondary, $\vartheta$ is the position angle of the secondary with respect to the primary and $\vartheta_{0}$ is the detector position angle during the observation. Figure 8 shows the best fit binary model for the measured kernel phase of TYC 6849 1795 1 (resolved and bright binary).

2.4.2 Uncertainties from photon noise

Using the kernel phase covariance $\bm{\Sigma}_{\theta}$ estimated from photon noise according to Section 2.2.3 we compute the best fit contrast $c_{\text{fit}}$ and its uncertainty $\sigma_{c_{\text{fit}}}$ for the binary model $\theta_{\text{bin}}=\bm{K}\cdot\phi_{\text{bin}}$ on each position of a discrete $500\times 500~{}\text{mas}$ square grid with spacing $13.595~{}\text{mas}$ (which is half the detector pixel scale of CONICA). In some cases, where we suspect a companion candidate at a larger angular separation, we also extend the grid to $1000\times 1000~{}\text{mas}$ .

In the high-contrast regime (where $c\ll 1$ ), the phase $\phi_{\text{bin}}$ is approximately proportional to the contrast $c$ of the binary model, so is its kernel phase $\theta_{\text{bin}}$ (because $\bm{K}$ is a linear operator). Hence, the $\chi^{2}$ of the binary model $\chi_{\text{bin}}^{2}$ can be approximated as

[TABLE]

where $\Theta$ and $\Theta_{\text{bin,ref}}$ are vertical stacks of the kernel phase $\theta_{i}$ and the reference binary model $\theta_{\text{bin,ref},i}$ of each data cube $i$ and $\bm{\Sigma}_{\Theta}^{-1}$ is a block-diagonal matrix whose diagonal elements are the inverse kernel phase covariances $\bm{\Sigma}_{\theta,i}^{-1}$ of each data cube $i$ , i.e.

[TABLE]

The reference binary model $\theta_{\text{bin,ref}}$ is the binary model $\theta_{\text{bin}}$ evaluated for and normalized by a reference contrast $c_{\text{ref}}=0.001$ , i.e.

[TABLE]

Finally, we obtain the log-likelihood $\ln{L}$ for the binary model $\theta_{\text{bin}}$ as

[TABLE]

The best fit contrast $c_{\text{fit}}$ for the binary model $\theta_{\text{bin}}$ is then obtained by maximizing $\ln{L}$ for each grid position, i.e.

[TABLE]

and its uncertainty is the square root of its variance, i.e.

[TABLE]

Finally, the detection significance based on photon noise $\text{SNR}_{\text{ph}}$ is computed for each grid position as

[TABLE]

where we scale the uncertainty of the best fit contrast $\sigma_{c_{\text{fit}}}$ by the square root of the minimal reduced $\chi^{2}$ of the binary model of the entire grid ( $\chi_{\text{r,bin,min}}^{2}$ ). Assuming that kernel phase is proportional to contrast, this is equivalent to scaling the kernel phase covariance $\bm{\Sigma}_{\theta}$ so that the minimal reduced $\chi^{2}$ is 1.0. This step is necessary because kernel phase is still affected by third (or higher) order pupil plane phase noise (cf. Equations 2–4), so that the uncertainties from photon noise $\sigma_{c_{\text{fit}}}$ significantly underestimate the true errors. Note that there can be various sources of higher order phase noise (e.g. Ireland, 2013), but studying those in detail is beyond the scope of this paper.

The final parameters $p_{\text{fit}}$ for the best fit binary model are then obtained from a least squares search which maximizes the log-likelihood $\ln{L}$ of the binary model under varying angular separation, position angle and contrast simultaneously. For the least squares search, we use the grid position with the maximal log-likelihood as prior and restrict the search box for the angular separation $\rho$ to $50~{}\text{mas}\leq\rho\leq 1000~{}\text{mas}$ .

The uncertainties of the best fit parameters $\sigma_{p_{\text{fit}}}$ follow from the likelihood function $L$ for Gaussian errors (which are applicable to high confidence detections)

[TABLE]

where $p$ represents the three-dimensional parameter space of angular separation, position angle and contrast. Differentiating twice and neglecting terms containing second order derivatives of a single parameter yields

[TABLE]

where $\bm{J}$ and $\bm{H}$ are the Jacobian and the Hessian matrix of the binary model $\Theta_{\text{bin}}$ . Hence, the covariance matrix of the model parameters $\bm{\Sigma}_{p}$ can be obtained via

[TABLE]

and the uncertainties of the model parameters for the best fit binary model $\sigma_{p_{\text{fit}}}$ are

[TABLE]

We also compute the correlation of the best fit model parameters as

[TABLE]

where $\frac{\cdot}{\cdot}$ denotes element-wise division.

2.4.3 Empirical uncertainties

Using only the uncertainties from photon noise, it is still difficult to distinguish between residual speckle noise (i.e. third order phase noise in the pupil plane) and real detections at small angular separations. This is the case because the data set which we analyze in Section 3 is very limited in terms of diversity of calibrator PSFs. For this reason, we use an empirical approach as the primary method to determine whether a detection is real or not.

First, we split our targets into candidate detections and calibrators based on their detection significance from photon noise $\text{SNR}_{\text{ph}}$ (cf. Section 3.2.2). For each of the calibrators, we then compute two contrast curves:

The azimuthal average $c_{\text{RMS}}$ of the root mean square (RMS) best fit contrast $\bm{c}_{\text{fit}}$ . 2. 2.

The azimuthal average $c_{\text{RMS}}^{\text{sub}}$ of the RMS best fit contrast $\bm{c}_{\text{fit}}^{\text{sub}}$ after subtracting the best fit binary model $\theta_{\text{bin}}$ from the measured kernel phase $\theta$ .

Here, the assumption is that the calibrators are single stars, so that the ratio of the two RMS contrast curves computed above, i.e.

[TABLE]

is a correction factor for the relative contrast of the residual speckle noise. This is illustrated in the left panel of Figure 9.

For each of the candidate detections, we only compute the azimuthal average $c_{\text{RMS}}^{\text{sub}}$ of the RMS best fit contrast $\bm{c}_{\text{fit}}^{\text{sub}}$ after subtracting the best fit binary model $\theta_{\text{bin}}$ (which might or might not be a real detection) from the measured kernel phase $\theta$ . Then, we multiply this RMS contrast curve with the mean of the relative speckle contrast $f_{\text{speckle}}$ of all calibrators, i.e.

[TABLE]

where the bar denotes the mean, in order to obtain an empirical contrast uncertainty $\sigma_{\text{emp}}$ as a function of the angular separation $\rho$ for each candidate detection (cf. right panel of Figure 9). We classify a candidate detection as real if its empirical detection significance $\text{SNR}_{\text{emp}}$ is above the $5\sigma$ threshold, i.e.

[TABLE]

Furthermore, we obtain empirically motivated uncertainties on the best fit parameters $p_{\text{fit}}$ by multiplying the uncertainties from photon noise $\sigma_{p_{\text{fit}}}$ with the ratio $f_{\text{err}}$ of the empirical contrast uncertainty $\sigma_{\text{emp}}$ to the contrast uncertainty from photon noise $\sigma_{\text{ph}}$ (at the position of the best fit binary model $\theta_{\text{bin}}$ ).

The kernel phase analysis tools described in Sections 2.2, 2.3 and 2.4 are available on GitHub666https://github.com/kammerje/PyKernel. We put a strong focus on applicability to other instruments and an exchangeable kernel phase fits file format.

3 Results and discussion

3.1 Target list

We test our methods on an archival data set because the kernel phase technique is optimized for detecting companions at much smaller angular separations to their host star than conventional high-contrast imaging techniques (such as ADI and reference star differential imaging, i.e. RDI). Hence, the parameter space that we are looking at is still unexplored. We search the VLT/NACO archive for L’ band RDI surveys and decide to analyze program 097.C-0972(A), PI J. Girard, due to a large number of observed targets and therefore potential calibrators. A target list together with our detections is reported in Table 1.

3.2 Detected companion candidates

Before we search the targets in Table 1 for close companion candidates, we perform a basic vetting procedure by visually inspecting the cleaned data for wide companion candidates (cf. Section 3.2.1). In the field of view, which is limited to $\sim 1~{}\text{arcsec}$ due to the windowing, we find six wide companion candidates (cf. upper section of Table 2). Three of them are already known and we classify our detections as confirmed, whereas the other three have not been reported before and therefore are new detections. Note that we correct the contrast of the wide companion candidates for the windowing (cf. Section 3.4).

After detecting and subtracting off the signal induced by the wide companion candidates, we use the kernel phase technique in order to search for closer and fainter objects (cf. Section 3.2.2). We find two companion candidates with an empirical detection significance above the $5\sigma$ threshold, i.e. $\text{SNR}_{\text{emp}}^{\text{can}}>5$ (cf. lower section of Table 2). One of them is already known and we classify our detection as confirmed, whereas the other one has not been reported before and therefore is a new detection. For HIP 13008 we note that the empirical detection significance is $9.4\sigma$ when using only HIP 116384 as calibrator, but only $1.9\sigma$ when using HIP 12925 due to high residuals and a very large $f_{\text{err}}$ correction. Therefore, HIP 12925 seems to be a bad calibrator and we do not report any best fit parameters for HIP 13008 due to a lack of credibility. Follow-up observations are required to confirm the true nature of this object. Also note that OBs 6–11 contain only one or two targets and are not analyzed with the kernel phase technique because the diversity of kernel phase amongst calibrators is essential for our empirical detection method. As there are systematic differences between the individual nights in the measured kernel phase, for this paper we are only analyzing OBs which contain at least two PSF calibrators (observed in the same night). Although this choice was made for simplicity and it might be possible to calibrate targets over longer timescales, this adds significant additional complexity which is beyond the scope of this paper.

From the targets for which we detect neither a wide nor a close companion candidate, we compute a contrast curve (i.e. the detection limit as a function of the angular separation) for the kernel phase technique (cf. Section 3.3).

3.2.1 Wide companion candidates

The wide companion candidates reported in the upper section of Table 2 are all detected by visually inspecting the cleaned data. When we find a companion candidate, we use a grid search followed by a least squares search in order to find its best fit binary model $\theta_{\text{bin}}$ for the measured kernel phase $\theta$ . Then, we compute the empirical detection significance $\text{SNR}_{\text{emp}}^{\text{vis}}$ for the best fit binary model $\theta_{\text{bin}}$ (cf. right panels of Figures 10 and 11). This is achieved using a simplification of the empirical detection method (cf. Section 2.4.3). Since the wide companion candidates all have a sufficiently large angular separation (i.e. $\gtrsim 200~{}\text{mas}$ ) and are sufficiently bright (otherwise we could not detect them by eye), we can skip the use of any calibrators and compute the empirical detection significance $\text{SNR}_{\text{emp}}^{\text{vis}}$ as the best fit contrast $\bm{c}_{\text{fit}}$ divided by the azimuthal average $c_{\text{RMS}}^{\text{sub}}$ of the RMS best fit contrast $\bm{c}_{\text{fit}}^{\text{sub}}$ after subtracting the best fit binary model $\theta_{\text{bin}}$ from the measured kernel phase $\theta$ . Note that we do not use any Karhunen-Loève calibration for this step either, i.e. $\theta^{\prime}=\theta$ (cf. Section 2.3).

Before we search for closer and fainter objects, we subtract the signal induced by the wide companion candidates from the measured kernel phase, i.e.

[TABLE]

so that the measured kernel phase of all targets is free of wide detections. The detected wide companion candidates are shown in the left panels of Figures 10 and 11 and are described in more detail in the following paragraphs.

HIP 36985 B, TYC 7401 2446 1 B, TYC 5835 0469 1 B. These objects are new companion candidates which were not reported before. They have L’ band contrasts of $2.619\pm 0.005^{\text{mag}}$ , $1.318\pm 0.004^{\text{mag}}$ and $2.399\pm 0.003^{\text{mag}}$ respectively, and therefore are candidates for stellar mass companions.

TYC 6849 1795 1 B. This object was already detected in 2005 by Galicher et al. (2016) at an angular separation of $\sim 220~{}\text{mas}$ , a position angle of $\sim 201~{}\text{deg}$ and a H band contrast of $\sim 1.6^{\text{mag}}$ . We find a L’ band contrast of $1.450\pm 0.004^{\text{mag}}$ and an angular separation ( $223.5\pm 0.4~{}\text{mas}$ ) and a position angle ( $203.29\pm 0.05~{}\text{deg}$ ) which are in agreement with Galicher et al. (2016), i.e. we can confirm the bound nature of the object.

HIP 116231 B. This object was already detected in 2004 by Schöller et al. (2010) at an angular separation of $641\pm 4~{}\text{mas}$ , a position angle of $240.2\pm 0.6~{}\text{deg}$ and a K band contrast of $2.75\pm 0.01^{\text{mag}}$ . We find a L’ band contrast of $4.43\pm 0.02^{\text{mag}}$ , a slightly larger angular separation of $874.6\pm 0.8~{}\text{mas}$ and a slightly different position angle of $254.70\pm 0.05~{}\text{deg}$ , but (allowing for orbital motion) we can confirm the bound nature of the object. Note that there is a huge disagreement in the contrast, but a brief look at the raw data from Schöller et al. (2010) shows a significant PSF halo and confirms our result of $\sim 4^{\text{mag}}$ .

HIP 116384 C. This object was first detected in 2002 by Martín (2003) who found HIP 116384 (GJ 900) to be a triple system with a $510\pm 10~{}\text{mas}$ (HIP 116384 B, $\Delta\text{K}=1.61\pm 0.03^{\text{mag}}$ ) and a $760\pm 10~{}\text{mas}$ (HIP 116384 C, $\Delta\text{K}=2.38\pm 0.04^{\text{mag}}$ ) component. Lafrenière et al. (2007b) resolved the system again in 2004 and 2005, finding HIP 116384 B at an angular separation of $611\pm 2~{}\text{mas}$ and $673\pm 2~{}\text{mas}$ respectively, and HIP 116384 C at an angular separation of $733\pm 2~{}\text{mas}$ and $722\pm 2~{}\text{mas}$ respectively. In the cleaned data, we only find HIP 116384 C at a slightly larger angular separation of $842.90\pm 0.07~{}\text{mas}$ , but a position angle ( $346.614\pm 0.004~{}\text{deg}$ ) and a L’ band contrast ( $2.688\pm 0.001^{\text{mag}}$ ) which are in agreement with Martín (2003) and Lafrenière et al. (2007b), so that we can confirm the bound nature of the object. Looking at the raw data, we also find HIP 116384 B (which is the brighter of the two companions), noticing that it has moved to an angular separation of $\sim 1200~{}\text{mas}$ being too far away in order to be visible in our cleaned data (due to the windowing).

3.2.2 Close companion candidates

The close companion candidates reported in the lower section of Table 2 are all detected only by the kernel phase technique. For each target in Table 1, we use a grid search followed by a least squares search in order to find the best fit binary model $\theta_{\text{bin}}$ for the measured kernel phase $\theta$ . Then, we compute the detection significance from photon noise $\text{SNR}_{\text{ph}}$ (cf. Section 2.4.2) at the position of the best fit binary model $\theta_{\text{bin}}$ from the least squares search. For this step, we always use all other targets which were observed in the same OB as calibrators for the Karhunen-Loève calibration, fixing $K_{\text{klip}}=4$ 777For simplicity, we fix $K_{\text{klip}}=4$ for all targets and regardless of the number of calibrators. Various testing has shown that subtracting off the four statistically most significant eigencomponents of the kernel phase of the calibrators mostly yields the smallest amount of significant detections, i.e. calibrates the data best. A more rigorous investigation of this relationship is foreseen for a future publication.. Knowing that the majority of VLT/NACO targets do not have any close companions, we then classify the $\sim 1/3$ of the targets with the highest $\text{SNR}_{\text{ph}}$ in each OB as candidate detections (cf. column “Can” of Table 1) for the next step and the remaining targets as calibrators.

For the next step, we compute the empirical detection significance $\text{SNR}_{\text{emp}}^{\text{can}}$ (cf. Section 2.4.3) for each of the candidate detections from the previous step. For this step, we always use all remaining targets which were classified as calibrators in the previous step for the Karhunen-Loève calibration, again fixing $K_{\text{klip}}=4$ . If the empirical detection significance is above the $5\sigma$ threshold, i.e. $\text{SNR}_{\text{emp}}^{\text{can}}>5$ , we classify the candidate detection as real. If not, we add the candidate detection to the list of calibrators and redo the computation of the empirical detection significance (this time with one calibrator more than before). We repeat this process until all candidate detections are real. The detected close companion candidates are shown in Figure 12 and are described in more detail in the following paragraphs. Please note that we report the correlation of the best fit parameters in Appendix A and present model-data correlation plots in Appendix B.

HIP 50156 B. This object was already detected in 2011 by Bowler et al. (2015) at an angular separation of $\sim 90~{}\text{mas}$ and a K band contrast of $\sim 1.8^{\text{mag}}$ . Just nine month later, Brandt et al. (2014) cannot resolve this companion and report an upper limit of $\sim 20~{}\text{mas}$ for its angular separation. We find HIP 50156 B at an angular separation of $77.3\pm 0.8~{}\text{mas}$ and an L’ band contrast of $\sim 1.91\pm 0.03^{\text{mag}}$ , confirming the detection and notable orbital motion.

HIP 37918 B. This object is a new companion candidate which was not reported before. It has a L’ band contrast of $\sim 3.29\pm 0.05^{\text{mag}}$ , and therefore is a candidate for a stellar mass companion. Furthermore, HIP 37918 ( $M\approx 0.98~{}M_{\odot}$ ) is known to have a $\sim 23.1~{}\text{arcsec}$ companion of almost equal mass (HIP 37923, $M\approx 0.95~{}M_{\odot}$ , Desidera et al. 2006). Together with our companion candidate, this would make the system triple.

3.3 Detection limits

In Section 2.4.3, we present our empirical approach to find meaningful detection limits for the data analyzed in this paper. Based on this approach, we compute the contrast limit of the kernel phase technique as a function of the angular separation as the azimuthal mean of the RMS best fit contrast $c_{\text{RMS}}$ of all targets for which we do not detect any companions with the kernel phase technique (i.e. all non-detections, cf. column “Det” of Table 1). Note that we already subtracted off the signal induced by the wide companion candidates. The mean, the best and the worst contrast limit are shown in the left panel of Figure 13.

At the small angular separations which are inaccessible by classical high-contrast imaging techniques (i.e. within $\sim 200~{}\text{mas}$ in the L’ band), the kernel phase technique achieves contrast limits of $\sim 1\mathrm{e}{-2}$ . This is not yet deep enough to detect companions in the planetary-mass regime, which would start between $1\mathrm{e}{-3}$ and $1\mathrm{e}{-4}$ for young ( $\sim 10~{}\text{Myr}$ ) gas giants (e.g. Bowler, 2016). However, our closest detections prove that the resolution which is required to resolve solar-system scales in the nearest star forming regions can be achieved with the kernel phase technique. At larger angular separations, our best contrast limit is comparable with the limits achieved by RDI (e.g. Cantalloube et al., 2015). The large spread in the contrast limit comes from the fact that the amplitude of the background noise is nearly the same for all data cubes, whereas the peak value of the PSF varies heavily due to the PSF reconstruction (cf. Section 2.1.6).

3.4 Windowing correction

As mentioned in Section 2.2.2, we window all frames by a super-Gaussian (with a FWHM of $1240~{}\text{mas}$ ) in order to minimize edge effects when computing their Fourier transform. Due to this windowing, the brightness of companions at angular separations $\gtrsim 215~{}\text{mas}$ deviates by more than 1% from the true value. In order to correct for this effect, we again assume that kernel phase is proportional to contrast in the high-contrast regime, so that we can obtain the true contrast of a companion by dividing its measured contrast (i.e. the best fit contrast from the binary model) by the value of the super-Gaussian windowing function. We are aware that this method has its limits, as each PSF has a spatial extent on the detector and assuming that the entire PSF is multiplied by the same value is an over-simplification of the problem. Nevertheless, this method agrees fairly well with the contrasts which we measure in the cleaned fits files and we use it to correct the contrast of all wide companion candidates (cf. right panel of Figure 13). We add an additional contrast correction error in quadrature based on injection-recovery tests to companions wider than $500~{}\text{mas}$ to account for limitations in this technique.

4 Conclusions

We use the kernel phase technique in order to search for close companions at the diffraction limit in an archival VLT/NACO RDI L’ band data set. Therefore, we develop our own data reduction pipeline for VLT/NACO data, which performs a basic dark, flat, bad pixel and background (i.e. dither) subtraction, but also reconstructs saturated PSFs in order to reduce their Fourier plane noise. Furthermore, we select frames with sufficiently high Strehl ratio, which is essential for the kernel phase technique as it relies on a linearization of the Fourier plane phase. Then, we use XARA for extracting the kernel phase and improve its re-centering algorithm in the case of resolved and bright companions. Furthermore, we apply a principal component analysis based calibration to the data (i.e. Karhunen-Loève decomposition, Soummer et al., 2012) and develop a suite of analytic model fitting algorithms in order to search for point source companions with the kernel phase technique888https://github.com/kammerje/PyKernel.

For the archival data set which we analyze in Section 3, we find that our kernel phase covariance model (which only takes into account shot noise) is not sufficient and significantly underestimates the true errors. This is still the case after calibrating the data, because the diversity of calibrator PSFs is not sufficient. Hence, we develop an empirical method for estimating the relative contrast of the residual speckle noise and finding meaningful detection limits for the data. With this empirical approach, we detect six wide companion candidates by visually inspecting the cleaned data and two close ( $\sim 80$ – $110~{}\text{mas}$ ) companion candidates which are detected only by the kernel phase technique. All eight companion candidates lie in the stellar-mass regime and five of them were previously unknown.

In order to reach the planetary-mass regime, a better library of calibrator PSFs is required. Therefore, it is extremely important that the targets and their calibrators are observed as close in time as possible. This becomes very clear from the archival data set which we analyze, where there are in fact multiple calibrators observed in one night, but not close enough in time, so that the kernel phase calibration does not reduce the quasi-static errors satisfyingly. In order to make better use of our prinipal component analysis based calibration, we propose star-hopping sequences of $\sim 6$ targets, and to revisit each target at least twice. Star-hopping is an observing strategy for which the instrument (and in particular the AO system) acquisition is only performed once at the beginning of each sequence. Then, one slews (“hops”) from target to target without interrupting the AO system. Furthermore, we aim to examine more extensive Keck data sets where we are hopeful that the significant investment of telescope resources gives adequate calibrator diversity to characterize the systematic errors and possibly use Bayesian Monte-Carlo techniques.

In this paper, we have shown that kernel phase is able to achieve a resolution below the classical diffraction limit of a telescope under good observing conditions (i.e. sufficiently high Strehl ratio). This is of particular interest for future space-based observatories, such as the JWST, as it gives access to an exciting parameter space which could otherwise not be explored due to the limited mirror size (and therefore resolution). Space-based telescopes do not suffer from atmospheric turbulence, what makes the calibration much less challenging than for the ground-based VLT/NACO data (e.g. Martinache, 2010). Nevertheless, with an optimized observing strategy, kernel phase is also a competitive high-contrast imaging technique from the ground.

The application of kernel phase is of course not limited to imaging telescopes. One concept which aims to push the kernel phase technique towards higher contrasts is the VIKiNG instrument (Martinache & Ireland, 2018), which proposes kernel phase nulling interferometry with the VLTI. By combining kernel phase with a high-contrast booster (i.e. a nulling interferometer) it would allow for self-calibrating the observables and achieving a better robustness with respect to residual wavefront errors. This would in turn also be an option to reduce the demanding stability requirements on space-based nulling interferometers, such as the LIFE concept (Kammerer & Quanz, 2018; Quanz et al., 2018), which aims to detect dozens of Earth-like exoplanets in the solar neighborhood.

Acknowledgements

MJI was supported by the Australian Research Council Future Fellowship (FT130100235). This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement CoG #683029). JHG gratefully acknowledges support from the Director’s Research Funds at the Space Telescope Science Institute. The manuscript was also substantially improved following helpful comments from an anonymous referee.

Appendix A Parameter correlation

Appendix B Correlation plots

Bibliography48

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Amara & Quanz (2012) Amara A., Quanz S. P., 2012, MNRAS , 427, 948 · doi ↗
2Biller et al. (2014) Biller B. A., et al., 2014, Ap J , 792, L 22 · doi ↗
3Bowler (2016) Bowler B. P., 2016, PASP , 128, 102001 · doi ↗
4Bowler & Nielsen (2018) Bowler B. P., Nielsen E. L., 2018, preprint, ( ar Xiv:1802.10132 )
5Bowler et al. (2015) Bowler B. P., Liu M. C., Shkolnik E. L., Tamura M., 2015, Ap JS , 216, 7 · doi ↗
6Brandt et al. (2014) Brandt T. D., et al., 2014, Ap J , 786, 1 · doi ↗
7Cantalloube et al. (2015) Cantalloube F., et al., 2015, A&A , 582, A 89 · doi ↗
8Cheetham et al. (2016) Cheetham A. C., Girard J., Lacour S., Schworer G., Haubois X., Beuzit J.-L., 2016, in Optical and Infrared Interferometry and Imaging V. p. 99072 T, doi:10.1117/12.2231983 · doi ↗