Towards Efficient Detection of Small Near-Earth Asteroids Using the   Zwicky Transient Facility (ZTF)

Quanzhi Ye; Frank J. Masci; Hsing Wen Lin; Bryce Bolin; Chan-Kao; Chang; Dmitry A. Duev; George Helou; Wing-Huen Ip; David L. Kaplan; Emily; Kramer; Ashish Mahabal; Chow-Choong Ngeow; Avery J. Nielsen; Thomas A.; Prince; Hanjie Tan; Ting-Shuo Yeh; Eric C. Bellm; Richard Dekany; Matteo; Giomi; Matthew J. Graham; Shrinivas R. Kulkarni; Thomas Kupfer; Russ R.; Laher; Ben Rusholme; David L. Shupe; Charlotte Ward

arXiv:1904.09645·astro-ph.EP·May 29, 2019

Towards Efficient Detection of Small Near-Earth Asteroids Using the Zwicky Transient Facility (ZTF)

Quanzhi Ye, Frank J. Masci, Hsing Wen Lin, Bryce Bolin, Chan-Kao, Chang, Dmitry A. Duev, George Helou, Wing-Huen Ip, David L. Kaplan, Emily, Kramer, Ashish Mahabal, Chow-Choong Ngeow, Avery J. Nielsen, Thomas A., Prince, Hanjie Tan, Ting-Shuo Yeh, Eric C. Bellm, Richard Dekany

PDF

TL;DR

This paper introduces ZStreak, a machine-learning enhanced pipeline for real-time detection of small, fast-moving near-Earth asteroids using ZTF data, achieving significant discoveries and highlighting detection challenges.

Contribution

The paper presents ZStreak, an improved, real-time detection pipeline with a deep learning model, tailored for the increased data rate of ZTF, enabling new NEA discoveries.

Findings

01

Discovered 45 new NEAs, mostly under 100 meters in diameter.

02

Most objects are detectable within a 2-hour observation window.

03

Identified challenges in tracking and false positive suppression.

Abstract

We describe ZStreak, a semi-real-time pipeline specialized in detecting small, fast-moving near-Earth asteroids (NEAs) that is currently operating on the data from the newly-commissioned Zwicky Transient Facility (ZTF) survey. Based on a prototype originally developed by Waszczak et al. (2017) for the Palomar Transient Factory (PTF), the predecessor of ZTF, ZStreak features an improved machine-learning model that can cope with the $10 \times$ data rate increment between PTF and ZTF. Since its first discovery on 2018 February 5 (2018 CL), ZTF/ZStreak has discovered $45$ confirmed new NEAs over a total of 232 observable nights until 2018 December 31. Most of the discoveries are small NEAs, with diameters less than $\sim 100$ m. By analyzing the discovery circumstances, we find that objects having the first to last detection time interval under 2 hr are at risk of being lost. We will further…

Tables1

Table 1. Table 1: Features derived by the PSF fitting code and used for the ML training.

Feature name	Description	Variable in Eq. 1
flux	Integrated aperture flux of the streak	$F$
bg	Local background level	$b$
length	Length of the streak	$L$
sigma	Width of the streak	$σ$
lengtherr	Error of the length of the streak	$L$
sigmaerr	Error of the width of the streak	$σ$
paerr	Error of the positional angle of the streak	$θ$
bgerr	Error of the local background level	$b$
fitmagerr	Error of the fit magnitude
apsnr	SNR of the flux within the aperture
apmagerr	Error of the aperture magnitude
dmag	Difference between the aperture magnitude and the PSF-fitted magnitude
dmagerr	Error of dmag
chi2	$χ 2$ of the fit
numfit	Number of attempts for a converged fit

Equations9

f (x^{'}, y^{'}) = b + \frac{F}{L} \frac{1}{2 σ 2 π} exp (- \frac{y ^{'2}}{2 σ ^{2}}) [erf (\frac{x ^{'} + L /2}{σ 2}) - erf (\frac{x ^{'} - L /2}{σ 2})]

f (x^{'}, y^{'}) = b + \frac{F}{L} \frac{1}{2 σ 2 π} exp (- \frac{y ^{'2}}{2 σ ^{2}}) [erf (\frac{x ^{'} + L /2}{σ 2}) - erf (\frac{x ^{'} - L /2}{σ 2})]

erf (z) = \frac{2}{π} \int_{0}^{z} exp (- t^{2}) d t

erf (z) = \frac{2}{π} \int_{0}^{z} exp (- t^{2}) d t

x^{'} = (x - x_{0}) cos θ + (y - y_{0}) sin θ

x^{'} = (x - x_{0}) cos θ + (y - y_{0}) sin θ

y^{'} = (x - x_{0}) sin θ + (y - y_{0}) cos θ

η = k = 2 \sum n (k n) P^{k} (1 - P)^{n - k}

η = k = 2 \sum n (k n) P^{k} (1 - P)^{n - k}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Towards Efficient Detection of Small Near-Earth Asteroids Using the Zwicky Transient Facility (ZTF)

Quanzhi Ye (叶泉志)

Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA 91125

IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125

Frank J. Masci

IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125

Hsing Wen Lin (林省文)

Department of Physics, University of Michigan, Ann Arbor, MI 48109

Institute of Astronomy, National Central University, 32001, Taiwan

Bryce Bolin

B612 Asteroid Institute and DIRAC Institute Postdoctoral Fellow

Department of Astronomy, University of Washington, Seattle, WA 98195

B612 Asteroid Institute, 20 Sunnyside Ave, Suite 427, Mill Valley, CA 94941

Chan-Kao Chang (章展誥)

Institute of Astronomy, National Central University, 32001, Taiwan

Dmitry A. Duev

Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA 91125

George Helou

IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125

Wing-Huen Ip (葉永烜)

Institute of Astronomy, National Central University, 32001, Taiwan

Space Science Institute, Macau University of Science and Technology, Macau

David L. Kaplan

Department of Physics, University of Wisconsin-Milwaukee, Milwaukee, WI 53211

Emily Kramer

Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109

Ashish Mahabal

Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA 91125

Center for Data Driven Discovery, California Institute of Technology, Pasadena, CA 91125

Chow-Choong Ngeow

Institute of Astronomy, National Central University, 32001, Taiwan

Avery J. Nielsen

Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA 91125

Thomas A. Prince

Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA 91125

Hanjie Tan (谭瀚杰)

Department of Physics, National Central University, 32001, Taiwan

Ting-Shuo Yeh (葉庭碩)

Institute of Astronomy, National Central University, 32001, Taiwan

Eric C. Bellm

DIRAC Institute, Department of Astronomy, University of Washington, 3910 15th Avenue NE, Seattle, WA 98195

Richard Dekany

Caltech Optical Observatories, California Institute of Technology, Pasadena, CA 91125

Matteo Giomi

Institute of Physics, Humboldt-Universität zu Berlin, Newtonstr. 15, 12489 Berlin, Germany

Matthew J. Graham

Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA 91125

Shrinivas R. Kulkarni

Division of Physics, Mathematics and Astronomy, California Institute of Technology, Pasadena, CA 91125

Thomas Kupfer

Kavli Institute for Theoretical Physics, University of California, Santa Barbara, CA 93106

Russ R. Laher

IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125

Ben Rusholme

IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125

David L. Shupe

IPAC, California Institute of Technology, 1200 E. California Blvd, Pasadena, CA 91125

Charlotte Ward

Department of Astronomy, University of Maryland, College Park, MD 20742

(Received –; Revised –; Accepted –)

Abstract

We describe ZStreak, a semi-real-time pipeline specialized in detecting small, fast-moving near-Earth asteroids (NEAs) that is currently operating on the data from the newly-commissioned Zwicky Transient Facility (ZTF) survey. Based on a prototype originally developed by Waszczak et al. (2017) for the Palomar Transient Factory (PTF), the predecessor of ZTF, ZStreak features an improved machine-learning model that can cope with the $10\times$ data rate increment between PTF and ZTF. Since its first discovery on 2018 February 5 (2018 CL), ZTF/ZStreak has discovered $45$ confirmed new NEAs over a total of 232 observable nights until 2018 December 31. Most of the discoveries are small NEAs, with diameters less than $\sim 100$ m. By analyzing the discovery circumstances, we find that objects having the first to last detection time interval under 2 hr are at risk of being lost. We will further improve real-time follow-up capabilities, and work on suppressing false positives using deep learning.

surveys — minor planets, asteroids: general

††facilities: PO:1.2m††software: Scikit-learn (Pedregosa et al., 2012)

1 Introduction

Small Solar System bodies are remnants of the formation stage of the Solar System. They encompass small natural objects in the Solar System with sizes from $\sim 1$ meter to a few hundred kilometers, including near-Earth objects (NEOs), main-belt asteroids, trans-Neptunian objects, and various other smaller groups of asteroids and comets. Studies of small bodies contribute to the understanding of several fundamental questions in planetary science, such as the composition of the proto-planetary disk, the evolutionary history of Solar System, as well as the transportation and distribution of water and organic materials in the Solar System.

NEOs are of particular interest because, aside from a purely scientific perspective, they also pose threats to our civilization and provide opportunities for resource utilization for future space activities. For example, it is now widely recognized that the impact of a 10-km-class asteroid is responsible for the extinction of the dinosaurs (c.f. Schulte et al., 2010). Since the 1990s, a handful of dedicated NEO surveys have dramatically increased our knowledge of the NEO population. According to the statistics released by the International Astronomical Union Minor Planet Center (MPC)111https://www.minorplanetcenter.net/, more than 19,000 NEOs had been discovered by the end of 2018. The majority of known NEOs are asteroids (collectively known as near-Earth asteroids or NEAs), i.e. the rocky and relatively dry component of the remnants from the proto-planetary disk; only $\sim 100$ known NEOs are comets.

At this point, our knowledge of the distribution of km-sized NEOs is fairy complete (Jedicke et al., 2015), but drops sharply towards smaller sizes. Asteroids that are tens to $\sim 100$ -m in size are generally not expected to cause global catastrophe, but are still capable to cause severe damage on city or larger scales (Binzel, 1997). The most recent and notable example is probably the Chelyabinsk event in 2013, which caused significant property damage to the city of Chelyabinsk in Russia, but was produced by an previously undiscovered asteroid that was only 18-m in diameter (Borovička et al., 2013; Brown et al., 2013). Telescopic detection of these small asteroids is challenging, since they are either very faint when they are far from the Earth, or they have high apparent motion rates when close and bright enough to be detected (see Figure 1 for an example). NEOs that approach the Earth within $\sim 15$ lunar distances typically move at a rate of $>10^{\circ}$ /day (Vereš et al., 2012). These “Fast-Moving Objects” (FMOs) would trail on typical survey exposures (usually 20–60 seconds) and present a challenge for traditional NEO detection algorithms, which are tuned to detect objects moving slower than a few degrees per day (Jedicke et al., 2013).

Searching for asteroids by searching for “streaking” objects dates back to the late 19th century, as virtually all discoverable asteroids back then would move noticeably during the hour-long exposure using photographic plates. (433) Eros, the first NEA found in 1898, was found as a long streak on a 2-hour exposure (Scholl & Schmadel, 2002). The term “Fast-Moving Object” more specifically came into use to refer to streaked NEOs in the 1970s (e.g. Aksnes, 1971; Helin et al., 1976; Morrison et al., 1976), as the operation of larger telescopes as well as the use of more sensitive photographic films reduced the exposure time. The application of highly sensitive charge-coupled device (CCD) to NEO surveys, pioneered by the Spacewatch survey in the early 1990s (Rabinowitz, 1991; Scotti et al., 1991), virtually limits the FMO phenomenon to objects that pass very close to the Earth. While modern NEO surveys are largely automatic and mostly only requires human attention in the quality assurance stage, detection of streaking FMOs remains a challenge and still requires more human intervention. In 2004–2006, Spacewatch had successfully conducted a citizen science project (the Spacewatch FMO Project) that allowed the public to access their survey images and identify possible FMOs, for which the survey observers would initiate follow-up observations to refine the orbit (McMillan et al., 2005).

Using the survey data from the Palomar Transient Factory (PTF), Waszczak et al. (2017, hereafter W17) successfully demonstrated a prototype pipeline dedicated to FMO detection. PTF was a synoptic survey operated using the 1.2-m Oschin Schmidt telescope (Palomar 48-inch, P48) and the 7.3 deg2 CFH12K camera at Palomar Observatory from 2009 to 2016 (Law et al., 2009; Rau et al., 2009). PTF was succeeded by the Zwicky Transient Facility (ZTF; Bellm et al., 2019) in 2018 which replaced the CFH12K camera with a dedicated 47 deg2 camera. Routine NEO searches using ZTF data have been initiated, with emphasis on FMO detection. Here we present the ongoing effort on implementing the FMO detection capability on ZTF.

2 Overview of the ZTF Survey

ZTF is specifically designed for visiting the entire northern visible sky (north of $-30^{\circ}$ declination for Palomar) every night in one filter. The new 576 megapixel camera (Dekany et al. in prep), installed on the 1.2-m Oschin Schmidt telescope, can observe an area of 3760 deg2 in an hour, to a $5\sigma$ median detection limit of $m_{r^{\prime}}=20.7$ with a 30-second integration. A total of 40% of the observation time is devoted to public surveys, with the visible northern sky observed in $g^{\prime}$ and $r^{\prime}$ every three nights, and the visible Galactic Plane ( $|b|<7^{\circ}$ ) observed in $g^{\prime}$ and $r^{\prime}$ every night. The remaining 60% time (comprised of 40% partnership time and 20% Caltech time) is divided into a number of sub-surveys, with the major one being an hourly-cadence survey of mid-declination fields (Bellm et al., 2019).

Data acquired by the telescope are transmitted to Infrared Processing and Analysis Center (IPAC) at Caltech and are processed in real-time (Masci et al., 2019). IPAC manages the central ZTF Science Data System (ZSDS), handling the calibration and processing of single epoch images, astrometry and photometry, image co-addition, image differencing, alert generation, and moving objects. The moving object branch is divided into the ZTF Moving Object Detection Engine (ZMODE) that specializes in linking point-source detections of moving objects, and the ZTF Streak pipeline (ZStreak) which focuses on the detection of streaked objects. Although sharing largely the same scientific objectives, ZMODE is executed independently of ZStreak. The technical aspects and some initial results from ZMODE are covered in Masci et al. (2019) and Graham et al. (2019).

3 Detection Process

The streak detection process is outlined in Figure 2 and is further described below:

3.1 Initial Detection

The first step of the detection process is to search for linear image features that resemble the appearance of “trailed” sources in difference images. As described in Masci et al. (2019, § 3.6), the difference images are generated by subtracting “reference” images from the single epoch science images. The reference images are “static” representations of the sky, constructed from co-adding historical science images acquired earlier in the survey. The use of difference images suppresses the majority of static sources, which would otherwise lead to large numbers of false positives and contaminate the streak detection process.

The search for FMO candidates is first performed by using the findStreaks software, which is documented in the original W17 paper in the context of PTF. findStreaks identifies candidates by searching for contiguous bright pixels that exceed a signal-to-noise threshold and whose spatial distribution is approximately linear according to an estimate of the Pearson correlation coefficient. The software generates a set of metadata describing the morphology of each identified “raw” streak, including equatorial coordinates of its midpoint, length, width, positional angle, and integrated flux. Compared to the W17 work, findStreaks has been optimized for ZTF, primarily to improve the detection efficiency (completeness) for fainter, shorter streaks. The correlation coefficient threshold was reduced to 0.55 (formerly 0.65) and the minimum length reduced to 3 pixels (formerly 9 pixels). The pixel signal-to-noise threshold remains at 1.5 $\sigma$ . For ZTF, the effective minimum length of candidate streaks reported by findStreaks (following all internal thresholding) is $\approx 5$ pixels (or 5”, which corresponds to a minimum on-sky motion of 4∘/day for typical ZTF exposures). For PTF, the shortest reported lengths were $\approx 10^{\prime\prime}$ (or 4∘/day for typical PTF exposures). Ensuring we can detect shorter streaks with ZTF is important since the nominal integration time of the individual exposures is half (30 sec) of what it was for PTF. In practice, we find that candidate streaks from ZTF images shorter than $\approx 5^{\prime\prime}$ turn out to be spurious (unreliable) detections. We compensate for this by setting a lower cutoff of 7 pixels (5∘/day) when selecting candidates for inclusion in the training set for the machine-learned classifier. This is further described in § 3.3.

The original W17 work used the metrics from findStreaks as direct input to their machine-learned classifier, with astrometric and flux information therefrom included in MPC submissions. Here we improve the estimation and reporting process by introducing an additional step to further characterize the candidate streaks.

3.2 Streak Point-Spread Function (PSF) Fitting Code

Photons from FMOs, like those from other astronomical sources, are subjected to the point-spread function (PSF) effect of the telescope system. This has two implications: (1) PSF-related metrics can be used as an effective discriminator against certain image artifacts such as cosmic rays, internal reflections of the optical system, and sensor artifacts; and (2) construction of the FMO’s PSF profile can allow better astrometry to be derived, as demonstrated in previous work (e.g. Rabinowitz, 1991; Kouprianov, 2008; Vereš et al., 2012).

To derive metrics that describe the PSF behavior of a streak, we apply the trail fitting technique described in Vereš et al. (2012) and Lin et al. (2015): the base function is essentially a finite-length streak convolved with a Gaussian PSF of width $\sigma$ , which can be written as

[TABLE]

where $b$ is the background level, $F$ is the total integrated flux of the streak, $L$ is the length of the streak, and

[TABLE]

is the Gaussian error function, and

[TABLE]

where $x_{0}$ , $y_{0}$ defines the centroid of the streak, and $\theta$ defines the angle between the motion of the streak and the $x$ axis. Note that a finite shutter opening and closing time of 0.43 second has not been included in this PSF.

Metrics calculated by findStreaks are used as inputs to the streak fitting code. The code attempts to fit the image feature, defined by pixels above $2\sigma$ level, using the Levenberg-Marquardt technique (Figure 3). We start with the initial position suggested by findStreaks, and perform a walk within the $3\times 3$ grid around the initial positions if the fit fails, until the fit is successful. Experiments show that streaks that cannot be fitted after multiple attempts do not exhibit any PSF-like behavior. These streaks are therefore rejected. For the ones that can be fitted, the code will compute the $F$ , $L$ , $x_{0}$ , $y_{0}$ , $\sigma$ , $\theta$ and $b$ for each of these streaks, and append these variables to the metadata previously computed by findStreaks.

After PSF-fitting filtering, one typical ZTF exposure will yield several hundreds streaks. A full night of ZTF observations will yield an order of $10^{5}$ streaks. The majority of these are long satellite tracks, bright star diffraction spikes, and various image artifacts. This is $\sim 10\times$ reduction/improvement over PTF, which would produce a similar number of raw streaks (Waszczak et al., 2017) but with $10\times$ smaller field-of-view. Still, screening $10^{5}$ candidates by eye is a challenging task. Therefore, we follow W17’s approach and use machine learning (ML) to cope with this challenge.

3.3 Machine-learned classifier

Similar to W17, we use a scikit-learn-based Random Forest classifier (Pedregosa et al., 2012) to set up our ML model, but have redefined the feature list as well as the method to construct the training set (see also Mahabal et al., 2019).

W17 defined 15 features derived from findStreaks, and used a hybrid of 1,441 real FMOs and synthetically-generated streaks, as well as $\sim 20,000$ bogus streaks (i.e., raw streaks that are not real FMOs) as their training set. We define 15 features derived from the PSF fitting code described above (Table 1) that mostly do not overlap with the feature list defined by W17. We also do not use real FMOs in our training set, as they represent the brighter asteroid population that may introduce bias into the classifier. The relatively rare occurrence of real FMOs ( $\lesssim 10$ real FMOs per night, including different sighting of the same object) also lead to the possible issue of over-fitting. Instead, we generate $\sim 50,000$ synthetic streaks using Equation 1 and inject them into random ZTF differenced images, and extract their morphological metrics as we would for the raw streaks. We then randomly select $\sim 50,000$ ZTF raw streaks from ZTF difference images and label them as bogus streaks. The contamination of real FMOs in the bogus set is expected to be minimal, again due to the rare occurrence of real FMOs.

The ML model is then trained using the $\sim 50,000$ -sample “real” set (though here it is really a “synthetic” set) and the $\sim 50,000$ -sample “bogus” set. We set the number of trees to 1,000 after experimenting with performance versus resource cost. Other ML-related parameters (such as depth of a tree, number of samples per leaf, and number of leaf nodes) are left unconstrained, as they do not have a noticeable effect on the performance. The synthetics are derived assuming a flat distribution of motion direction, a flat distribution of angular motion rate between 5–50∘/day, and an apparent brightness distribution following $N\propto F^{-0.65}$ (derived from the power-law distribution of sub-km NEAs derived by Mainzer et al., 2011) within a range of $V=15$ to $V=20$ , where $N$ is the cumulative number of synthetics to be generated and $F$ is the flux level. The brightness distribution is derived following the consensus on the NEO size distribution (c.f. Jedicke et al., 2015). Note that the “true” distribution is more complicated: the brightness distribution and the motion rate is dependent to each other, what we construct here is merely a zero-order approximation to the problem.

Model training was carried out more frequently during the commissioning months of ZTF (first half of 2018), due to construction and re-construction of reference images (which may slightly affect the quality of resulting differenced images), as well as engineering modifications to the ZTF system. As a result, the numbers in each of the sets vary a bit. As of this writing, the most recent model was trained and passed to the Data System pipeline in mid-June of 2018.

The ML model computes a score that varies from 0 to 1, with 0 corresponding to a streak least likely to be real while 1 corresponds to the most likely. Currently, we adopt a threshold of 0.05, which (as will be shown below) is $98\%$ complete in detecting real FMOs presented in the raw streak sample. The detection completeness up to this stage will be discussed in greater detail in § 4.2.

3.4 Human scanning and streak-linking

Streaks with scores higher than the ML score threshold are posted on a webpage (Figure 4) for human scanning. Typical lag time from image acquisition to posting of ML-vetted streaks is 20–40 minutes. A typical night would see $\sim 10^{4}$ streaks accumulated on this page, which is $\sim 10\%$ of the raw streaks initially detected by findStreaks.

Presently, scanning is done twice in a night: a mid-night (Pacific time) scan, usually carried out by scanners residing in a convenient timezone (e.g. Europe, Asia); and a morning scan, usually carried out by scanners in the U.S. The duties of the scanner are to identify and save objects that are potentially interesting for further inspection. For experienced scanners, it usually takes up to 20 minutes to review a half night’s worth of data ( $\sim 5$ – $10\times 10^{4}$ ). The plan is to gradually distribute the scanning effort over the Pacific night, in order to reduce elapsed time between exposure and identification of potentially interesting objects.

Once the scanning is done, a separate page displays additional information for objects that are potentially interesting (see Figure 5). This information allows operators to identify streaks that are potentially different sightings of the same object. Operators compare the position, direction and length of the streaks, and identify streaks that have similar direction, length, and region of appearance. These are streaks that are likely linked to each other. Plausible linkages are further examined using Find_Orb, an open-source orbit-computing software222https://www.projectpluto.com/find_orb.htm., to see if they have converging orbit solutions.

The idea of streak-linking is similar to the linking of tracklets in the conventional ZMODE-like point-source moving object detection (c.f. Denneau et al., 2013; Vereš & Chesley, 2017); however, it should be noted that each streak detection contains time, position and velocity information, as opposed to point-source object detection, which only contains time and position. The variable-in-question in a streak detection is the direction of the motion, which has two solutions and requires a secondary detection to fully determine. Therefore, while streak-linking only requires a minimum of two detections to operate, ZMODE-like system requires a minimum of three.

The converged orbital solution of two or more streaks is called a “track”. Each track is first checked against the database of known artificial objects and asteroids, using the sat_id tool333https://www.projectpluto.com/sat_id.htm. and the NEO Checker tool444https://www.minorplanetcenter.net/cgi-bin/checkneo.cgi.. The MPC does not have the responsibility of collecting observations of artificial objects, and therefore observations of these objects are not submitted. Observations of both known and unknown asteroids are immediately submitted to the MPC. The MPC discourages labeling known asteroids555https://www.minorplanetcenter.net/iau/info/Astrometry.html#id, retrieved 2018 September 1., therefore known asteroids are submitted by their temporary identifiers which are of the form ZTFXXXX (where X can be 0-9, A-Z and a-z).

3.5 Discovery Announcement and Follow-up

Unlike the procedure outlined in W17 of triggering Target-of-opportunity (ToO) sessions for streaks with a single detection, we generally only consider objects from which a track can be generated. This is primarily due to the benefit we can reap from ZTF’s large field-of-view, which is sufficient to detect most FMOs more than once under typical survey cadence. For plausible FMOs that are sighted only once, human operators will search for secondary detections in the images of the same region taken near the detection epoch. For most of these cases, no secondary detection can be found even though nearby images exist, suggesting that these are in fact image artifacts or satellite flashes. On rare occasions, the images containing secondary sightings did not go through the reference image pipeline and therefore did not generate streak products. This is usually due to the lack of pre-existing reference images in the field (Masci et al., 2019), and we expect that such a situation will become less common as ZTF accumulates data over the sky. Sometimes ZStreak can also miss the secondary sighting even though it is in the image, due to a variety of reasons (e.g., interference from nearby bright star or satellite tracks, being close to the edges, or simply being too faint). In these cases, the secondary sightings are manually measured and are added to the track.

Observations are submitted to the MPC in the new Astrometry Data Exchange Standard (ADES) format666https://www.minorplanetcenter.net/iau/info/ADES.html.. The ADES format implements data fields that are not represented in the traditional MPC1992 format, such as astrometric and photometric uncertainty, both of which can be derived from the PSF fitting code and other ZTF data products. Incorporating astrometric uncertainty is particularly important for FMO observations, since the along-streak uncertainty is typically larger than cross-streak uncertainty and is expected to have an impact on the quality of the resulting computed orbit.

Upon receipt of the observations, the MPC performs its own check against known asteroids and artificial objects. If both turn out to be negative, the object is posted on the Near-Earth Object Confirmation Page (NEOCP) to facilitate follow-up observations from observers worldwide. The ZTF partnership typically uses the 1-m telescope at Lulin Observatory (Ye & Lin, 2009; Lin et al., 2018) located on the other side of the Pacific for follow-up purposes. In principle, the P48 system would only conduct self follow-up if the future positional uncertainty of an object using two or more streaks would be too large for telescopes with typical field-of-view to recover (see discussions in § 4.1).

4 Preliminary Results

4.1 NEAs

ZStreak discovered its first FMO (2018 CL) on 2018 Febuary 5, the first night of its test operation (Ye, 2018). As of 2018 December 31, ZStreak has discovered a total of 45 FMOs over a total of 232 observable nights. (A “discovery” is defined as a FMO that has been assigned a provisional designation by the MPC and is first reported by ZTF.) Most (36 out of 45) were found in the second half of the year, as the first half of the year saw extensive modifications and updates as part of the ZTF commissioning phase. The operation in the second half of the year lost a month (October) due to an instrumental issue. Meanwhile, software is continuously being improved. Therefore we believe that we are yet to reach the full power of ZStreak.

To understand the NEA population that ZStreak is probing, we analyze the miss distances to the Earth and the absolute magnitudes of the discovered FMOs, with the result shown as Figure 6. Unsurprisingly, ZStreak is mostly finding small NEAs that would soon pass or have just passed very close ( $\lesssim 0.02$ au, or 10 LD) to the Earth. The mean absolute magnitude of the ZStreak discoveries is $\bar{H}=26.2\pm 1.7$ (corresponding to a diameter of $\sim 30$ m assuming an albedo of 0.1), which is higher than those of other NEO surveys ( $23.3\pm 2.6$ , c.f. Vereš et al., 2018). This highlights the fact that ZTF/ZStreak preferentially finds smaller NEAs due to its capability in detecting streaks.

Some objects posted on NEOCP did not receive adequate follow-up observations and were subsequently declared as “lost” NEOs. As of 2018 December 31, a total of 53 ZStreak objects posted on NEOCP were lost due to insufficient follow-ups. This indicates a loss rate of $\sim 50\%$ , comparable to earlier numbers derived by W17, but is much higher than average for other NEO surveys ( $11\%$ , c.f. Vereš et al., 2018).

To determine the cause of lost FMOs, we analyze the length of the orbital arc of the discovery data for both confirmed and unconfirmed objects, as well as the delay of the first confirmation for confirmed objects. As shown in Figure 7, we find that most confirmed discoveries have discovery arc length of over two hours, while all unconfirmed discoveries have discovery arc length $<2$ hr. (The discovery arc length is the time span between the first and the last detection of the discovery.) This suggests that the population of confirmed NEAs are biased towards NEAs that have longer discovery arc lengths. This could be helped by triggering additional target-of-opportunity observations 2 hours after the initial detection. Additional analysis of the $H$ of the unconfirmed objects shows $\bar{H}=31.1$ 777The $H$ of an unconfirmed object is simply derived from the orbit solution with minimum $O-C$ difference. We note that the orbit of an unconfirmed object is often poorly constrained due to the limited length of the arc., indicating that these are very small, meter-sized objects (Figure 8).

4.2 Completeness

The completeness of a survey is limited by a variety of factors, such as sky coverage, detection efficiency of the source-finding software, sufficient visits to establish a tracklet, and so on. Modern-day NEO surveys heavily rely on automatic software, therefore their completeness is largely limited by software capabilities. Compared to point-source-based method, FMO detection has its own unique niche and challenge: it requires fewer images to build up tracklets and tracks, but is more vulnerable to the contamination from other sources and artifacts since a FMO would spread across multiple pixels.

To quantify the completeness of ZStreak, we compile a list of known FMOs that are imaged by ZTF from 2018 February 5 to 2018 May 11, and then re-process the ZTF images containing these FMOs, using the current version of ZStreak. We then examine the raw streaks produced by the re-processing to estimate the likelihood of detection at each sighting of these FMOs.

Figure 9 shows the completeness of ZStreak for individual streak detection derived from 265 positive detections of known (i.e. MPC-cataloged) FMOs. We find that ZStreak works well with FMOs faster than $\sim 10^{\circ}$ /day, which are approximately 15 pixels long at typical ZTF exposures (30-second). The overall completeness of findStreaks and PSF-fitting filtering is $\sim 72\%$ for FMOs that are between a $V$ magnitude of 15 and 20 and are $>12^{\circ}$ /day, which is noticeably higher than the $45\%$ completeness achieved by W17 in the same magnitude range. Most of the improvement comes from the detection of fainter FMOs: W17 reported a detection limit of $V\sim 18.5$ , while our detection limit reaches $V\sim 19.8$ . The improvement of completeness is primarily due to a cleaner background, as well as the fine tuning of findStreaks.

Another potential bottleneck that can limit the completeness is the stage of ML classification. To assess the performance of the ML, we compute the true positive rate (TPR; i.e., fraction of real streaks correctly classified as such) using the positive detections of known FMOs collected above, as well as the false positive rate (FPR; i.e., fraction of bogus streaks incorrectly classified as reals) from a random query of the bogus streaks in the ZStreak database. The resulting receiver operating characteristic (ROC) plot is shown as Figure 10. We find that a ML score cutoff at the level of 0.05 yields a TPR of 0.98 and a FPR of 0.15, which we believe is a good balance for efficiency. Experience from survey operations (since June 2018) shows that the TPR is indeed very close to 1 while the FPR varies from 0.1 to about 0.3.

The numbers presented above are the completeness of single-sighting detections; the “true” completeness of ZTF/ZStreak as an NEO discovery engine will need to take the linkage of multiple-sighted streaks into account. This cannot be calculated easily, since ZTF employs multiple survey strategies. The problem is further complicated by the fact that the human operator occasionally conducts visual searches for the second sightings of single detections that are likely undetected by ZStreak.

Here we provide some rough estimates of the true completeness of the system. For public surveys, which visit each field twice each night (with one in $g^{\prime}$ and the other in $r^{\prime}$ ), the likelihood of successfully detecting a link-able FMO is $\eta=P^{2}=(72\%)^{2}=52\%$ (where $P=72\%$ is the likelihood of positive detection of a FMO in the image, as derived above). For sub-surveys that visit the same field $n$ times a night, the likelihood follows

[TABLE]

For $n\geq 3$ , $\eta\geq 81\%$ . If we take the fact that ZTF spends 40% time on public surveys and naïvely assumes that ZTF spends the other 60% on hourly-cadence survey (i.e. $\sim 6$ visits per field, per night), a very rough estimate on the global completeness of ZTF/ZStreak is $\sim 80\%$ .

4.3 Artificial Objects

ZStreak’s specialty makes it capable of detecting artificial satellites and debris in high Earth orbits (HEOs). Objects in HEOs orbit the Earth beyond the geosynchronous orbit, or 36,000 km (0.09 LD) above Earth’s surface, therefore can behave like Earth-approaching asteroids, especially near their apogees. Positions of these objects can be calculated from their respective Two-Line Element (TLE) files, which are made available by the North American Aerospace Defense Command (NORAD) and space enthusiasts.

On the other hand, satellites that exhibit non-Keplerian motion, as well as smaller objects that do not appear in the public catalog, can be confused with new Earth-approaching asteroids. Small objects are particularly challenging since their dynamics are dominated by radiation force and are highly complicated (e.g. Schildknecht et al., 2008). It is often not possible to distinguish artificial objects from NEAs based on the discovery data. These objects will be submitted to the MPC for further confirmation, as follow-up observations are needed to constrain the orbit and to see whether the orbit is geocentric (a strong indicator that the object is artificial).

As of 2018 December 31, a total of 30 ZStreak detections posted on NEOCP are removed for having geocentric orbits (i.e. likely artifical in origin). This suggests that approximately $30/(30+45)=40\%$ of confirmed ZStreak submissions are geocentric objects. In parallel, we find that about half of the unconfirmed candidates noted in § 4.1 have orbit solutions compatible with geocentric objects. It should be note that objects on geocentric orbits are preferentially more difficult to recover, as they are closer to the Earth than typical NEAs. Therefore, we estimate that $\sim 50\%$ of “real” ZStreak detections are likely artificial in origin.

Figure 11 shows a few examples of HEOs detected by ZStreak. It can be seen that their appearance is indistinguishable from real NEAs. Though some “new” objects have been identified to be boosters or fragments from previous HEO missions, many new detections remain unidentified. We note that all unidentified objects have $H>31$ , which seems to imply the completeness of the public HEO catalog is somewhere near $H=31$ (about 1–2 meter in diameter, assuming an albedo of 0.5). To that end, a notable discovery is ZTF00hg. At $H=35.4$ (approximately 15 cm in diameter, assuming an albedo of 0.5), this object is probably the smallest object ever detected by a NEO survey.

5 Conclusions and Future Work

We have presented the design, operation and preliminary results from ZStreak, a novel pipeline dedicated to real-time detection of fast-moving objects (FMOs) with the new Zwicky Transient Facility (ZTF) survey. We showed that ZStreak achieved a completeness of $72\%$ for FMOs with a $V$ magnitude between 15 and 20, moving at an apparent rate of $>12^{\circ}$ /day. At the time of the writing, ZStreak has operated in semi-real-time since ZTF commissioning (2018 February), and has detected about 100 FMOs in the process, with about half of them assigned to ZTF by the MPC.

It was also found that close to $50\%$ of ZTF-found new FMOs could not be confirmed. We found that objects with initial detection arc length shorter than 2 hours are at risk of being lost. We concluded that an efficient FMO survey with ZTF-like setup will need to have an internal turnaround time (i.e. from telescopic detection to MPC submission) less than $\sim 2$ hours to maximize the recovery rate.

Apart from further streamlining and automating the operation, efforts have been focused on reducing the high false positive rate. Among other options (e.g. Lieu et al., 2018; Nir et al., 2018), we have been exploring neural networks as an alternative approach to screen raw streaks (Duev et al. in prep). Preliminary testing has shown a $20-25\times$ reduction on the false positive rate. It is hoped that a smaller false positive rate would make it easier to perform real-time scanning, that could help facilitate near-real-time follow-up to prevent discoveries from getting lost.

Looking into the future, the experience of ZTF/ZStreak will provide a useful knowledge base for next generation sky surveys such as the Large Synoptic Survey Telescope (LSST). The smaller pixels, better seeing, and fainter limiting magnitude of LSST will push the detection of trailed objects into a new regime. FMO detection will also be useful for next generation space-based surveys such as NEOCam, WFIRST and Euclid (Bauer et al., 2018; Carry, 2018; Mainzer & NEOCam Science Team, 2016). Even though WFIRST and Euclid are not designed for asteroid detection, being able to extract asteroid observation from the survey data will compliment existing ground- and space-based NEO surveys, as shown by previous works that make use of general-purpose sky surveys (e.g. Ivezić et al., 2001; Masiero et al., 2011).

We thank an anonymous referee for helpful comments. We extend our thanks to many professional and amateur astronomers around the world for their valuable and timely follow-up observations which are critical for FMO discoveries. We thank Bill Gray in particular for his find_orb and sat_id software, which has been very useful to our work. We also thank Eric Christensen, Davide Farnocchia and Gareth Williams for beneficial discussions. Q.-Z. Ye acknowledges support by the GROWTH (Global Relay of Observatories Watching Transients Happen) project funded by the National Science Foundation PIRE (Partnership in International Research and Education) program under Grant No 1545949. Q.-Z. Ye is grateful for the Spacewatch FMO Project which sparked his interest in asteroids. C.-C. Ngeow thanks the funding from Ministry of Science and Technology (Taiwan) under grant 104-2923-M-008-004-MY5. Based on observations obtained with the Samuel Oschin Telescope 48-inch Telescope at the Palomar Observatory as part of the Zwicky Transient Facility project. ZTF is supported by the National Science Foundation under Grant No. AST-1440341 and a collaboration including Caltech, IPAC, the Weizmann Institute for Science, the Oskar Klein Center at Stockholm University, the University of Maryland, the University of Washington, Deutsches Elektronen-Synchrotron and Humboldt University, Los Alamos National Laboratories, the TANGO Consortium of Taiwan, the University of Wisconsin at Milwaukee, and Lawrence Berkeley National Laboratories. Operations are conducted by COO, IPAC, and UW. This research has made use of data and/or services provided by the International Astronomical Union’s Minor Planet Center. The National Geographic Society - Palomar Observatory Sky Atlas (POSS-I) image of 107P/(4015) Wilson-Harrington presented in this paper were obtained from the Mikulski Archive for Space Telescopes (MAST). STScI is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS5-26555. POSS-I was made by the California Institute of Technology with grants from the National Geographic Society.

Bibliography41

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Aksnes (1971) Aksnes, K. 1971, NASA Special Publication, 267, 649
2Bauer et al. (2018) Bauer, J., Milam, S., & Holler, B. 2018, in COSPAR Meeting, Vol. 42, 42nd COSPAR Scientific Assembly, B 0.3–13–18
3Bellm et al. (2019) Bellm, E. C., Kulkarni, S. R., Graham, M. J., et al. 2019, PASP, 131, 018002
4Binzel (1997) Binzel, R. P. 1997, Annals of the New York Academy of Sciences, 822, 545
5Borovička et al. (2013) Borovička, J., Spurný, P., Brown, P., et al. 2013, Nature, 503, 235
6Bowen (1949) Bowen, I. S. 1949, IAU Circ., 1244
7Brown et al. (2013) Brown, P. G., Assink, J. D., Astiz, L., et al. 2013, Nature, 503, 238
8Carry (2018) Carry, B. 2018, A&A, 609, A 113