Scalable Reverse Image Search Engine for NASAWorldview
Abhigya Sodani, Michael Levy, Anirudh Koul, Meher Anand Kasam, Siddha, Ganju

TL;DR
This paper presents a scalable, fast image similarity search engine for NASA's satellite imagery, reducing dataset exploration time from weeks to minutes using optimized CNN features and approximate nearest neighbor search.
Contribution
The authors developed a novel scalable image search system that compresses features and employs efficient search algorithms, enabling rapid retrieval from petabyte-scale satellite image datasets.
Findings
Search queries are completed in approximately 5 seconds on a single cloud VM.
Feature compression reduces image representation size from 2048 to 128 dimensions.
The system effectively handles petabyte-scale datasets with high speed and efficiency.
Abstract
Researchers often spend weeks sifting through decades of unlabeled satellite imagery(on NASA Worldview) in order to develop datasets on which they can start conducting research. We developed an interactive, scalable and fast image similarity search engine (which can take one or more images as the query image) that automatically sifts through the unlabeled dataset reducing dataset generation time from weeks to minutes. In this work, we describe key components of the end to end pipeline. Our similarity search system was created to be able to identify similar images from a potentially petabyte scale database that are similar to an input image, and for this we had to break down each query image into its features, which were generated by a classification layer stripped CNN trained in a supervised manner. To store and search these features efficiently, we had to make several scalability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Image Retrieval and Classification Techniques
MethodsBitcoin Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Batch Normalization · Residual Connection · Average Pooling · Dense Connections · Kaiming Initialization · Random Resized Crop · Feedforward Network
