A Visual Active Search Framework for Geospatial Exploration
Anindya Sarkar, Michael Lanier, Scott Alfeld, Jiarui Feng, Roman, Garnett, Nathan Jacobs, Yevgeniy Vorobeychik

TL;DR
This paper introduces a reinforcement learning-based visual active search framework for geospatial exploration, optimizing object detection in satellite imagery within limited search budgets, and demonstrates significant improvements over baselines.
Contribution
The paper presents a novel RL-based meta-search policy for geospatial search tasks, with domain adaptation techniques to handle domain gaps, advancing the state-of-the-art in satellite imagery analysis.
Findings
Significant performance improvements over baselines in satellite imagery search tasks.
Effective domain adaptation techniques enhance policy performance with domain gaps.
The approach generalizes well across multiple large-scale datasets.
Abstract
Many problems can be viewed as forms of geospatial search aided by aerial imagery, with examples ranging from detecting poaching activity to human trafficking. We model this class of problems in a visual active search (VAS) framework, which has three key inputs: (1) an image of the entire search area, which is subdivided into regions, (2) a local search function, which determines whether a previously unseen object class is present in a given region, and (3) a fixed search budget, which limits the number of times the local search function can be evaluated. The goal is to maximize the number of objects found within the search budget. We propose a reinforcement learning approach for VAS that learns a meta-search policy from a collection of fully annotated search tasks. This meta-search policy is then used to dynamically search for a novel target-object class, leveraging the outcome of any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
