A Visual Active Search Framework for Geospatial Exploration

Anindya Sarkar; Michael Lanier; Scott Alfeld; Jiarui Feng; Roman; Garnett; Nathan Jacobs; Yevgeniy Vorobeychik

arXiv:2211.15788·cs.CV·October 31, 2023·1 cites

A Visual Active Search Framework for Geospatial Exploration

Anindya Sarkar, Michael Lanier, Scott Alfeld, Jiarui Feng, Roman, Garnett, Nathan Jacobs, Yevgeniy Vorobeychik

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning-based visual active search framework for geospatial exploration, optimizing object detection in satellite imagery within limited search budgets, and demonstrates significant improvements over baselines.

Contribution

The paper presents a novel RL-based meta-search policy for geospatial search tasks, with domain adaptation techniques to handle domain gaps, advancing the state-of-the-art in satellite imagery analysis.

Findings

01

Significant performance improvements over baselines in satellite imagery search tasks.

02

Effective domain adaptation techniques enhance policy performance with domain gaps.

03

The approach generalizes well across multiple large-scale datasets.

Abstract

Many problems can be viewed as forms of geospatial search aided by aerial imagery, with examples ranging from detecting poaching activity to human trafficking. We model this class of problems in a visual active search (VAS) framework, which has three key inputs: (1) an image of the entire search area, which is subdivided into regions, (2) a local search function, which determines whether a previously unseen object class is present in a given region, and (3) a fixed search budget, which limits the number of times the local search function can be evaluated. The goal is to maximize the number of objects found within the search budget. We propose a reinforcement learning approach for VAS that learns a meta-search policy from a collection of fully annotated search tasks. This meta-search policy is then used to dynamically search for a novel target-object class, leveraging the outcome of any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anindyasarkariith/vas
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning