Online Feedback Efficient Active Target Discovery in Partially Observable Environments
Anindya Sarkar, Binglin Ji, Yevgeniy Vorobeychik

TL;DR
This paper introduces DiffATD, a diffusion-guided active target discovery method that efficiently balances exploration and exploitation in partially observable environments, outperforming baselines without prior supervised training.
Contribution
The paper presents a novel diffusion-guided approach for active target discovery that operates without supervised training and offers interpretability in partially observable settings.
Findings
DiffATD outperforms baseline methods across multiple domains.
It effectively balances exploration and exploitation.
It achieves competitive results with fully supervised methods.
Abstract
In various scientific and engineering domains, where data acquisition is costly--such as in medical imaging, environmental monitoring, or remote sensing--strategic sampling from unobserved regions, guided by prior observations, is essential to maximize target discovery within a limited sampling budget. In this work, we introduce Diffusion-guided Active Target Discovery (DiffATD), a novel method that leverages diffusion dynamics for active target discovery. DiffATD maintains a belief distribution over each unobserved state in the environment, using this distribution to dynamically balance exploration-exploitation. Exploration reduces uncertainty by sampling regions with the highest expected entropy, while exploitation targets areas with the highest likelihood of discovering the target, indicated by the belief distribution and an incrementally trained reward model designed to learn the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsDiffusion
