RS-OVC: Open-Vocabulary Counting for Remote-Sensing Data

Tamir Shor; George Leifman; Genady Beryozkin

arXiv:2604.08704·cs.CV·April 13, 2026

RS-OVC: Open-Vocabulary Counting for Remote-Sensing Data

Tamir Shor, George Leifman, Genady Beryozkin

PDF

TL;DR

RS-OVC introduces an open-vocabulary counting model for remote-sensing imagery, enabling accurate counting of unseen object classes using textual and visual cues, thus overcoming the limitations of closed-set methods.

Contribution

This work presents the first open-vocabulary counting model for remote sensing, allowing for flexible counting of novel objects without re-annotation or re-training.

Findings

01

Model accurately counts unseen object classes.

02

Enables flexible, real-world remote sensing monitoring.

03

Uses textual and visual conditioning for open vocabulary understanding.

Abstract

Object-Counting for remote-sensing (RS) imagery is attracting increasing research interest due to its crucial role in a wide and diverse set of applications. While several promising methods for RS object-counting have been proposed, existing methods focus on a closed, pre-defined set of object classes. This limitation necessitates costly re-annotation and model re-training to adapt current approaches for counting of novel objects that have not been seen during training, and severely inhibits their application in dynamic, real-world monitoring scenarios. To address this gap, in this work we propose RS-OVC - the first Open Vocabulary Counting (OVC) model for Remote-Sensing and aerial imagery. We show that our model is capable of accurate counting of novel object classes, that were unseen during training, based solely on textual and/or visual conditioning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.