CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models

Xiao An; Jiaxing Sun; Zihan Gui; Wei He

arXiv:2411.18145·cs.CV·November 13, 2025

CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models

Xiao An, Jiaxing Sun, Zihan Gui, Wei He

PDF

Open Access 1 Repo 1 Datasets

TL;DR

CHOICE is a comprehensive benchmark designed to evaluate the remote sensing capabilities of large vision-language models across perception and reasoning, addressing a critical gap in systematic assessment tools for Earth observation tasks.

Contribution

This paper introduces CHOICE, the first extensive benchmark with over 10,500 problems, for objectively evaluating the remote sensing abilities of VLMs across multiple dimensions.

Findings

01

Most VLMs show significant limitations in remote sensing tasks.

02

The benchmark reveals gaps in perception and reasoning capabilities of current models.

03

CHOICE provides a standardized platform for future model evaluation and development.

Abstract

The rapid advancement of Large Vision-Language Models (VLMs), both general-domain models and those specifically tailored for remote sensing, has demonstrated exceptional perception and reasoning capabilities in Earth observation tasks. However, a benchmark for systematically evaluating their capabilities in this domain is still lacking. To bridge this gap, we propose CHOICE, an extensive benchmark designed to objectively evaluate the hierarchical remote sensing capabilities of VLMs. Focusing on 2 primary capability dimensions essential to remote sensing: perception and reasoning, we further categorize 6 secondary dimensions and 23 leaf tasks to ensure a well-rounded assessment coverage. CHOICE guarantees the quality of all 10,507 problems through a rigorous process of data collection from 50 globally distributed cities, question construction and quality control. The newly curated data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shawnan-whu/choice
pytorchOfficial

Datasets

isaaccorley/CHOICE
dataset· 54 dl
54 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Satellite Image Processing and Photogrammetry

Methods+ ( 1 ) ⟷ 888 ⟷ ( 829 ) ⟷ 0881||How do I resolve a dispute on Expedia?