RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent

Wenjia Xu; Zijian Yu; Boyang Mu; Zhiwei Wei; Yuanben Zhang; Guangzuo Li; Jiuniu Wang; Mugen Peng

arXiv:2406.07089·cs.CV·February 11, 2026·2 cites

RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent

Wenjia Xu, Zijian Yu, Boyang Mu, Zhiwei Wei, Yuanben Zhang, Guangzuo Li, Jiuniu Wang, Mugen Peng

PDF

Open Access 1 Repo

TL;DR

RS-Agent is an AI framework that autonomously orchestrates specialized remote sensing tools using large language models, significantly improving task accuracy and performance in complex real-world remote sensing applications.

Contribution

The paper introduces RS-Agent, a novel AI agent architecture that integrates task-aware retrieval and dual retrieval-augmented generation mechanisms for remote sensing tasks.

Findings

01

Achieves over 95% task planning accuracy.

02

Outperforms state-of-the-art models in remote sensing tasks.

03

Demonstrates robustness across 9 datasets and 18 tasks.

Abstract

The unprecedented advancements in Multimodal Large Language Models (MLLMs) have demonstrated strong potential in interacting with humans through both language and visual inputs to perform downstream tasks such as visual question answering and scene understanding. However, these models are constrained to basic instruction-following or descriptive tasks, facing challenges in complex real-world remote sensing applications that require specialized tools and knowledge. To address these limitations, we propose RS-Agent, an AI agent designed to interact with human users and autonomously leverage specialized models to address the demands of real-world remote sensing applications. RS-Agent integrates four key components: a Central Controller based on large language models, a dynamic toolkit for tool execution, a Solution Space for task-specific expert guidance, and a Knowledge Space for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

intellisensing/rs-agent
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation