RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent
Wenjia Xu, Zijian Yu, Boyang Mu, Zhiwei Wei, Yuanben Zhang, Guangzuo Li, Jiuniu Wang, Mugen Peng

TL;DR
RS-Agent is an AI framework that autonomously orchestrates specialized remote sensing tools using large language models, significantly improving task accuracy and performance in complex real-world remote sensing applications.
Contribution
The paper introduces RS-Agent, a novel AI agent architecture that integrates task-aware retrieval and dual retrieval-augmented generation mechanisms for remote sensing tasks.
Findings
Achieves over 95% task planning accuracy.
Outperforms state-of-the-art models in remote sensing tasks.
Demonstrates robustness across 9 datasets and 18 tasks.
Abstract
The unprecedented advancements in Multimodal Large Language Models (MLLMs) have demonstrated strong potential in interacting with humans through both language and visual inputs to perform downstream tasks such as visual question answering and scene understanding. However, these models are constrained to basic instruction-following or descriptive tasks, facing challenges in complex real-world remote sensing applications that require specialized tools and knowledge. To address these limitations, we propose RS-Agent, an AI agent designed to interact with human users and autonomously leverage specialized models to address the demands of real-world remote sensing applications. RS-Agent integrates four key components: a Central Controller based on large language models, a dynamic toolkit for tool execution, a Solution Space for task-specific expert guidance, and a Knowledge Space for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation
