AutoGEEval: A Multimodal and Automated Framework for Geospatial Code Generation on GEE with Large Language Models
Shuyang Hou, Zhangxiao Shen, Huayi Wu, Jianyuan Liang, Haoyue Jiao, Yaxian Qing, Xiaopu Zhang, Xu Li, Zhipeng Gui, Xuefeng Guan, Longgang Xiang

TL;DR
AutoGEEval is a comprehensive, automated evaluation framework for geospatial code generation on Google Earth Engine, enabling standardized assessment of large language models' performance across diverse data types and tasks.
Contribution
It introduces the first multimodal, unit-level automated evaluation framework and benchmark suite for GEE code generation using LLMs, facilitating standardized assessment and development.
Findings
Evaluated 18 LLMs revealing their strengths and weaknesses in GEE code generation.
AutoGEEval enables multidimensional analysis including accuracy and efficiency.
Benchmark suite covers 1325 test cases across 26 GEE data types.
Abstract
Geospatial code generation is emerging as a key direction in the integration of artificial intelligence and geoscientific analysis. However, there remains a lack of standardized tools for automatic evaluation in this domain. To address this gap, we propose AutoGEEval, the first multimodal, unit-level automated evaluation framework for geospatial code generation tasks on the Google Earth Engine (GEE) platform powered by large language models (LLMs). Built upon the GEE Python API, AutoGEEval establishes a benchmark suite (AutoGEEval-Bench) comprising 1325 test cases that span 26 GEE data types. The framework integrates both question generation and answer verification components to enable an end-to-end automated evaluation pipeline-from function invocation to execution validation. AutoGEEval supports multidimensional quantitative analysis of model outputs in terms of accuracy, resource…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies · Data Visualization and Analytics · Constraint Satisfaction and Optimization
MethodsGenerative Emotion Estimator
