GEE-OPs: An Operator Knowledge Base for Geospatial Code Generation on the Google Earth Engine Platform Powered by Large Language Models
Shuyang Hou, Jianyuan Liang, Anqi Zhao, Huayi Wu

TL;DR
This paper introduces a structured geospatial operator knowledge base for the Google Earth Engine platform, enhancing large language model performance in geospatial code generation through systematic knowledge extraction and integration.
Contribution
It presents a novel framework for building a geospatial operator knowledge base tailored to GEE, improving LLM-based code generation accuracy and efficiency.
Findings
Achieves over 90% accuracy, recall, and F1 in operator knowledge extraction.
Improves LLM code generation performance by 20-30%.
Demonstrates the effectiveness of the knowledge base through ablation studies.
Abstract
As the scale and complexity of spatiotemporal data continue to grow rapidly, the use of geospatial modeling on the Google Earth Engine (GEE) platform presents dual challenges: improving the coding efficiency of domain experts and enhancing the coding capabilities of interdisciplinary users. To address these challenges and improve the performance of large language models (LLMs) in geospatial code generation tasks, we propose a framework for building a geospatial operator knowledge base tailored to the GEE JavaScript API. This framework consists of an operator syntax knowledge table, an operator relationship frequency table, an operator frequent pattern knowledge table, and an operator relationship chain knowledge table. By leveraging Abstract Syntax Tree (AST) techniques and frequent itemset mining, we systematically extract operator knowledge from 185,236 real GEE scripts and syntax…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Geographic Information Systems Studies
MethodsBalanced Selection · Generative Emotion Estimator
