EVE: A Domain-Specific LLM Framework for Earth Intelligence
Àlex R. Atrio, Antonio Lopez, Jino Rohit, Yassine El Ouahidi, Marcello Politi, Vijayasri Iyer, Umar Jamil, Sébastien Bratières, Nicolas Longépé

TL;DR
EVE is an open-source framework for developing and deploying domain-specific large language models for Earth Intelligence, featuring a domain-adapted 24B-parameter model, new benchmarks, and deployment tools.
Contribution
It introduces EVE-Instruct, a domain-adapted LLM, along with curated datasets, benchmarks, and a production system for Earth Science applications.
Findings
EVE-Instruct outperforms comparable models on newly constructed Earth Observation and Earth Sciences benchmarks while preserving general capabilities.
The system supports 350 pilot users via API and GUI.
All resources are publicly released under open licenses.
Abstract
We introduce Earth Virtual Expert (EVE), the first open-source, end-to-end initiative for developing and deploying domain-specialized LLMs for Earth Intelligence. At its core is EVE-Instruct, a domain-adapted 24B model built on Mistral Small 3.2 and optimized for reasoning and question answering. On newly constructed Earth Observation and Earth Sciences benchmarks, it outperforms comparable models while preserving general capabilities. We release curated training corpora and the first systematic domain-specific evaluation benchmarks, covering MCQA, open-ended QA, and factuality. EVE further integrates RAG and a hallucination-detection pipeline into a production system deployed via API and GUI, supporting 350 pilot users so far. All models, datasets, and code are ready to be released under open licenses as contributions to our field at huggingface.co/eve-esa and github.com/eve-esa.
Code & Models
- eve-esa/hallucination (dataset) · 80 downloads
- eve-esa/open-ended (dataset) · 212 downloads
- eve-esa/mcqa-multiple-answers (dataset) · 97 downloads
- eve-esa/corpus (dataset) · 1.2k downloads
- eve-esa/synth (dataset) · 2.9k downloads
- eve-esa/open-ended-w-context (dataset) · 40 downloads
- eve-esa/mcqa-single-answer (dataset) · 93 downloads
- eve-esa/hallucination-detection (dataset) · 375 downloads
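
The list above includes two multiple-choice benchmarks, one with a single correct option per question and one with several. As a rough illustration of how predictions on such benchmarks could be scored, here is a minimal sketch. The metrics (accuracy, exact match, mean Jaccard overlap), the function names, and the assumption that answers are option labels such as "A" or "B" are all illustrative choices; the page does not state how the authors score these benchmarks or what the dataset schema looks like.

```python
from typing import Iterable, Sequence, Tuple


def single_answer_accuracy(predictions: Sequence[str], references: Sequence[str]) -> float:
    """Fraction of questions whose predicted option label matches the reference.

    Labels are compared case-insensitively after stripping whitespace
    (an assumption about the answer format, not taken from the datasets).
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    correct = sum(
        p.strip().upper() == r.strip().upper()
        for p, r in zip(predictions, references)
    )
    return correct / len(references)


def multi_answer_scores(
    predictions: Sequence[Iterable[str]],
    references: Sequence[Iterable[str]],
) -> Tuple[float, float]:
    """Return (exact_match, mean_jaccard) for questions with several correct options.

    exact_match counts a question only if the predicted set equals the reference set;
    mean_jaccard gives partial credit via |intersection| / |union|.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0, 0.0
    exact = 0
    jaccard_total = 0.0
    for pred, ref in zip(predictions, references):
        p, r = set(pred), set(ref)
        exact += int(p == r)
        union = p | r
        # Two empty sets are treated as a perfect match.
        jaccard_total += len(p & r) / len(union) if union else 1.0
    n = len(references)
    return exact / n, jaccard_total / n
```

Exact match is strict, so reporting it alongside a partial-credit measure such as Jaccard overlap can help separate "nearly right" multi-answer predictions from entirely wrong ones.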
