AgriBench: A Hierarchical Agriculture Benchmark for Multimodal Large Language Models

Yutong Zhou; Masahiro Ryo

arXiv:2412.00465 · cs.CV · December 24, 2024



TL;DR

This paper introduces AgriBench, a hierarchical benchmark for evaluating multimodal large language models (MM-LLMs) on agriculture tasks. It is supported by MM-LUCAS, a new dataset of 1,784 landscape images with segmentation masks, depth maps, and land cover and land use annotations.

Contribution

The paper presents the first agriculture-specific benchmark for multimodal LLMs and introduces MM-LUCAS, a rich dataset with diverse agricultural and geographical annotations.

Findings

01 AgriBench enables systematic evaluation of agriculture MM-LLMs.

02 MM-LUCAS provides extensive multimodal agricultural data.

03 The work offers insights for future agriculture AI developments.

Abstract

We introduce AgriBench, the first agriculture benchmark designed to evaluate MultiModal Large Language Models (MM-LLMs) for agriculture applications. To further address the limited availability of agriculture knowledge-based datasets, we propose MM-LUCAS, a multimodal agriculture dataset that includes 1,784 landscape images, segmentation masks, depth maps, and detailed annotations (geographical location, country, date, land cover and land use taxonomic details, quality scores, aesthetic scores, etc.). MM-LUCAS is based on the Land Use/Cover Area Frame Survey (LUCAS) dataset, which contains comparable statistics on land use and land cover for the European Union (EU) territory. This work presents a new perspective on advancing agriculture MM-LLMs and is still in progress, offering insights for future developments in expert knowledge-based MM-LLMs.
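To make the annotation types listed in the abstract concrete, the following is a minimal sketch of how one MM-LUCAS sample might be represented in Python. The class name, field names, types, and the helper function are all assumptions made for illustration; the abstract does not specify the dataset's actual schema or file layout.

```python
from dataclasses import dataclass
from datetime import date
from pathlib import Path
from typing import List, Optional


@dataclass(frozen=True)
class LucasSample:
    """Hypothetical record for one MM-LUCAS sample.

    Field names and types are assumptions based on the annotation
    types named in the abstract, not the dataset's real schema.
    """
    image_path: Path          # landscape image
    mask_path: Path           # segmentation mask
    depth_path: Path          # depth map
    latitude: float           # geographical location
    longitude: float
    country: str
    survey_date: date
    land_cover: str           # land cover taxonomic label
    land_use: str             # land use taxonomic label
    quality_score: Optional[float] = None
    aesthetic_score: Optional[float] = None


def filter_by_country(samples: List[LucasSample], country: str) -> List[LucasSample]:
    """Return the samples whose country field matches, preserving order."""
    return [s for s in samples if s.country == country]
```

A typed, immutable record like this keeps the per-image modalities and metadata together, so subsets for a given country or land use class can be selected with ordinary list filtering.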


Code & Models

Repositories

yutong-zhou-cv/agribench (Official)


Taxonomy

Topics: Natural Language Processing Techniques