AgriChat: A Multimodal Large Language Model for Agriculture Image Understanding
Abderrahmene Boudiaf, Irfan Hussain, Sajid Javed

TL;DR
AgriChat introduces a multimodal large language model tailored for agriculture, utilizing a novel dataset and verification pipeline to enhance accuracy and trustworthiness in agricultural image understanding tasks.
Contribution
The paper presents the V2VK annotation pipeline and the AgriMM benchmark, enabling the development of a specialized MLLM with verified domain knowledge for agriculture.
Findings
AgriChat outperforms other open-source models on agricultural tasks.
The AgriMM benchmark contains over 3,000 classes and 607k VQAs.
Grounding training data in verified literature reduces hallucinations.
Abstract
The deployment of Multimodal Large Language Models (MLLMs) in agriculture is currently stalled by a critical trade-off: the existing literature lacks the large-scale agricultural datasets required for robust model development and evaluation, while current state-of-the-art models lack the verified domain expertise necessary to reason across diverse taxonomies. To address these challenges, we propose the Vision-to-Verified-Knowledge (V2VK) pipeline, a novel generative AI-driven annotation framework that integrates visual captioning with web-augmented scientific retrieval to autonomously generate the AgriMM benchmark, effectively eliminating biological hallucinations by grounding training data in verified phytopathological literature. The AgriMM benchmark contains over 3,000 agricultural classes and more than 607k VQAs spanning multiple tasks, including fine-grained plant species…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Multimodal Machine Learning Applications · Advanced Neural Network Applications
