Leveraging Vision Language Models for Specialized Agricultural Tasks
Muhammad Arbab Arshad, Talukder Zaki Jubery, Tirtho Roy, Rim Nassiri,, Asheesh K. Singh, Arti Singh, Chinmay Hegde, Baskar Ganapathysubramanian,, Aditya Balu, Adarsh Krishnamurthy, Soumik Sarkar

TL;DR
This paper introduces AgEval, a benchmark for evaluating vision language models in plant stress phenotyping, demonstrating their rapid adaptability with few examples and providing insights into their performance across diverse agricultural tasks.
Contribution
The paper presents AgEval, a new benchmark for assessing VLMs in agriculture, and analyzes their few-shot learning capabilities and performance variability across tasks.
Findings
VLMs significantly improve F1 scores with few-shot learning.
Strategic example selection enhances model reliability.
Performance disparities across classes are quantified by CV metrics.
Abstract
As Vision Language Models (VLMs) become increasingly accessible to farmers and agricultural experts, there is a growing need to evaluate their potential in specialized tasks. We present AgEval, a comprehensive benchmark for assessing VLMs' capabilities in plant stress phenotyping, offering a solution to the challenge of limited annotated data in agriculture. Our study explores how general-purpose VLMs can be leveraged for domain-specific tasks with only a few annotated examples, providing insights into their behavior and adaptability. AgEval encompasses 12 diverse plant stress phenotyping tasks, evaluating zero-shot and few-shot in-context learning performance of state-of-the-art models including Claude, GPT, Gemini, and LLaVA. Our results demonstrate VLMs' rapid adaptability to specialized tasks, with the best-performing model showing an increase in F1 scores from 46.24% to 73.37% in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Adam · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Dense Connections · Multi-Head Attention
