Leveraging Vision Language Models for Specialized Agricultural Tasks

Muhammad Arbab Arshad; Talukder Zaki Jubery; Tirtho Roy; Rim Nassiri,; Asheesh K. Singh; Arti Singh; Chinmay Hegde; Baskar Ganapathysubramanian,; Aditya Balu; Adarsh Krishnamurthy; Soumik Sarkar

arXiv:2407.19617·cs.LG·March 4, 2025·3 cites

Leveraging Vision Language Models for Specialized Agricultural Tasks

Muhammad Arbab Arshad, Talukder Zaki Jubery, Tirtho Roy, Rim Nassiri,, Asheesh K. Singh, Arti Singh, Chinmay Hegde, Baskar Ganapathysubramanian,, Aditya Balu, Adarsh Krishnamurthy, Soumik Sarkar

PDF

Open Access

TL;DR

This paper introduces AgEval, a benchmark for evaluating vision language models in plant stress phenotyping, demonstrating their rapid adaptability with few examples and providing insights into their performance across diverse agricultural tasks.

Contribution

The paper presents AgEval, a new benchmark for assessing VLMs in agriculture, and analyzes their few-shot learning capabilities and performance variability across tasks.

Findings

01

VLMs significantly improve F1 scores with few-shot learning.

02

Strategic example selection enhances model reliability.

03

Performance disparities across classes are quantified by CV metrics.

Abstract

As Vision Language Models (VLMs) become increasingly accessible to farmers and agricultural experts, there is a growing need to evaluate their potential in specialized tasks. We present AgEval, a comprehensive benchmark for assessing VLMs' capabilities in plant stress phenotyping, offering a solution to the challenge of limited annotated data in agriculture. Our study explores how general-purpose VLMs can be leveraged for domain-specific tasks with only a few annotated examples, providing insights into their behavior and adaptability. AgEval encompasses 12 diverse plant stress phenotyping tasks, evaluating zero-shot and few-shot in-context learning performance of state-of-the-art models including Claude, GPT, Gemini, and LLaVA. Our results demonstrate VLMs' rapid adaptability to specialized tasks, with the best-performing model showing an increase in F1 scores from 46.24% to 73.37% in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Agriculture and AI

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Adam · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Dense Connections · Multi-Head Attention