Harnessing Large Vision and Language Models in Agriculture: A Review
Hongyan Zhu, Shuai Qin, Min Su, Chengzhi Lin, Anjie Li, Junfeng Gao

TL;DR
This review explores how large vision and language models can revolutionize agriculture by improving crop management, disease detection, and decision-making, ultimately enhancing productivity and sustainability.
Contribution
It provides a comprehensive overview of current applications and future potential of large multimodal models in agricultural tasks and decision support systems.
Findings
Large models assist in pest and disease detection.
They improve soil and seed quality assessment.
Potential to significantly boost agricultural efficiency.
Abstract
Large models can play important roles in many domains. Agriculture is another key factor affecting the lives of people around the world. It provides food, fabric, and coal for humanity. However, facing many challenges such as pests and diseases, soil degradation, global warming, and food security, how to steadily increase the yield in the agricultural sector is a problem that humans still need to solve. Large models can help farmers improve production efficiency and harvest by detecting a series of agricultural production tasks such as pests and diseases, soil quality, and seed quality. It can also help farmers make wise decisions through a variety of information, such as images, text, etc. Herein, we delve into the potential applications of large models in agriculture, from large language model (LLM) and large vision model (LVM) to large vision-language models (LVLM). After gaining a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI
