TL;DR
Agri-R1 introduces a reasoning-enhanced large model for agricultural disease diagnosis that leverages automated reasoning data synthesis and reinforcement learning, achieving high performance with limited labeled data.
Contribution
The paper presents a novel framework combining automated reasoning data generation and RL training with domain-specific rewards for agriculture-related vision-language tasks.
Findings
Achieved 27.9% relative gain in disease recognition accuracy.
Improved agricultural knowledge QA by 33.3%.
Enhanced cross-domain generalization by 26.10 points.
Abstract
Agricultural disease diagnosis challenges VLMs, as conventional fine-tuning requires extensive labels, lacks interpretability, and generalizes poorly. While reasoning improves model robustness, existing methods rely on costly expert annotations and rarely address the open-ended, diverse nature of agricultural queries. To address these limitations, we propose \textbf{Agri-R1}, a reasoning-enhanced large model for agriculture. Our framework automates high-quality reasoning data generation via vision-language synthesis and LLM-based filtering, using only 19\% of available samples. Training employs Group Relative Policy Optimization (GRPO) with a novel reward function that integrates domain-specific lexicons and fuzzy matching to assess both correctness and linguistic flexibility in open-ended responses. Evaluated on CDDMBench, our resulting 3B-parameter model achieves performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
