AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification

Minh-Dung Le; Minh-Duc Hoang; Hoang-Vu Truong; Thi-Thu-Hong Phan

arXiv:2605.01355·cs.CV·May 8, 2026

AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification

Minh-Dung Le, Minh-Duc Hoang, Hoang-Vu Truong, Thi-Thu-Hong Phan

PDF

TL;DR

AgriKD introduces a cross-architecture knowledge distillation method that effectively transfers knowledge from Vision Transformers to lightweight CNNs, enabling efficient leaf disease classification on edge devices with minimal accuracy loss.

Contribution

The paper proposes a novel multi-level distillation framework bridging Transformer and CNN architectures, significantly improving lightweight model performance for agricultural disease detection.

Findings

01

Student model achieves comparable accuracy to the teacher.

02

Model parameters reduced by approximately 172 times.

03

Inference latency decreased by 18-22 times.

Abstract

Automated leaf disease classification is critical for early disease detection in resource-constrained field environments. Vision Transformers (ViTs) provide strong representation capability by modeling long-range dependencies and inter-class relationships; however, their high computational cost makes them impractical for deployment on edge devices. As a result, existing approaches struggle to effectively transfer these rich representations to lightweight models. This paper introduces AgriKD, a cross-architecture knowledge distillation framework for efficient edge deployment, which transfers knowledge from a Vision Transformer (ViT) teacher to a compact convolutional student model. To bridge the representational gap between Transformer and CNN architectures, the proposed approach integrates multiple distillation objectives at the output, feature, and relational levels, where each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.