AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification
Minh-Dung Le, Minh-Duc Hoang, Hoang-Vu Truong, Thi-Thu-Hong Phan

TL;DR
AgriKD introduces a cross-architecture knowledge distillation method that effectively transfers knowledge from Vision Transformers to lightweight CNNs, enabling efficient leaf disease classification on edge devices with minimal accuracy loss.
Contribution
The paper proposes a novel multi-level distillation framework bridging Transformer and CNN architectures, significantly improving lightweight model performance for agricultural disease detection.
Findings
Student model achieves comparable accuracy to the teacher.
Model parameters reduced by approximately 172 times.
Inference latency decreased by 18-22 times.
Abstract
Automated leaf disease classification is critical for early disease detection in resource-constrained field environments. Vision Transformers (ViTs) provide strong representation capability by modeling long-range dependencies and inter-class relationships; however, their high computational cost makes them impractical for deployment on edge devices. As a result, existing approaches struggle to effectively transfer these rich representations to lightweight models. This paper introduces AgriKD, a cross-architecture knowledge distillation framework for efficient edge deployment, which transfers knowledge from a Vision Transformer (ViT) teacher to a compact convolutional student model. To bridge the representational gap between Transformer and CNN architectures, the proposed approach integrates multiple distillation objectives at the output, feature, and relational levels, where each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
