Automatic Fused Multimodal Deep Learning for Plant Identification
Alfreds Lapkovskis, Natalia Nefedova, Ali Beikmohammadi

TL;DR
This paper presents an innovative deep learning approach that automatically fuses multiple plant data modalities for improved classification accuracy, leveraging architecture search and a new multimodal dataset.
Contribution
It introduces a novel multimodal fusion architecture search method and a new dataset, Multimodal-PlantCLEF, for plant classification, achieving superior accuracy over existing methods.
Findings
Achieved 82.61% accuracy on Multimodal-PlantCLEF
Outperformed late fusion by 10.33%
Demonstrated robustness to missing modalities
Abstract
Plant classification is vital for ecological conservation and agricultural productivity, enhancing our understanding of plant growth dynamics and aiding species preservation. The advent of deep learning (DL) techniques has revolutionized this field by enabling autonomous feature extraction, significantly reducing the dependence on manual expertise. However, conventional DL models often rely solely on single data sources, failing to capture the full biological diversity of plant species comprehensively. Recent research has turned to multimodal learning to overcome this limitation by integrating multiple data types, which enriches the representation of plant characteristics. This shift introduces the challenge of determining the optimal point for modality fusion. In this paper, we introduce a pioneering multimodal DL-based approach for plant classification with automatic modality fusion.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Agriculture and AI · Leaf Properties and Growth Measurement
