Weakly Supervised Instance Attention for Multisource Fine-Grained Object Recognition with an Application to Tree Species Classification
Bulut Aygunes, Ramazan Gokberk Cinbis, Selim Aksoy

TL;DR
This paper introduces a weakly supervised deep instance attention model for multisource fine-grained object recognition, demonstrating improved accuracy in tree species classification by fusing spectral, spatial, and structural data at multiple levels.
Contribution
It proposes a novel weakly supervised multi-source attention framework with four fusion levels, enhancing fine-grained classification accuracy over state-of-the-art methods.
Findings
Feature-level fusion achieved 53% accuracy in tree species classification.
All fusion levels outperformed existing methods.
Increased model capacity further improved accuracy by 6.3%.
Abstract
Multisource image analysis that leverages complementary spectral, spatial, and structural information benefits fine-grained object recognition that aims to classify an object into one of many similar subcategories. However, for multisource tasks that involve relatively small objects, even the smallest registration errors can introduce high uncertainty in the classification process. We approach this problem from a weakly supervised learning perspective in which the input images correspond to larger neighborhoods around the expected object locations where an object with a given class label is present in the neighborhood without any knowledge of its exact location. The proposed method uses a single-source deep instance attention model with parallel branches for joint localization and classification of objects, and extends this model into a multisource setting where a reference source that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
