Embedded Self-Distillation in Compact Multi-Branch Ensemble Network for Remote Sensing Scene Classification
Qi Zhao, Yujing Ma, Shuchang Lyu, Lijiang Chen

TL;DR
This paper introduces a compact multi-branch ensemble network with embedded self-distillation for remote sensing scene classification, achieving high accuracy while maintaining efficiency by simplifying the model during inference.
Contribution
It proposes a novel end-to-end trainable multi-branch ensemble network with embedded self-distillation, improving classification accuracy and interpretability in remote sensing tasks.
Findings
Achieves superior accuracy on three benchmark datasets.
Maintains model simplicity and efficiency during inference.
Outperforms previous state-of-the-art models.
Abstract
Remote sensing (RS) image scene classification task faces many challenges due to the interference from different characteristics of different geographical elements. To solve this problem, we propose a multi-branch ensemble network to enhance the feature representation ability by fusing features in final output logits and intermediate feature maps. However, simply adding branches will increase the complexity of models and decline the inference efficiency. On this issue, we embed self-distillation (SD) method to transfer knowledge from ensemble network to main-branch in it. Through optimizing with SD, main-branch will have close performance as ensemble network. During inference, we can cut other branches to simplify the whole model. In this paper, we first design compact multi-branch ensemble network, which can be trained in an end-to-end manner. Then, we insert SD method on output logits…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
