Spatial-Spectral Hyperspectral Classification based on Learnable 3D Group Convolution
Guandong Li, Mengxia Ye

TL;DR
This paper introduces LGCNet, a learnable group convolution network that dynamically optimizes spectral-spatial feature extraction in hyperspectral image classification, improving accuracy and efficiency on edge devices.
Contribution
The paper proposes a novel learnable group convolution module that adaptively determines channel grouping, enhancing feature representation and computational efficiency in hyperspectral image classification.
Findings
Outperforms mainstream methods on Indian Pines, Pavia University, and KSC datasets.
Achieves higher accuracy and faster inference speed.
Effectively captures diverse spectral-spatial features.
Abstract
Deep neural networks have faced many problems in hyperspectral image classification, including the ineffective utilization of spectral-spatial joint information and the problems of gradient vanishing and overfitting that arise with increasing depth. In order to accelerate the deployment of models on edge devices with strict latency requirements and limited computing power, this paper proposes a learnable group convolution network (LGCNet) based on an improved 3D-DenseNet model and a lightweight model design. The LGCNet module improves the shortcomings of group convolution by introducing a dynamic learning method for the input channels and convolution kernel grouping, enabling flexible grouping structures and generating better representation ability. Through the overall loss and gradient of the backpropagation network, the 3D group convolution is dynamically determined and updated in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Remote Sensing and Land Use · Image Retrieval and Classification Techniques
MethodsConvolution · 3D Convolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
