Dual-Stream Global-Local Feature Collaborative Representation Network for Scene Classification of Mining Area
Shuqi Fan, Haoyi Wang, Xianju Li

TL;DR
This paper introduces a dual-stream neural network that combines global and local features through collaborative representation to improve scene classification accuracy in complex mining areas, leveraging multi-source data and multi-scale features.
Contribution
It presents a novel dual-branch fusion model with multi-scale global transformer and local enhancement modules for mining scene classification, outperforming existing models.
Findings
Overall accuracy of 83.63% achieved
Model outperforms comparative models on all metrics
Effective integration of global and local features enhances classification
Abstract
Scene classification of mining areas provides accurate foundational data for geological environment monitoring and resource development planning. This study fuses multi-source data to construct a multi-modal mine land cover scene classification dataset. A significant challenge in mining area classification lies in the complex spatial layout and multi-scale characteristics. By extracting global and local features, it becomes possible to comprehensively reflect the spatial distribution, thereby enabling a more accurate capture of the holistic characteristics of mining scenes. We propose a dual-branch fusion model utilizing collaborative representation to decompose global features into a set of key semantic vectors. This model comprises three key components:(1) Multi-scale Global Transformer Branch: It leverages adjacent large-scale features to generate global channel attention features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction
