2-speed network ensemble for efficient classification of incremental land-use/land-cover satellite image chips
Michael James Horry, Subrata Chakraborty, Biswajeet Pradhan, Nagesh, Shukla, Sanjoy Paul

TL;DR
This paper introduces a two-speed ensemble approach combining a high-accuracy vision transformer and a fast CNN to efficiently classify large-scale satellite images incrementally, improving scalability and cost-effectiveness.
Contribution
It presents a novel ensemble method with staggered training schedules that enhances incremental satellite image classification efficiency.
Findings
Ensemble models outperform individual components in accuracy.
The approach scales well with large satellite datasets.
Achieves up to 65% accuracy on the test set.
Abstract
The ever-growing volume of satellite imagery data presents a challenge for industry and governments making data-driven decisions based on the timely analysis of very large data sets. Commonly used deep learning algorithms for automatic classification of satellite images are time and resource-intensive to train. The cost of retraining in the context of Big Data presents a practical challenge when new image data and/or classes are added to a training corpus. Recognizing the need for an adaptable, accurate, and scalable satellite image chip classification scheme, in this research we present an ensemble of: i) a slow to train but high accuracy vision transformer; and ii) a fast to train, low-parameter convolutional neural network. The vision transformer model provides a scalable and accurate foundation model. The high-speed CNN provides an efficient means of incorporating newly labelled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · CCD and CMOS Imaging Sensors
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Residual Connection · Vision Transformer
