2-speed network ensemble for efficient classification of incremental land-use/land-cover satellite image chips

Michael James Horry; Subrata Chakraborty; Biswajeet Pradhan; Nagesh Shukla; Sanjoy Paul

arXiv:2203.08267·cs.CV·March 17, 2022

Open Access

TL;DR

This paper introduces a two-speed ensemble approach combining a high-accuracy vision transformer and a fast CNN to efficiently classify large-scale satellite images incrementally, improving scalability and cost-effectiveness.

Contribution

It presents a novel ensemble method with staggered training schedules that enhances incremental satellite image classification efficiency.

Findings

01. Ensemble models outperform individual components in accuracy.

02. The approach scales well with large satellite datasets.

03. Achieves up to 65% accuracy on the test set.

Abstract

The ever-growing volume of satellite imagery data presents a challenge for industry and governments making data-driven decisions based on the timely analysis of very large data sets. Commonly used deep learning algorithms for automatic classification of satellite images are time- and resource-intensive to train. The cost of retraining in the context of Big Data presents a practical challenge when new image data and/or classes are added to a training corpus. Recognizing the need for an adaptable, accurate, and scalable satellite image chip classification scheme, in this research we present an ensemble of: i) a slow-to-train but high-accuracy vision transformer; and ii) a fast-to-train, low-parameter convolutional neural network. The vision transformer model provides a scalable and accurate foundation model. The high-speed CNN provides an efficient means of incorporating newly labelled…
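The two-model ensemble and staggered retraining described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the combination rule (a weighted average of softmax probabilities), the `slow_weight` parameter, the `slow_period` retraining interval, and the function names are all assumptions made for illustration, since the listing does not state how the component predictions are merged or how often each model is retrained.

```python
from math import exp


def softmax(logits):
    """Convert raw class scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_predict(slow_logits, fast_logits, slow_weight=0.5):
    """Weighted soft vote over two models' class probabilities.

    slow_logits: scores from the slow, high-accuracy model (e.g. a ViT).
    fast_logits: scores from the fast, low-parameter model (e.g. a CNN).
    slow_weight: hypothetical mixing weight, not a value from the paper.
    """
    slow_probs = softmax(slow_logits)
    fast_probs = softmax(fast_logits)
    mixed = [
        slow_weight * s + (1.0 - slow_weight) * f
        for s, f in zip(slow_probs, fast_probs)
    ]
    # Predicted class is the index with the highest mixed probability.
    label = max(range(len(mixed)), key=mixed.__getitem__)
    return label, mixed


def models_to_retrain(increment, slow_period=2):
    """List the models due for retraining when a data increment arrives.

    Assumed schedule: the fast model is retrained on every increment,
    the slow model only on every `slow_period`-th increment.
    """
    due = ["fast_cnn"]
    if increment % slow_period == 0:
        due.append("vision_transformer")
    return due


if __name__ == "__main__":
    for step in range(1, 5):
        print(step, models_to_retrain(step))
    print(ensemble_predict([2.0, 0.0, 0.0], [0.0, 1.0, 0.0]))
```

Under this assumed schedule the expensive model is refreshed less often than the cheap one, so newly labelled data reaches the ensemble through the fast model between slow-model retrains.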

Taxonomy

Topics: Remote-Sensing Image Classification · CCD and CMOS Imaging Sensors

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Residual Connection · Vision Transformer