Invariant Deep Compressible Covariance Pooling for Aerial Scene Categorization
Shidong Wang, Yi Ren, Gerard Parr, Yu Guan, Ling Shao

TL;DR
This paper introduces a novel deep learning method called IDCCP that creates invariant, compact feature representations for aerial scene images, effectively handling nuisance variations and outperforming existing methods.
Contribution
The paper proposes a new invariant deep compressible covariance pooling method that reduces feature dimensions significantly while maintaining high accuracy in aerial scene categorization.
Findings
IDCCP reduces feature dimension by about 98% with less than 0.5% accuracy loss.
The method outperforms state-of-the-art techniques on aerial scene datasets.
Invariance to transformations improves categorization robustness.
Abstract
Learning discriminative and invariant feature representation is the key to visual image categorization. In this article, we propose a novel invariant deep compressible covariance pooling (IDCCP) to solve nuisance variations in aerial scene categorization. We consider transforming the input image according to a finite transformation group that consists of multiple confounding orthogonal matrices, such as the D4 group. Then, we adopt a Siamese-style network to transfer the group structure to the representation space, where we can derive a trivial representation that is invariant under the group action. The linear classifier trained with trivial representation will also be possessed with invariance. To further improve the discriminative power of representation, we extend the representation to the tensor space while imposing orthogonal constraints on the transformation matrix to effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods1x1 Convolution · Residual Connection · Convolution · Average Pooling · Bottleneck Residual Block · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Residual Block · Kaiming Initialization
