Reconciliation of Statistical and Spatial Sparsity For Robust Image and Image-Set Classification
Hao Cheng, Kim-Hui Yap, and Bihan Wen

TL;DR
This paper introduces J3S, a novel joint statistical and spatial sparse representation method that models local patch structures and global distributions for robust image and image-set classification, especially in noisy or limited data scenarios.
Contribution
The paper proposes the first method to jointly utilize global statistics and local patch structures via sparse representation for image classification.
Findings
J3S outperforms state-of-the-art methods on multiple datasets.
The approach effectively handles noisy and limited data scenarios.
Experimental results demonstrate improved robustness and accuracy.
Abstract
Recent image classification algorithms, by learning deep features from large-scale datasets, have achieved significantly better results comparing to the classic feature-based approaches. However, there are still various challenges of image classifications in practice, such as classifying noisy image or image-set queries and training deep image classification models over the limited-scale dataset. Instead of applying generic deep features, the model-based approaches can be more effective and data-efficient for robust image and image-set classification tasks, as various image priors are exploited for modeling the inter- and intra-set data variations while preventing over-fitting. In this work, we propose a novel Joint Statistical and Spatial Sparse representation, dubbed \textit{J3S}, to model the image or image-set data for classification, by reconciling both their local patch structures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
