Representation Learning with Deep Extreme Learning Machines for Efficient Image Set Classification
Muhammad Uzair, Faisal Shafait, Bernard Ghanem, Ajmal Mian

TL;DR
This paper introduces a fast and effective method using Deep Extreme Learning Machines to represent image sets without prior assumptions, improving classification accuracy and speed on multiple datasets.
Contribution
It proposes a novel image set representation method with Deep Extreme Learning Machines that is efficient, assumption-free, and generalizes well with limited data.
Findings
Outperforms state-of-the-art methods in accuracy
Runs faster than existing approaches
Effective on diverse public datasets
Abstract
Efficient and accurate joint representation of a collection of images, that belong to the same class, is a major research challenge for practical image set classification. Existing methods either make prior assumptions about the data structure, or perform heavy computations to learn structure from the data itself. In this paper, we propose an efficient image set representation that does not make any prior assumptions about the structure of the underlying data. We learn the non-linear structure of image sets with Deep Extreme Learning Machines (DELM) that are very efficient and generalize well even on a limited number of training samples. Extensive experiments on a broad range of public datasets for image set classification (Honda/UCSD, CMU Mobo, YouTube Celebrities, Celebrity-1000, ETH-80) show that the proposed algorithm consistently outperforms state-of-the-art image set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Face and Expression Recognition
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
