Grassmannian learning mutual subspace method for image set recognition

Lincon S. Souza; Naoya Sogi; Bernardo B. Gatto; Takumi Kobayashi and; Kazuhiro Fukui

arXiv:2111.04352·cs.CV·November 9, 2021·1 cites

Grassmannian learning mutual subspace method for image set recognition

Lincon S. Souza, Naoya Sogi, Bernardo B. Gatto, Takumi Kobayashi and, Kazuhiro Fukui

PDF

Open Access

TL;DR

This paper introduces G-LMSM, a novel neural network layer that improves image set recognition by representing sets as subspaces on the Grassmann manifold and matching them via canonical angles, trained end-to-end.

Contribution

It proposes a Grassmannian learning mutual subspace method (G-LMSM) that embeds subspace learning into CNNs for more effective image set classification.

Findings

01

Effective on hand shape recognition tasks

02

Improves face identification accuracy

03

Stable and efficient training with Riemannian optimization

Abstract

This paper addresses the problem of object recognition given a set of images as input (e.g., multiple camera sources and video frames). Convolutional neural network (CNN)-based frameworks do not exploit these sets effectively, processing a pattern as observed, not capturing the underlying feature distribution as it does not consider the variance of images in the set. To address this issue, we propose the Grassmannian learning mutual subspace method (G-LMSM), a NN layer embedded on top of CNNs as a classifier, that can process image sets more effectively and can be trained in an end-to-end manner. The image set is represented by a low-dimensional input subspace; and this input subspace is matched with reference subspaces by a similarity of their canonical angles, an interpretable and easy to compute metric. The key idea of G-LMSM is that the reference subspaces are learned as points on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Video Surveillance and Tracking Methods · Face recognition and analysis