Unsupervised View-Invariant Human Posture Representation
Faegheh Sardari, Bj\"orn Ommer, Majid Mirmehdi

TL;DR
This paper introduces an unsupervised method for extracting view-invariant 3D human pose representations from 2D images, eliminating the need for 3D skeleton data and improving cross-view action recognition.
Contribution
The novel approach learns view-invariant human pose features from 2D images without 3D data, leveraging view and equivariance properties, and demonstrates superior performance on multiple datasets.
Findings
Improves cross-view action classification accuracy on NTU RGB+D.
Achieves first unsupervised cross-view and cross-subject rank correlation on QMAR.
Marginally outperforms supervised methods on QMAR.
Abstract
Most recent view-invariant action recognition and performance assessment approaches rely on a large amount of annotated 3D skeleton data to extract view-invariant features. However, acquiring 3D skeleton data can be cumbersome, if not impractical, in in-the-wild scenarios. To overcome this problem, we present a novel unsupervised approach that learns to extract view-invariant 3D human pose representation from a 2D image without using 3D joint data. Our model is trained by exploiting the intrinsic view-invariant properties of human pose between simultaneous frames from different viewpoints and their equivariant properties between augmented frames from the same viewpoint. We evaluate the learned view-invariant pose representations for two downstream tasks. We perform comparative experiments that show improvements on the state-of-the-art unsupervised cross-view action classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Diabetic Foot Ulcer Assessment and Management · Hand Gesture Recognition Systems
