DECA: Deep viewpoint-Equivariant human pose estimation using Capsule   Autoencoders

Nicola Garau; Niccol\`o Bisagno; Piotr Br\'odka; Nicola Conci

arXiv:2108.08557·cs.CV·August 20, 2021

DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders

Nicola Garau, Niccol\`o Bisagno, Piotr Br\'odka, Nicola Conci

PDF

Open Access 1 Repo

TL;DR

DECA introduces a capsule autoencoder for human pose estimation that achieves viewpoint equivariance, enabling better generalization to unseen viewpoints and outperforming existing methods on depth and RGB images.

Contribution

The paper presents a novel capsule autoencoder with fast Variational Bayes routing that models joints as capsules, improving viewpoint generalization in human pose estimation.

Findings

01

Outperforms existing methods on depth images from unseen viewpoints

02

Achieves state-of-the-art results on RGB viewpoint transfer tasks

03

Reduces data dependency for training in human pose estimation

Abstract

Human Pose Estimation (HPE) aims at retrieving the 3D position of human joints from images or videos. We show that current 3D HPE methods suffer a lack of viewpoint equivariance, namely they tend to fail or perform poorly when dealing with viewpoints unseen at training time. Deep learning methods often rely on either scale-invariant, translation-invariant, or rotation-invariant operations, such as max-pooling. However, the adoption of such procedures does not necessarily improve viewpoint generalization, rather leading to more data-dependent methods. To tackle this issue, we propose a novel capsule autoencoder network with fast Variational Bayes capsule routing, named DECA. By modeling each joint as a capsule entity, combined with the routing algorithm, our approach can preserve the joints' hierarchical and geometrical structure in the feature space, independently from the viewpoint. By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mmlab-cv/deca
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Diabetic Foot Ulcer Assessment and Management · Video Surveillance and Tracking Methods