Pose Embeddings: A Deep Architecture for Learning to Match Human Poses

Greg Mori; Caroline Pantofaru; Nisarg Kothari; Thomas Leung; George; Toderici; Alexander Toshev; Weilong Yang

arXiv:1507.00302·cs.CV·July 2, 2015·24 cites

Pose Embeddings: A Deep Architecture for Learning to Match Human Poses

Greg Mori, Caroline Pantofaru, Nisarg Kothari, Thomas Leung, George, Toderici, Alexander Toshev, Weilong Yang

PDF

Open Access

TL;DR

This paper introduces a deep learning-based pose embedding method that directly compares human poses in images, bypassing joint estimation, and shows promising results in pose matching and retrieval tasks.

Contribution

It proposes a novel deep architecture for learning pose embeddings using a triplet-based criterion, improving pose comparison without joint detection.

Findings

01

Effective pose matching and retrieval demonstrated on video data

02

Embedding captures pose similarities accurately

03

Avoids challenges of joint position estimation

Abstract

We present a method for learning an embedding that places images of humans in similar poses nearby. This embedding can be used as a direct method of comparing images based on human pose, avoiding potential challenges of estimating body joint positions. Pose embedding learning is formulated under a triplet-based distance criterion. A deep architecture is used to allow learning of a representation capable of making distinctions between different poses. Experiments on human pose matching and retrieval from video data demonstrate the potential of the method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Hand Gesture Recognition Systems