Integrating Human Parsing and Pose Network for Human Action Recognition

Runwei Ding; Yuhang Wen; Jinfu Liu; Nan Dai; Fanyang Meng; Mengyuan; Liu

arXiv:2307.07977·cs.CV·July 18, 2023

Integrating Human Parsing and Pose Network for Human Action Recognition

Runwei Ding, Yuhang Wen, Jinfu Liu, Nan Dai, Fanyang Meng, Mengyuan, Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces IPP-Net, a dual-branch network that combines human pose and parsing features to improve human action recognition accuracy using skeletons and RGB data.

Contribution

It is the first to integrate human parsing feature maps with skeleton data in a dual-branch network for action recognition.

Findings

01

Outperforms existing methods on NTU RGB+D benchmarks.

02

Effectively filters irrelevant background and outfit information.

03

Demonstrates the benefit of combining semantic body-part features with pose data.

Abstract

Human skeletons and RGB sequences are both widely-adopted input modalities for human action recognition. However, skeletons lack appearance features and color data suffer large amount of irrelevant depiction. To address this, we introduce human parsing feature map as a novel modality, since it can selectively retain spatiotemporal features of the body parts, while filtering out noises regarding outfits, backgrounds, etc. We propose an Integrating Human Parsing and Pose Network (IPP-Net) for action recognition, which is the first to leverage both skeletons and human parsing feature maps in dual-branch approach. The human pose branch feeds compact skeletal representations of different modalities in graph convolutional network to model pose features. In human parsing branch, multi-frame body-part parsing features are extracted with human detector and parser, which is later learnt using a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liujf69/ipp-net-parsing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Multimodal Machine Learning Applications