Heatmap Pooling Network for Action Recognition from RGB Videos
Mengyuan Liu, Jinfu Liu, Yongkang Jiang, Bin He

TL;DR
The paper introduces HP-Net, a novel heatmap pooling network that improves human action recognition in RGB videos by extracting robust, information-rich features and integrating multimodal data, outperforming existing methods.
Contribution
The paper proposes a heatmap pooling network with feedback pooling, spatial-motion co-learning, and text refinement modules for enhanced action recognition from RGB videos.
Findings
Outperforms existing action recognition methods on multiple benchmarks.
Demonstrates robustness and efficiency in feature extraction.
Effectively integrates multimodal data for improved accuracy.
Abstract
Human action recognition (HAR) in videos has garnered widespread attention due to the rich information in RGB videos. Nevertheless, existing methods for extracting deep features from RGB videos face challenges such as information redundancy, susceptibility to noise and high storage costs. To address these issues and fully harness the useful information in videos, we propose a novel heatmap pooling network (HP-Net) for action recognition from videos, which extracts information-rich, robust and concise pooled features of the human body in videos through a feedback pooling module. The extracted pooled features demonstrate obvious performance advantages over the previously obtained pose data and heatmap features from videos. In addition, we design a spatial-motion co-learning module and a text refinement modulation module to integrate the extracted pooled features with other multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Hand Gesture Recognition Systems
