Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition
Sumin Lee, Yooseung Wang, Sangmin Woo, Changick Kim

TL;DR
This paper introduces SPDP-Net, a dual-path model that leverages spatio-temporal proximity for improved panoramic activity recognition, achieving state-of-the-art results on the JRDB-PAR dataset.
Contribution
The paper proposes a novel dual-path architecture and emphasizes the importance of spatio-temporal proximity in social relation encoding for PAR.
Findings
SPDP-Net outperforms previous methods with 46.5% F1 score on JRDB-PAR.
Spatio-temporal proximity significantly improves activity recognition accuracy.
Dual-path architecture effectively captures multi-granular human activities.
Abstract
Panoramic Activity Recognition (PAR) seeks to identify diverse human activities across different scales, from individual actions to social group and global activities in crowded panoramic scenes. PAR presents two major challenges: 1) recognizing the nuanced interactions among numerous individuals and 2) understanding multi-granular human activities. To address these, we propose Social Proximity-aware Dual-Path Network (SPDP-Net) based on two key design principles. First, while previous works often focus on spatial distance among individuals within an image, we argue to consider the spatio-temporal proximity. It is crucial for individual relation encoding to correctly understand social dynamics. Secondly, deviating from existing hierarchical approaches (individual-to-social-to-global activity), we introduce a dual-path architecture for multi-granular activity recognition. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Multimodal Machine Learning Applications
MethodsFocus
