TL;DR
This paper introduces a Dynamic Inference Network that models person-specific spatio-temporal interactions for group activity recognition, improving accuracy and efficiency over previous methods by dynamically predicting relations and interaction paths.
Contribution
The paper presents a novel Dynamic Inference Network with relation and walk modules that dynamically construct person-specific interaction graphs for better activity recognition.
Findings
Significant accuracy improvements over state-of-the-art methods.
Reduced computational overhead in reasoning modules.
Effective modeling of person-specific interactions.
Abstract
Group activity recognition aims to understand the activity performed by a group of people. In order to solve it, modeling complex spatio-temporal interactions is the key. Previous methods are limited in reasoning on a predefined graph, which ignores the inherent person-specific interaction context. Moreover, they adopt inference schemes that are computationally expensive and easily result in the over-smoothing problem. In this paper, we manage to achieve spatio-temporal person-specific inferences by proposing Dynamic Inference Network (DIN), which composes of Dynamic Relation (DR) module and Dynamic Walk (DW) module. We firstly propose to initialize interaction fields on a primary spatio-temporal graph. Within each interaction field, we apply DR to predict the relation matrix and DW to predict the dynamic walk offsets in a joint-processing manner, thus forming a person-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
