Trunk-branch Contrastive Network with Multi-view Deformable Aggregation for Multi-view Action Recognition
Yingyuan Yang, Guoyuan Liang, Can Wang, Xiaojun Wu

TL;DR
This paper introduces TBCNet, a novel multi-view action recognition network that combines multi-view feature fusion with contrastive learning to improve the recognition accuracy by capturing detailed local features.
Contribution
The paper proposes a trunk-branch contrastive network with multi-view deformable aggregation and a weighted contrastive loss, enhancing multi-view feature fusion and detail extraction for action recognition.
Findings
Achieves state-of-the-art performance on four datasets.
Effectively captures intra- and cross-view spatial-temporal correlations.
Improves recognition accuracy over existing RGB-based methods.
Abstract
Multi-view action recognition aims to identify actions in a given multi-view scene. Traditional studies initially extracted refined features from each view, followed by implemented paired interaction and integration, but they potentially overlooked the critical local features in each view. When observing objects from multiple perspectives, individuals typically form a comprehensive impression and subsequently fill in specific details. Drawing inspiration from this cognitive process, we propose a novel trunk-branch contrastive network (TBCNet) for RGB-based multi-view action recognition. Distinctively, TBCNet first obtains fused features in the trunk block and then implicitly supplements vital details provided by the branch block via contrastive learning, generating a more informative and comprehensive action representation. Within this framework, we construct two core components: the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Anomaly Detection Techniques and Applications
