Spatio-temporal Relation Modeling for Few-shot Action Recognition
Anirudh Thatipelli, Sanath Narayan, Salman Khan, Rao Muhammad Anwer,, Fahad Shahbaz Khan, Bernard Ghanem

TL;DR
This paper introduces STRM, a novel few-shot action recognition framework that enhances feature discriminability and temporal understanding through spatio-temporal enrichment modules, achieving state-of-the-art results on multiple benchmarks.
Contribution
The paper proposes a new spatio-temporal enrichment module and a query-class similarity classifier for improved few-shot action recognition.
Findings
Achieves state-of-the-art accuracy on Kinetics, SSv2, HMDB51, and UCF101.
Provides a 3.5% accuracy gain on SSv2 over previous best methods.
Demonstrates the effectiveness of local patch and global frame enrichment in action recognition.
Abstract
We propose a novel few-shot action recognition framework, STRM, which enhances class-specific feature discriminability while simultaneously learning higher-order temporal representations. The focus of our approach is a novel spatio-temporal enrichment module that aggregates spatial and temporal contexts with dedicated local patch-level and global frame-level feature enrichment sub-modules. Local patch-level enrichment captures the appearance-based characteristics of actions. On the other hand, global frame-level enrichment explicitly encodes the broad temporal context, thereby capturing the relevant object features over time. The resulting spatio-temporally enriched representations are then utilized to learn the relational matching between query and support action sub-sequences. We further introduce a query-class similarity classifier on the patch-level enriched features to enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Diabetic Foot Ulcer Assessment and Management
