EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian, Yichao Yan, Guangtao Zhai, Guodong Guo, and Zhiyong Gao

TL;DR
The paper introduces the Event Adaptive Network (EAN), a novel framework for action recognition that adaptively models diverse video events using dynamic-scale kernels and sparse interaction mining, achieving state-of-the-art results.
Contribution
The paper proposes a unified adaptive framework with dynamic-scale kernels and sparse interaction modeling for improved action recognition.
Findings
Achieves state-of-the-art or competitive performance on large-scale datasets.
Utilizes dynamic-scale kernels for better event fitting.
Employs sparse Transformer-based interaction mining for global representation.
Abstract
Efficiently modeling spatial-temporal information in videos is crucial for action recognition. To achieve this goal, state-of-the-art methods typically employ the convolution operator and the dense interaction modules such as non-local blocks. However, these methods cannot accurately fit the diverse events in videos. On the one hand, the adopted convolutions are with fixed scales, thus struggling with events of various scales. On the other hand, the dense interaction modeling paradigm only achieves sub-optimal performance as action-irrelevant parts bring additional noises for the final prediction. In this paper, we propose a unified action recognition framework to investigate the dynamic nature of video content by introducing the following designs. First, when extracting local cues, we generate the spatial-temporal kernels of dynamic-scale to adaptively fit the diverse events. Second,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Stroke Rehabilitation and Recovery · Advanced Technologies in Various Fields
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Convolution · Residual Connection · Softmax · Dense Connections · Adam
