TEA: Temporal Excitation and Aggregation for Action Recognition
Yan Li, Bin Ji, Xintian Shi, Jianguo Zhang, Bin Kang and, Limin Wang

TL;DR
The paper introduces TEA, a novel block for action recognition in videos that effectively models both short- and long-range temporal dynamics using motion excitation and hierarchical aggregation, achieving high accuracy with low computational cost.
Contribution
It proposes the TEA block with motion excitation and hierarchical temporal aggregation modules, enhancing temporal modeling without extra parameters.
Findings
Achieves high accuracy on Kinetics, Something-Something, HMDB51, and UCF101.
Operates with low FLOPs, demonstrating efficiency.
Effectively captures both short- and long-range temporal features.
Abstract
Temporal modeling is key for action recognition in videos. It normally considers both short-range motions and long-range aggregations. In this paper, we propose a Temporal Excitation and Aggregation (TEA) block, including a motion excitation (ME) module and a multiple temporal aggregation (MTA) module, specifically designed to capture both short- and long-range temporal evolution. In particular, for short-range motion modeling, the ME module calculates the feature-level temporal differences from spatiotemporal features. It then utilizes the differences to excite the motion-sensitive channels of the features. The long-range temporal aggregations in previous works are typically achieved by stacking a large number of local temporal convolutions. Each convolution processes a local temporal window at a time. In contrast, the MTA module proposes to deform the local convolution to a group of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
TEA: Temporal Excitation and Aggregation for Action Recognition· youtube
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
MethodsConvolution
