PAN: Towards Fast Action Recognition via Learning Persistence of Appearance
Can Zhang, Yuexian Zou, Guang Chen, Lei Gan

TL;DR
This paper introduces Persistent Appearance (PA), a fast and efficient motion cue for action recognition that focuses on motion boundaries, significantly reducing computation time compared to optical flow, and enhances long-term temporal modeling with VAP within the PAN framework.
Contribution
The paper proposes PA as a novel, efficient motion representation focusing on boundaries, combined with VAP for long-term dynamics, forming the PAN framework for fast and accurate action recognition.
Findings
PA is over 1000x faster than optical flow (8196fps vs. 8fps).
PAN outperforms state-of-the-art methods on six benchmarks.
The framework achieves high accuracy with low computational cost.
Abstract
Efficiently modeling dynamic motion information in videos is crucial for action recognition task. Most state-of-the-art methods heavily rely on dense optical flow as motion representation. Although combining optical flow with RGB frames as input can achieve excellent recognition performance, the optical flow extraction is very time-consuming. This undoubtably will count against real-time action recognition. In this paper, we shed light on fast action recognition by lifting the reliance on optical flow. Our motivation lies in the observation that small displacements of motion boundaries are the most critical ingredients for distinguishing actions, so we design a novel motion cue called Persistence of Appearance (PA). In contrast to optical flow, our PA focuses more on distilling the motion information at boundaries. Also, it is more efficient by only accumulating pixel-wise differences…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Gait Recognition and Analysis
