Information Elevation Network for Fast Online Action Detection
Sunah Min, Jinyoung Moon

TL;DR
This paper introduces the Information Elevation Network (IEN), a novel online action detection model that efficiently models relevant past information using the proposed IEU, outperforming existing methods on benchmark datasets.
Contribution
The paper proposes the IEU for better past information modeling and develops the IEN, the first efficient OAD network considering computational overhead, using only RGB features.
Findings
IEN outperforms state-of-the-art methods on THUMOS-14 and TVSeries datasets.
IEN achieves superior accuracy using only RGB frames.
IEN surpasses methods using two-stream features on THUMOS-14.
Abstract
Online action detection (OAD) is a task that receives video segments within a streaming video as inputs and identifies ongoing actions within them. It is important to retain past information associated with a current action. However, long short-term memory (LSTM), a popular recurrent unit for modeling temporal information from videos, accumulates past information from the previous hidden and cell states and the extracted visual features at each timestep without considering the relationships between the past and current information. Consequently, the forget gate of the original LSTM can lose the accumulated information relevant to the current action because it determines which information to forget without considering the current action. We introduce a novel information elevation unit (IEU) that lifts up and accumulate the past information relevant to the current action in order to model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
