RED: Reinforced Encoder-Decoder Networks for Action Anticipation

Jiyang Gao; Zhenheng Yang; Ram Nevatia

arXiv:1707.04818·cs.CV·July 18, 2017·23 cites

RED: Reinforced Encoder-Decoder Networks for Action Anticipation

Jiyang Gao, Zhenheng Yang, Ram Nevatia

PDF

Open Access 1 Repo

TL;DR

This paper introduces RED, a reinforcement learning-based encoder-decoder network that anticipates future actions by leveraging multiple past representations, enabling earlier and more accurate predictions in various datasets.

Contribution

The paper proposes a novel RED network that uses multiple history inputs and reinforcement learning to improve early and accurate action anticipation.

Findings

01

Achieves state-of-the-art results on TVSeries, THUMOS-14, and TV-Human-Interaction datasets.

02

Outperforms existing methods in early action prediction accuracy.

03

Demonstrates the effectiveness of reinforcement learning in sequence-level supervision for anticipation.

Abstract

Action anticipation aims to detect an action before it happens. Many real world applications in robotics and surveillance are related to this predictive capability. Current methods address this problem by first anticipating visual representations of future frames and then categorizing the anticipated representations to actions. However, anticipation is based on a single past frame's representation, which ignores the history trend. Besides, it can only anticipate a fixed future time. We propose a Reinforced Encoder-Decoder (RED) network for action anticipation. RED takes multiple history representations as input and learns to anticipate a sequence of future representations. One salient aspect of RED is that a reinforcement module is adopted to provide sequence-level supervision; the reward function is designed to encourage the system to make correct predictions as early as possible. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rajskar/CS763Project
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods