Embarrassingly Simple Model for Early Action Proposal

Marcos Baptista-Ríos; Roberto J. López-Sastre; Francisco Javier Acevedo-Rodríguez; Saturnino Maldonado-Bascón

arXiv:1810.07420 · cs.CV · October 19, 2018

Open Access

TL;DR

This paper introduces a simple classifier-based model, built on standard 3D CNNs, for online early action proposal in video streams, and reports significantly better performance than the state of the art.

Contribution

The paper presents a straightforward 3D CNN approach for online action proposal, emphasizing simplicity and improved performance over complex prior methods.

Findings

01

Outperforms state-of-the-art methods in early action proposal

02

Uses standard 3D CNNs for online detection

03

Achieves significant improvements in real-time scenarios

Abstract

Early action proposal consists of generating high-quality candidate temporal segments that are likely to contain an action in a video stream, as soon as they happen. Many sophisticated approaches have been proposed for the action proposal problem, but from the off-line perspective. In contrast, we focus on the on-line version of the problem, proposing a simple classifier-based model, using standard 3D CNNs, that performs significantly better than the state of the art.
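To make the on-line, classifier-based idea concrete, here is a minimal sketch of how per-clip classifier scores could be turned into temporal proposals as a stream arrives. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not describe the grouping rule, so the `online_proposals` function, the `Proposal` type, the 0.5 threshold, and the "consecutive clips above threshold" rule are all hypothetical. The scores stand in for the action-versus-background output that a 3D CNN might produce for each incoming clip.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Optional


@dataclass
class Proposal:
    start: int  # index of the first clip scored as action
    end: int    # index of the last clip scored as action (inclusive)


def online_proposals(
    scores: Iterable[float], threshold: float = 0.5
) -> Iterator[Proposal]:
    """Group consecutive above-threshold clip scores into proposals.

    Hypothetical rule for illustration only: a proposal opens at the
    first clip whose score reaches the threshold and closes at the
    first clip that falls below it. Clips are consumed one at a time,
    so no future frames are needed.
    """
    start: Optional[int] = None
    last = -1
    for i, score in enumerate(scores):
        last = i
        if score >= threshold:
            if start is None:
                start = i  # an action segment may be starting here
        elif start is not None:
            yield Proposal(start, i - 1)  # segment ended at the previous clip
            start = None
    if start is not None:
        # The stream ended while a segment was still open.
        yield Proposal(start, last)


if __name__ == "__main__":
    # Made-up scores for five consecutive clips.
    for proposal in online_proposals([0.1, 0.9, 0.8, 0.2, 0.7]):
        print(proposal)
```

Because the generator reads one score at a time, the start of a segment is known as soon as the first action clip is scored, which is the property that distinguishes the on-line setting from off-line methods that see the whole video first.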

Taxonomy

Topics: Advanced Vision and Imaging · Human Pose and Action Recognition · Video Analysis and Summarization