Abnormal Event Detection in Videos using Spatiotemporal Autoencoder

Yong Shean Chong; Yong Haur Tay

arXiv:1701.01546·cs.CV·January 9, 2017

Abnormal Event Detection in Videos using Spatiotemporal Autoencoder

Yong Shean Chong, Yong Haur Tay

PDF

Open Access 5 Repos

TL;DR

This paper introduces a spatiotemporal autoencoder architecture for efficient anomaly detection in videos, capable of real-time processing and effective in crowded scenes, with performance comparable to state-of-the-art methods.

Contribution

The paper proposes a novel unsupervised spatiotemporal autoencoder architecture specifically designed for anomaly detection in videos, including crowded scenes.

Findings

01

Achieves detection accuracy comparable to state-of-the-art methods.

02

Operates at speeds up to 140 frames per second.

03

Effective in crowded scene scenarios.

Abstract

We present an efficient method for detecting anomalies in videos. Recent applications of convolutional neural networks have shown promises of convolutional layers for object detection and recognition, especially in images. However, convolutional neural networks are supervised and require labels as learning signals. We propose a spatiotemporal architecture for anomaly detection in videos including crowded scenes. Our architecture includes two main components, one for spatial feature representation, and one for learning the temporal evolution of the spatial features. Experimental results on Avenue, Subway and UCSD benchmarks confirm that the detection accuracy of our method is comparable to state-of-the-art methods at a considerable speed of up to 140 fps.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Digital Media Forensic Detection

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings