A Spatio-Temporal Attentive Network for Video-Based Crowd Counting

Marco Avvenuti; Marco Bongiovanni; Luca Ciampi; Fabrizio Falchi,; Claudio Gennaro; Nicola Messina

arXiv:2208.11339·cs.CV·August 25, 2022

A Spatio-Temporal Attentive Network for Video-Based Crowd Counting

Marco Avvenuti, Marco Bongiovanni, Luca Ciampi, Fabrizio Falchi,, Claudio Gennaro, Nicola Messina

PDF

Open Access

TL;DR

This paper introduces a spatio-temporal attentive neural network that leverages temporal correlations in video sequences to improve pedestrian counting accuracy in surveillance videos, outperforming existing methods on standard benchmarks.

Contribution

It presents a novel neural network architecture that incorporates spatio-temporal attention for more accurate crowd counting in videos, addressing limitations of previous image-based approaches.

Findings

01

Reduced count error by 5% on FDST benchmark

02

Lowered localization error by 7.5%

03

Demonstrated effectiveness of temporal attention in crowd counting

Abstract

Automatic people counting from images has recently drawn attention for urban monitoring in modern Smart Cities due to the ubiquity of surveillance camera networks. Current computer vision techniques rely on deep learning-based algorithms that estimate pedestrian densities in still, individual images. Only a bunch of works take advantage of temporal consistency in video sequences. In this work, we propose a spatio-temporal attentive neural network to estimate the number of pedestrians from surveillance videos. By taking advantage of the temporal correlation between consecutive frames, we lowered state-of-the-art count error by 5% and localization error by 7.5% on the widely-used FDST benchmark.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Impact of Light on Environment and Health · Image Enhancement Techniques