Audio Surveillance: a Systematic Review
Marco Crocco, Marco Cristani, Andrea Trucco, Vittorio Murino

TL;DR
This paper systematically reviews audio-based automated surveillance methods, proposing a taxonomy and analyzing various techniques for background subtraction, event classification, object tracking, and situation analysis, emphasizing their application contexts.
Contribution
It introduces a comprehensive taxonomy for audio surveillance methods and provides detailed analysis of their advantages, limitations, and specific application scenarios, filling a gap in existing surveys.
Findings
Audio features vary in expressiveness and suitability for different tasks.
Most methods are tailored to specific surveillance applications.
The survey offers practical tables and schemes for method selection.
Abstract
Despite surveillance systems are becoming increasingly ubiquitous in our living environment, automated surveillance, currently based on video sensory modality and machine intelligence, lacks most of the time the robustness and reliability required in several real applications. To tackle this issue, audio sensory devices have been taken into account, both alone or in combination with video, giving birth, in the last decade, to a considerable amount of research. In this paper audio-based automated surveillance methods are organized into a comprehensive survey: a general taxonomy, inspired by the more widespread video surveillance field, is proposed in order to systematically describe the methods covering background subtraction, event classification, object tracking and situation analysis. For each of these tasks, all the significant works are reviewed, detailing their pros and cons and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Video Surveillance and Tracking Methods
