Two-Stage Violence Detection Using ViTPose and Classification Models at Smart Airports
İrem Üstek, Jay Desai, Iván López Torrecillas, Sofiane Abadou, Jinjie Wang, Quentin Fever, Sandhya Rani Kasthuri, Yang Xing, Weisi Guo, Antonios Tsourdos

TL;DR
This paper presents a real-time violence detection system for smart airports that combines ViTPose for pose estimation with CNN-BiLSTM models, demonstrating improved accuracy and robustness in surveillance scenarios.
Contribution
The study introduces a novel two-stage framework integrating ViTPose and classification models for real-time violence detection in airport security.
Findings
High accuracy in violence classification on the AIRTLab dataset
Robust performance in real-world airport scenarios
Reduced false positives in violence detection
Abstract
This study introduces an innovative violence detection framework tailored to the unique requirements of smart airports, where prompt responses to violent situations are crucial. The proposed framework uses ViTPose for human pose estimation and a CNN-BiLSTM network to analyse spatial and temporal information within keypoint sequences, enabling the accurate classification of violent behaviour in real time. Seamlessly integrated within the SAFE (Situational Awareness for Enhanced Security) framework of SAAB, the solution underwent integrated testing to ensure robust performance in real-world scenarios. The AIRTLab dataset, characterized by its high video quality and relevance to surveillance scenarios, is utilized in this study to enhance the model's accuracy and mitigate false positives. As airports face increased foot traffic in the post-pandemic era,…
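The two-stage flow described in the abstract (per-frame pose estimation, then classification of a keypoint sequence) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `estimate_pose` and `classify_clip` are hypothetical callables standing in for ViTPose and the CNN-BiLSTM classifier, and the window length, stride, threshold, single-person assumption, and normalisation step are all assumptions that the abstract does not specify.

```python
from collections import deque
from typing import Callable, Iterable, Iterator, List, Tuple

Keypoints = List[Tuple[float, float]]  # one (x, y) pair per body joint


def normalise(keypoints: Keypoints) -> Keypoints:
    """Centre keypoints on their mean and scale by the largest extent.

    Assumed preprocessing step; the abstract does not describe one.
    """
    n = len(keypoints)
    cx = sum(x for x, _ in keypoints) / n
    cy = sum(y for _, y in keypoints) / n
    centred = [(x - cx, y - cy) for x, y in centred_pairs(keypoints, cx, cy)]
    scale = max(max(abs(x), abs(y)) for x, y in centred) or 1.0
    return [(x / scale, y / scale) for x, y in centred]


def centred_pairs(keypoints: Keypoints, cx: float, cy: float) -> Keypoints:
    """Return the keypoints unchanged; kept separate so the centring is explicit."""
    return list(keypoints)


def detect_violence(
    frames: Iterable[object],
    estimate_pose: Callable[[object], Keypoints],          # stage 1 stand-in
    classify_clip: Callable[[List[Keypoints]], float],     # stage 2 stand-in
    window: int = 16,       # assumed clip length in frames
    stride: int = 8,        # assumed hop between classified clips
    threshold: float = 0.5  # assumed decision threshold
) -> Iterator[Tuple[int, float, bool]]:
    """Yield (frame_index, score, is_violent) for each sliding window."""
    buffer: deque = deque(maxlen=window)
    for index, frame in enumerate(frames):
        # Stage 1: reduce the frame to body keypoints.
        buffer.append(normalise(estimate_pose(frame)))
        # Stage 2: classify once the buffer is full, every `stride` frames.
        if len(buffer) == window and (index + 1 - window) % stride == 0:
            score = classify_clip(list(buffer))
            yield index, score, score >= threshold
```

Because the classifier only sees keypoints from the most recent `window` frames, memory use stays constant on a live stream; a real deployment would also need to handle multiple people per frame and missed detections, which this sketch ignores.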
Taxonomy
Topics: Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods · Gait Recognition and Analysis
Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM
