VARS: Vision-based Assessment of Risk in Security Systems
Pranav Gupta, Pratham Gohil, Sridhar S

TL;DR
This paper compares various machine learning and deep learning models to predict danger levels in videos, aiming to improve safety systems by identifying the most effective approach for risk assessment.
Contribution
It provides a comprehensive evaluation of classical and modern models for video danger prediction, introducing a new dataset and analysis framework.
Findings
Transformer-based models outperform classical methods in accuracy.
Neural networks achieve the lowest mean absolute error.
Support Vector Machines show competitive F1-scores.
Abstract
The accurate prediction of danger levels in video content is critical for enhancing safety and security systems, particularly in environments where quick and reliable assessments are essential. In this study, we perform a comparative analysis of various machine learning and deep learning models to predict danger ratings in a custom dataset of 100 videos, each containing 50 frames, annotated with human-rated danger scores ranging from 0 to 10. The danger ratings are further classified into three categories: no alert (less than 7)and high alert (greater than equal to 7). Our evaluation covers classical machine learning models, such as Support Vector Machines, as well as Neural Networks, and transformer-based models. Model performance is assessed using standard metrics such as accuracy, F1-score, and mean absolute error (MAE), and the results are compared to identify the most robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Information and Cyber Security · Software Reliability and Analysis Research
