DeepSafety:Multi-level Audio-Text Feature Extraction and Fusion Approach for Violence Detection in Conversations
Amna Anwar, Eiman Kanjo, Dario Ortega Anderez

TL;DR
DeepSafety introduces a multi-level audio-text feature fusion framework that combines verbal, vocal, and text cues to effectively detect violence in conversations, outperforming single-modality approaches.
Contribution
The paper presents a novel multi-level multimodal fusion approach integrating audio and text features for violence detection in conversations, enhancing detection accuracy.
Findings
Multi-level feature fusion achieves F1 score of 0.85.
Combining modalities outperforms single-modality methods.
The framework effectively captures verbal and vocal cues for violence detection.
Abstract
Natural Language Processing has recently made understanding human interaction easier, leading to improved sentimental analysis and behaviour prediction. However, the choice of words and vocal cues in conversations presents an underexplored rich source of natural language data for personal safety and crime prevention. When accompanied by audio analysis, it makes it possible to understand the context of a conversation, including the level of tension or rift between people. Building on existing work, we introduce a new information fusion approach that extracts and fuses multi-level features including verbal, vocal, and text as heterogeneous sources of information to detect the extent of violent behaviours in conversations. Our multilevel multimodel fusion framework integrates four types of information from raw audio signals including embeddings generated from both BERT and Bi-long…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Network Security and Intrusion Detection
