Real-Time Scream Detection and Position Estimation for Worker Safety in Construction Sites
Bikalpa Gautam, Anmol Guragain, Sarthak Giri

TL;DR
This paper presents a real-time system combining deep learning and signal processing techniques to detect screams and estimate worker positions in construction sites, enhancing safety in challenging environments.
Contribution
It introduces a novel integrated system using Wav2Vec2, ConvNet, and GCC-PHAT for accurate scream detection and localization in noisy, reverberant construction environments.
Findings
High detection accuracy amidst construction noise
Reliable localization of distress callers
Potential for widespread safety application
Abstract
The construction industry faces high risks due to frequent accidents, often leaving workers in perilous situations where rapid response is critical. Traditional safety monitoring methods, including wearable sensors and GPS, often fail under obstructive or indoor conditions. This research introduces a novel real-time scream detection and localization system tailored for construction sites, especially in low-resource environments. Integrating Wav2Vec2 and Enhanced ConvNet models for accurate scream detection, coupled with the GCC-PHAT algorithm for robust time delay estimation under reverberant conditions, followed by a gradient descent-based approach to achieve precise position estimation in noisy environments. Our approach combines these concepts to achieve high detection accuracy and rapid localization, thereby minimizing false alarms and optimizing emergency response. Preliminary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOccupational Health and Safety Research · Risk and Safety Analysis · Infrastructure Maintenance and Monitoring
