Deep learning based cough detection camera using enhanced features
Gyeong-Tae Lee, Hyeonuk Nam, Seong-Hu Kim, Sang-Min Choi, Youngkey, Kim, Yong-Hwa Park

TL;DR
This paper presents a deep learning-based cough detection system integrated with a sound camera, utilizing enhanced acoustic features and CNN architectures to accurately detect and localize cough sounds in real time.
Contribution
The work introduces a novel combination of enhanced spectral features and simplified CNN models for high-accuracy cough detection and localization using a sound camera.
Findings
Achieved a 91.9% F1 score with G-net and MFCC-V-A features.
Demonstrated real-time cough localization with 90% F1 score.
Integrated deep learning model with sound camera for practical cough monitoring.
Abstract
Coughing is a typical symptom of COVID-19. To detect and localize coughing sounds remotely, a convolutional neural network (CNN) based deep learning model was developed in this work and integrated with a sound camera for the visualization of the cough sounds. The cough detection model is a binary classifier of which the input is a two second acoustic feature and the output is one of two inferences (Cough or Others). Data augmentation was performed on the collected audio files to alleviate class imbalance and reflect various background noises in practical environments. For effective featuring of the cough sound, conventional features such as spectrograms, mel-scaled spectrograms, and mel-frequency cepstral coefficients (MFCC) were reinforced by utilizing their velocity (V) and acceleration (A) maps in this work. VGGNet, GoogLeNet, and ResNet were simplified to binary classifiers, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBatch Normalization · Residual Connection · Average Pooling · Global Average Pooling · Dense Connections · Softmax · Kaiming Initialization · Residual Block · 1x1 Convolution · Dropout
