TL;DR
This paper evaluates object detection models for real-time medical mask detection during COVID-19, comparing speed and accuracy, and proposes an optimized YOLOv5-based model that is faster and equally accurate.
Contribution
It provides a comprehensive evaluation of detection models' speed/accuracy trade-offs and introduces an optimized YOLOv5-based model for mask detection with improved speed.
Findings
YOLOv5 outperforms other models in speed and accuracy.
Optimizations like transfer learning and attention mechanisms improve real-time detection.
The proposed model achieves 69 fps with 67% mAP on PWMFD.
Abstract
Convolutional Neural Networks (CNN) are commonly used for the problem of object detection thanks to their increased accuracy. Nevertheless, the performance of CNN-based detection models is ambiguous when detection speed is considered. To the best of our knowledge, there has not been sufficient evaluation of the available methods in terms of the speed/accuracy trade-off in related literature. This work assesses the most fundamental object detection models on the Common Objects in Context (COCO) dataset with respect to this trade-off, their memory consumption, and computational and storage cost. Next, we select a highly efficient model called YOLOv5 to train on the topical and unexplored dataset of human faces with medical masks, the Properly-Wearing Masked Faces Dataset (PWMFD), and analyze the benefits of specific optimization techniques for real-time medical mask detection: transfer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
