A Review and Implementation of Object Detection Models and Optimizations   for Real-time Medical Mask Detection during the COVID-19 Pandemic

Ioanna Gogou; Dimitrios Koutsomitropoulos

arXiv:2405.18387·cs.CV·May 29, 2024

A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic

Ioanna Gogou, Dimitrios Koutsomitropoulos

PDF

1 Repo

TL;DR

This paper evaluates object detection models for real-time medical mask detection during COVID-19, comparing speed and accuracy, and proposes an optimized YOLOv5-based model that is faster and equally accurate.

Contribution

It provides a comprehensive evaluation of detection models' speed/accuracy trade-offs and introduces an optimized YOLOv5-based model for mask detection with improved speed.

Findings

01

YOLOv5 outperforms other models in speed and accuracy.

02

Optimizations like transfer learning and attention mechanisms improve real-time detection.

03

The proposed model achieves 69 fps with 67% mAP on PWMFD.

Abstract

Convolutional Neural Networks (CNN) are commonly used for the problem of object detection thanks to their increased accuracy. Nevertheless, the performance of CNN-based detection models is ambiguous when detection speed is considered. To the best of our knowledge, there has not been sufficient evaluation of the available methods in terms of the speed/accuracy trade-off in related literature. This work assesses the most fundamental object detection models on the Common Objects in Context (COCO) dataset with respect to this trade-off, their memory consumption, and computational and storage cost. Next, we select a highly efficient model called YOLOv5 to train on the topical and unexplored dataset of human faces with medical masks, the Properly-Wearing Masked Faces Dataset (PWMFD), and analyze the benefits of specific optimization techniques for real-time medical mask detection: transfer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

joangog/object-detection
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings