Improved Pothole Detection Using YOLOv7 and ESRGAN

Nirmal Kumar Rout; Gyanateet Dutta; Varun Sinha; Arghadeep Dey,; Subhrangshu Mukherjee; Gopal Gupta

arXiv:2401.08588·cs.CV·January 18, 2024·5 cites

Improved Pothole Detection Using YOLOv7 and ESRGAN

Nirmal Kumar Rout, Gyanateet Dutta, Varun Sinha, Arghadeep Dey,, Subhrangshu Mukherjee, Gopal Gupta

PDF

Open Access 1 Repo

TL;DR

This paper enhances pothole detection accuracy in low-resolution images by integrating ESRGAN-based super resolution with YOLOv7, demonstrating improved speed and precision in real-world dashcam scenarios.

Contribution

It introduces a novel combination of SRGAN and YOLOv7 for effective low-resolution pothole detection, improving upon existing methods.

Findings

01

Enhanced detection accuracy on low-quality images

02

Increased processing speed after upscaling

03

Demonstrated effectiveness in real-world dashcam footage

Abstract

Potholes are common road hazards that is causing damage to vehicles and posing a safety risk to drivers. The introduction of Convolutional Neural Networks (CNNs) is widely used in the industry for object detection based on Deep Learning methods and has achieved significant progress in hardware improvement and software implementations. In this paper, a unique better algorithm is proposed to warrant the use of low-resolution cameras or low-resolution images and video feed for automatic pothole detection using Super Resolution (SR) through Super Resolution Generative Adversarial Networks (SRGANs). Then we have proceeded to establish a baseline pothole detection performance on low quality and high quality dashcam images using a You Only Look Once (YOLO) network, namely the YOLOv7 network. We then have illustrated and examined the speed and accuracy gained above the benchmark after having…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.com/Ryukijano/ESRGAN_AND_YOLOV7
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfrastructure Maintenance and Monitoring · Advanced Neural Network Applications · Image and Object Detection Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings