Road Damage Detection Using Deep Neural Networks with Images Captured Through a Smartphone
Hiroya Maeda, Yoshihide Sekimoto, Toshikazu Seto, Takehiro Kashiyama,, Hiroshi Omata

TL;DR
This paper introduces a large-scale, publicly available dataset of road damage images captured with smartphones, and develops a deep learning model capable of accurately detecting and classifying eight types of road damage in real-world conditions.
Contribution
It creates the first large-scale, annotated road damage dataset from smartphone images and applies state-of-the-art neural networks for damage detection and classification.
Findings
High detection accuracy for eight damage types.
Effective real-time detection on smartphones.
Robust performance across diverse weather conditions.
Abstract
Research on damage detection of road surfaces using image processing techniques has been actively conducted, achieving considerably high detection accuracies. Many studies only focus on the detection of the presence or absence of damage. However, in a real-world scenario, when the road managers from a governing body need to repair such damage, they need to clearly understand the type of damage in order to take effective action. In addition, in many of these previous studies, the researchers acquire their own data using different methods. Hence, there is no uniform road damage dataset available openly, leading to the absence of a benchmark for road damage detection. This study makes three contributions to address these issues. First, to the best of our knowledge, for the first time, a large-scale road damage dataset is prepared. This dataset is composed of 9,053 road damage images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Advanced Neural Network Applications · Remote Sensing and LiDAR Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
