Cut-and-Paste with Precision: a Content and Perspective-aware Data Augmentation for Road Damage Detection
Punnawat Siripathitti, Florent Forest, Olga Fink

TL;DR
This paper introduces a novel content- and perspective-aware data augmentation method for road damage detection, improving the realism and effectiveness of training data by considering road location and perspective differences.
Contribution
The work presents an improved cut-and-paste augmentation technique that accounts for road location and perspective, enhancing data diversity for better damage detection performance.
Findings
Enhanced detection accuracy with the proposed augmentation method
More realistic augmented images leading to better model generalization
Improved robustness of damage detection models in diverse scenarios
Abstract
Damage to road pavement can develop into cracks, potholes, spallings, and other issues posing significant challenges to the integrity, safety, and durability of the road structure. Detecting and monitoring the evolution of these damages is crucial for maintaining the condition and structural health of road infrastructure. In recent years, researchers have explored various data-driven methods for image-based damage detection in road monitoring applications. The field gained attention with the introduction of the Road Damage Detection Challenge (RDDC2018), encouraging competition in developing object detectors on street-view images from various countries. Leading teams have demonstrated the effectiveness of ensemble models, mostly based on the YOLO and Faster R-CNN series. Data augmentations have also shown benefits in object detection within the computer vision field, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Handwritten Text Recognition Techniques · Multimodal Machine Learning Applications
MethodsSoftmax · RoIPool · Region Proposal Network · Convolution · Faster R-CNN
