Cut-and-Paste with Precision: a Content and Perspective-aware Data   Augmentation for Road Damage Detection

Punnawat Siripathitti; Florent Forest; Olga Fink

arXiv:2406.18586·cs.CV·July 11, 2024

Cut-and-Paste with Precision: a Content and Perspective-aware Data Augmentation for Road Damage Detection

Punnawat Siripathitti, Florent Forest, Olga Fink

PDF

Open Access

TL;DR

This paper introduces a novel content- and perspective-aware data augmentation method for road damage detection, improving the realism and effectiveness of training data by considering road location and perspective differences.

Contribution

The work presents an improved cut-and-paste augmentation technique that accounts for road location and perspective, enhancing data diversity for better damage detection performance.

Findings

01

Enhanced detection accuracy with the proposed augmentation method

02

More realistic augmented images leading to better model generalization

03

Improved robustness of damage detection models in diverse scenarios

Abstract

Damage to road pavement can develop into cracks, potholes, spallings, and other issues posing significant challenges to the integrity, safety, and durability of the road structure. Detecting and monitoring the evolution of these damages is crucial for maintaining the condition and structural health of road infrastructure. In recent years, researchers have explored various data-driven methods for image-based damage detection in road monitoring applications. The field gained attention with the introduction of the Road Damage Detection Challenge (RDDC2018), encouraging competition in developing object detectors on street-view images from various countries. Leading teams have demonstrated the effectiveness of ensemble models, mostly based on the YOLO and Faster R-CNN series. Data augmentations have also shown benefits in object detection within the computer vision field, including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfrastructure Maintenance and Monitoring · Handwritten Text Recognition Techniques · Multimodal Machine Learning Applications

MethodsSoftmax · RoIPool · Region Proposal Network · Convolution · Faster R-CNN