PaveSync: A Unified and Comprehensive Dataset for Pavement Distress Analysis and Classification
Blessing Agyei Kyem, Joshua Kofi Asamoah, Anthony Dontoh, Andrews Danyo, Eugene Denteh, Armstrong Aboah

TL;DR
PaveSync introduces a large, standardized dataset of over 52,000 images from seven countries, covering 13 pavement distress types, to improve defect detection and model generalization across diverse real-world conditions.
Contribution
This paper presents the first comprehensive, standardized benchmark dataset for pavement distress analysis, enabling consistent training, evaluation, and fair comparison of detection models globally.
Findings
State-of-the-art models achieved competitive performance on the dataset.
The dataset captures diverse real-world conditions, enhancing model robustness.
Standardization facilitates zero-shot transfer to new environments.
Abstract
Automated pavement defect detection often struggles to generalize across diverse real-world conditions due to the lack of standardized datasets. Existing datasets differ in annotation styles, distress type definitions, and formats, limiting their integration for unified training. To address this gap, we introduce a comprehensive benchmark dataset that consolidates multiple publicly available sources into a standardized collection of 52747 images from seven countries, with 135277 bounding box annotations covering 13 distinct distress types. The dataset captures broad real-world variation in image quality, resolution, viewing angles, and weather conditions, offering a unique resource for consistent training and evaluation. Its effectiveness was demonstrated through benchmarking with state-of-the-art object detection models including YOLOv8-YOLOv12, Faster R-CNN, and DETR, which achieved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Advanced Neural Network Applications · Image Enhancement Techniques
