PaveSync: A Unified and Comprehensive Dataset for Pavement Distress Analysis and Classification
By: Blessing Agyei Kyem , Joshua Kofi Asamoah , Anthony Dontoh and more
Automated pavement defect detection often struggles to generalize across diverse real-world conditions due to the lack of standardized datasets. Existing datasets differ in annotation styles, distress type definitions, and formats, limiting their integration for unified training. To address this gap, we introduce a comprehensive benchmark dataset that consolidates multiple publicly available sources into a standardized collection of 52747 images from seven countries, with 135277 bounding box annotations covering 13 distinct distress types. The dataset captures broad real-world variation in image quality, resolution, viewing angles, and weather conditions, offering a unique resource for consistent training and evaluation. Its effectiveness was demonstrated through benchmarking with state-of-the-art object detection models including YOLOv8-YOLOv12, Faster R-CNN, and DETR, which achieved competitive performance across diverse scenarios. By standardizing class definitions and annotation formats, this dataset provides the first globally representative benchmark for pavement defect detection and enables fair comparison of models, including zero-shot transfer to new environments.
Similar Papers
A Benchmark Dataset for Spatially Aligned Road Damage Assessment in Small Uncrewed Aerial Systems Disaster Imagery
CV and Pattern Recognition
Helps drones find damaged roads after disasters.
PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation
CV and Pattern Recognition
Tests self-driving cars for safer real-world driving.
Deep Learning for Pavement Condition Evaluation Using Satellite Imagery
CV and Pattern Recognition
Quickly checks road conditions from satellites at 90% accuracy