YOLO9tr: A Lightweight Model for Pavement Damage Detection Utilizing a Generalized Efficient Layer Aggregation Network and Attention Mechanism
Sompote Youwai, Achitaphon Chaiyaphat, Pawarotorn Chaipetch

TL;DR
YOLO9tr is a lightweight, real-time pavement damage detection model that improves accuracy and speed by integrating a generalized efficient layer aggregation network with an attention mechanism, suitable for practical road maintenance applications.
Contribution
The paper introduces YOLO9tr, a novel lightweight detection model with a partial attention block, enhancing feature extraction and detection performance over existing YOLO variants.
Findings
Achieves up to 136 FPS for real-time detection
Outperforms YOLO8, YOLO9, and YOLO10 in accuracy and speed
Validated through ablation studies on architectural modifications
Abstract
Maintaining road pavement integrity is crucial for ensuring safe and efficient transportation. Conventional methods for assessing pavement condition are often laborious and susceptible to human error. This paper proposes YOLO9tr, a novel lightweight object detection model for pavement damage detection, leveraging the advancements of deep learning. YOLO9tr is based on the YOLOv9 architecture, incorporating a partial attention block that enhances feature extraction and attention mechanisms, leading to improved detection performance in complex scenarios. The model is trained on a comprehensive dataset comprising road damage images from multiple countries, including an expanded set of damage categories beyond the standard four. This broadened classification range allows for a more accurate and realistic assessment of pavement conditions. Comparative analysis demonstrates YOLO9tr's superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Asphalt Pavement Performance Evaluation · Non-Destructive Testing Techniques
MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
