YOLO11-CR: a Lightweight Convolution-and-Attention Framework for Accurate Fatigue Driving Detection
Zhebin Jin, Ligang Dong

TL;DR
YOLO11-CR is a lightweight, real-time detection model combining convolution and attention mechanisms to improve fatigue driving detection accuracy, especially for small and occluded objects.
Contribution
The paper introduces YOLO11-CR, a novel lightweight detection framework with CAFM and RCM modules for enhanced feature fusion and spatial localization in fatigue detection.
Findings
Achieves 87.17% precision and 88.09% mAP@50 on DSM dataset.
Outperforms baseline models significantly in accuracy.
Validated effectiveness of CAFM and RCM modules through ablation studies.
Abstract
Driver fatigue detection is of paramount importance for intelligent transportation systems due to its critical role in mitigating road traffic accidents. While physiological and vehicle dynamics-based methods offer accuracy, they are often intrusive, hardware-dependent, and lack robustness in real-world environments. Vision-based techniques provide a non-intrusive and scalable alternative, but still face challenges such as poor detection of small or occluded objects and limited multi-scale feature modeling. To address these issues, this paper proposes YOLO11-CR, a lightweight and efficient object detection model tailored for real-time fatigue detection. YOLO11-CR introduces two key modules: the Convolution-and-Attention Fusion Module (CAFM), which integrates local CNN features with global Transformer-based context to enhance feature expressiveness; and the Rectangular Calibration Module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
