YOLO advances to its genesis: a decadal and comprehensive review of the You Only Look Once (YOLO) series
Ranjan Sapkota, Marco Flores Calero, Rizwan Qureshi, Chetan Badgujar, Upesh Nepal, Alwin Poulose, Peter Zeno, Uday Bhanu Prakash Vaddevolu, Sheheryar Khan, Maged Shoman, Hong Yan, Manoj Karkee

TL;DR
This comprehensive review traces the evolution of YOLO object detection models from YOLOv1 to YOLOv12, highlighting technological advancements, applications, and future directions in real-time object detection.
Contribution
It provides a systematic, reverse chronological analysis of YOLO versions, including architectural innovations and their impact on detection speed, accuracy, and efficiency.
Findings
YOLO has significantly improved detection accuracy and speed over a decade.
Architectural innovations like YOLO-NAS and YOLO-X enhanced performance.
YOLO models have broad applications in autonomous vehicles, healthcare, and security.
Abstract
This review systematically examines the progression of the You Only Look Once (YOLO) object detection algorithms from YOLOv1 to the recently unveiled YOLOv12. Employing a reverse chronological analysis, this study examines the advancements introduced by YOLO algorithms, beginning with YOLOv12 and progressing through YOLO11 (or YOLOv11), YOLOv10, YOLOv9, YOLOv8, and subsequent versions to explore each version's contributions to enhancing speed, detection accuracy, and computational efficiency in real-time object detection. Additionally, this study reviews the alternative versions derived from YOLO architectural advancements of YOLO-NAS, YOLO-X, YOLO-R, DAMO-YOLO, and Gold-YOLO. Moreover, the study highlights the transformative impact of YOLO models across five critical application areas: autonomous vehicles and traffic safety, healthcare and medical imaging, industrial manufacturing,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsYou Only Look Once · Dropout · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · Max Pooling · 1x1 Convolution · Non Maximum Suppression · Dense Connections · YOLOv1
