A Robust Deep Learning Framework for Bangla License Plate Recognition Using YOLO and Vision-Language OCR
Nayeb Hasin, Md. Arafath Rahman Nishat, Mainul Islam, Khandakar Shakib Al Hasan, Asif Newaz

TL;DR
This paper introduces a robust Bangla license plate recognition system combining advanced object detection with vision-language OCR, achieving high accuracy and robustness across diverse real-world conditions.
Contribution
It proposes a novel two-stage adaptive training strategy for YOLOv8 and integrates a VisionEncoderDecoder architecture for improved text recognition in Bangla license plates.
Findings
Achieved 97.83% detection accuracy and 91.3% IoU.
Character Error Rate of 0.1323 with ViT + BanglaBERT.
System performs well under varying environmental conditions.
Abstract
An Automatic License Plate Recognition (ALPR) system constitutes a crucial element in an intelligent traffic management system. However, the detection of Bangla license plates remains challenging because of the complicated character scheme and uneven layouts. This paper presents a robust Bangla License Plate Recognition system that integrates a deep learning-based object detection model for license plate localization with Optical Character Recognition for text extraction. Multiple object detection architectures, including U-Net and several YOLO (You Only Look Once) variants, are compared for license plate localization. This study proposes a novel two-stage adaptive training strategy built upon the YOLOv8 architecture to improve localization performance. The proposed approach outperforms the established models, achieving an accuracy of 97.83% and an Intersection over Union (IoU) of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle License Plate Recognition · Handwritten Text Recognition Techniques · Advanced Neural Network Applications
