A New Method for Vehicle Logo Recognition Based on Swin Transformer
Yang Li, Doudou Zhang, Jianli Xiao

TL;DR
This paper introduces a real-time vehicle logo recognition method using Swin Transformer, achieving high accuracy and efficiency, surpassing CNN-based approaches in challenging conditions for intelligent transportation systems.
Contribution
The paper presents the first application of Swin Transformer for vehicle logo recognition, demonstrating superior performance and real-time capability compared to traditional CNN methods.
Findings
Achieved top accuracy of over 99% on three datasets.
Outperformed CNN-based methods in challenging conditions.
Validated effectiveness of transfer learning in VLR.
Abstract
Intelligent Transportation Systems (ITS) utilize sensors, cameras, and big data analysis to monitor real-time traffic conditions, aiming to improve traffic efficiency and safety. Accurate vehicle recognition is crucial in this process, and Vehicle Logo Recognition (VLR) stands as a key method. VLR enables effective management and monitoring by distinguishing vehicles on the road. Convolutional Neural Networks (CNNs) have made impressive strides in VLR research. However, achieving higher performance demands significant time and computational resources for training. Recently, the rise of Transformer models has brought new opportunities to VLR. Swin Transformer, with its efficient computation and global feature modeling capabilities, outperforms CNNs under challenging conditions. In this paper, we implement real-time VLR using Swin Transformer and fine-tune it for optimal performance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle License Plate Recognition
MethodsAttention Is All You Need · Linear Layer · Stochastic Depth · Dropout · Layer Normalization · Multi-Head Attention · Byte Pair Encoding · Residual Connection · Adam · Softmax
