A New Method for Vehicle Logo Recognition Based on Swin Transformer

Yang Li; Doudou Zhang; Jianli Xiao

arXiv:2401.15458·cs.CV·January 30, 2024·1 cites

A New Method for Vehicle Logo Recognition Based on Swin Transformer

Yang Li, Doudou Zhang, Jianli Xiao

PDF

Open Access

TL;DR

This paper introduces a real-time vehicle logo recognition method using Swin Transformer, achieving high accuracy and efficiency, surpassing CNN-based approaches in challenging conditions for intelligent transportation systems.

Contribution

The paper presents the first application of Swin Transformer for vehicle logo recognition, demonstrating superior performance and real-time capability compared to traditional CNN methods.

Findings

01

Achieved top accuracy of over 99% on three datasets.

02

Outperformed CNN-based methods in challenging conditions.

03

Validated effectiveness of transfer learning in VLR.

Abstract

Intelligent Transportation Systems (ITS) utilize sensors, cameras, and big data analysis to monitor real-time traffic conditions, aiming to improve traffic efficiency and safety. Accurate vehicle recognition is crucial in this process, and Vehicle Logo Recognition (VLR) stands as a key method. VLR enables effective management and monitoring by distinguishing vehicles on the road. Convolutional Neural Networks (CNNs) have made impressive strides in VLR research. However, achieving higher performance demands significant time and computational resources for training. Recently, the rise of Transformer models has brought new opportunities to VLR. Swin Transformer, with its efficient computation and global feature modeling capabilities, outperforms CNNs under challenging conditions. In this paper, we implement real-time VLR using Swin Transformer and fine-tune it for optimal performance.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle License Plate Recognition

MethodsAttention Is All You Need · Linear Layer · Stochastic Depth · Dropout · Layer Normalization · Multi-Head Attention · Byte Pair Encoding · Residual Connection · Adam · Softmax