Study on Aspect Ratio Variability toward Robustness of Vision Transformer-based Vehicle Re-identification
Mei Qiu, Lauren Christopher, and Lingxi Li

TL;DR
This paper investigates how aspect ratio variability affects ViT-based vehicle re-identification and proposes a novel framework that fuses models trained on various aspect ratios to improve robustness and accuracy.
Contribution
It introduces a new ViT-based ReID framework that analyzes aspect ratio effects, employs intra-image patch mixup and uneven stride techniques, and features a dynamic feature fusion for enhanced robustness.
Findings
Achieved 91.0% mAP on VehicleID, surpassing previous 80.9%.
Analyzed aspect ratio impact guiding input settings.
Proposed intra-image mixup and dynamic fusion methods.
Abstract
Vision Transformers (ViTs) have excelled in vehicle re-identification (ReID) tasks. However, non-square aspect ratios of image or video input might significantly affect the re-identification performance. To address this issue, we propose a novel ViT-based ReID framework in this paper, which fuses models trained on a variety of aspect ratios. Our main contributions are threefold: (i) We analyze aspect ratio performance on VeRi-776 and VehicleID datasets, guiding input settings based on aspect ratios of original images. (ii) We introduce patch-wise mixup intra-image during ViT patchification (guided by spatial attention scores) and implement uneven stride for better object aspect ratio matching. (iii) We propose a dynamic feature fusing ReID network, enhancing model robustness. Our ReID method achieves a significantly improved mean Average Precision (mAP) of 91.0\% compared to the the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Measurement and Detection Methods · Industrial Vision Systems and Defect Detection · Spectroscopy and Chemometric Analyses
MethodsSoftmax · Attention Is All You Need · Mixup
