TL;DR
This paper introduces Vehicle-Rear, a comprehensive dataset for vehicle identification, and proposes a two-stream CNN that fuses appearance and license plate features to improve accuracy in non-overlapping camera scenarios.
Contribution
The paper presents a novel vehicle dataset and a two-stream CNN architecture that combines shape and textual features for enhanced vehicle identification.
Findings
Two-stream CNN outperforms single-feature models.
Dataset includes detailed vehicle attributes and license plate info.
Fusion of appearance and license plate features reduces false alarms.
Abstract
This work addresses the problem of vehicle identification through non-overlapping cameras. As our main contribution, we introduce a novel dataset for vehicle identification, called Vehicle-Rear, that contains more than three hours of high-resolution videos, with accurate information about the make, model, color and year of nearly 3,000 vehicles, in addition to the position and identification of their license plates. To explore our dataset we design a two-stream CNN that simultaneously uses two of the most distinctive and persistent features available: the vehicle's appearance and its license plate. This is an attempt to tackle a major problem: false alarms caused by vehicles with similar designs or by very close license plate identifiers. In the first network stream, shape similarities are identified by a Siamese CNN that uses a pair of low-resolution vehicle patches recorded by two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
