TrackNet: A Triplet metric-based method for Multi-Target Multi-Camera   Vehicle Tracking

David Serrano; Francesc Net; Juan Antonio Rodr\'iguez; Igor Ugarte

arXiv:2205.13857·cs.CV·May 30, 2022·1 cites

TrackNet: A Triplet metric-based method for Multi-Target Multi-Camera Vehicle Tracking

David Serrano, Francesc Net, Juan Antonio Rodr\'iguez, Igor Ugarte

PDF

Open Access 1 Repo

TL;DR

TrackNet is a modular multi-camera vehicle tracking method that combines detection, single-camera tracking, and triplet metric learning for improved IDF1 performance in traffic videos.

Contribution

The paper introduces a novel triplet metric learning strategy integrated with detection and tracking for multi-camera vehicle tracking.

Findings

01

Achieved IDF1 score of 0.4733 on AI City Challenge

02

Demonstrated effectiveness of triplet metric learning in track matching

03

Provided a modular framework adaptable to traffic video analysis

Abstract

We present TrackNet, a method for Multi-Target Multi-Camera (MTMC) vehicle tracking from traffic video sequences. Cross-camera vehicle tracking has proved to be a challenging task due to perspective, scale and speed variance, as well occlusions and noise conditions. Our method is based on a modular approach that first detects vehicles frame-by-frame using Faster R-CNN, then tracks detections through single camera using Kalman filter, and finally matches tracks by a triplet metric learning strategy. We conduct experiments on TrackNet within the AI City Challenge framework, and present competitive IDF1 results of 0.4733.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

davidserra9/tracknet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · IoT and GPS-based Vehicle Safety Systems

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · RoIPool · Convolution · Softmax · Region Proposal Network · Faster R-CNN