An Evaluation of Deep CNN Baselines for Scene-Independent Person Re-Identification
Paul Marchwica, Michael Jamieson, Parthipan Siva

TL;DR
This paper evaluates the effectiveness of deep CNN architectures for scene-independent person re-identification, demonstrating that large composite datasets enable competitive results without scene-specific training.
Contribution
It provides an in-depth comparison of CNN baselines for scene-independent ReID and introduces a new dataset for within-camera and across-camera ReID evaluation.
Findings
Scene-independent ReID can achieve competitive results with large composite datasets.
CNN baseline architectures vary in performance depending on dataset size.
The new dataset enables better evaluation of within-camera and across-camera ReID.
Abstract
In recent years, a variety of proposed methods based on deep convolutional neural networks (CNNs) have improved the state of the art for large-scale person re-identification (ReID). While a large number of optimizations and network improvements have been proposed, there has been relatively little evaluation of the influence of training data and baseline network architecture. In particular, it is usually assumed either that networks are trained on labeled data from the deployment location (scene-dependent), or else adapted with unlabeled data, both of which complicate system deployment. In this paper, we investigate the feasibility of achieving scene-independent person ReID by forming a large composite dataset for training. We present an in-depth comparison of several CNN baseline architectures for both scene-dependent and scene-independent ReID, across a range of training dataset sizes.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
