Learning Generalisable Omni-Scale Representations for Person   Re-Identification

Kaiyang Zhou; Yongxin Yang; Andrea Cavallaro; Tao Xiang

arXiv:1910.06827·cs.CV·April 30, 2021·49 cites

Learning Generalisable Omni-Scale Representations for Person Re-Identification

Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, Tao Xiang

PDF

Open Access 5 Repos 4 Models

TL;DR

This paper introduces OSNet, a lightweight CNN architecture with omni-scale features and instance normalization, achieving state-of-the-art person re-identification performance and strong cross-dataset generalization without domain adaptation.

Contribution

The paper proposes a novel omni-scale CNN architecture with dynamic multi-scale feature fusion and an architecture search for optimal IN layer placement, enhancing generalization in person re-ID.

Findings

01

OSNet achieves state-of-the-art results on standard datasets.

02

OSNet outperforms many unsupervised domain adaptation methods cross-dataset.

03

The architecture is lightweight and efficient.

Abstract

An effective person re-identification (re-ID) model should learn feature representations that are both discriminative, for distinguishing similar-looking people, and generalisable, for deployment across datasets without any adaptation. In this paper, we develop novel CNN architectures to address both challenges. First, we present a re-ID CNN termed omni-scale network (OSNet) to learn features that not only capture different spatial scales but also encapsulate a synergistic combination of multiple scales, namely omni-scale features. The basic building block consists of multiple convolutional streams, each detecting features at a certain scale. For omni-scale feature learning, a unified aggregation gate is introduced to dynamically fuse multi-scale features with channel-wise weights. OSNet is lightweight as its building blocks comprise factorised convolutions. Second, to improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Human Pose and Action Recognition