Learning by Aligning: Visible-Infrared Person Re-identification using   Cross-Modal Correspondences

Hyunjong Park; Sanghoon Lee; Junghyup Lee; Bumsub Ham

arXiv:2108.07422·cs.CV·August 18, 2021

Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences

Hyunjong Park, Sanghoon Lee, Junghyup Lee, Bumsub Ham

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel framework for visible-infrared person re-identification that leverages dense cross-modal correspondences to improve discriminative feature learning and address modality discrepancies.

Contribution

It proposes exploiting dense pixel-level correspondences between cross-modal images to enhance feature alignment and discrimination in VI-reID.

Findings

01

Significantly outperforms state-of-the-art methods on standard benchmarks.

02

Effectively suppresses modality-related features for better discrimination.

03

Addresses misalignment issues in person images for improved re-identification.

Abstract

We address the problem of visible-infrared person re-identification (VI-reID), that is, retrieving a set of person images, captured by visible or infrared cameras, in a cross-modal setting. Two main challenges in VI-reID are intra-class variations across person images, and cross-modal discrepancies between visible and infrared images. Assuming that the person images are roughly aligned, previous approaches attempt to learn coarse image- or rigid part-level person representations that are discriminative and generalizable across different modalities. However, the person images, typically cropped by off-the-shelf object detectors, are not necessarily well-aligned, which distract discriminative person representation learning. In this paper, we introduce a novel feature learning framework that addresses these problems in a unified way. To this end, we propose to exploit dense correspondences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cvlab-yonsei/LbA
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Gait Recognition and Analysis · Face recognition and analysis