Let Features Decide for Themselves: Feature Mask Network for Person Re-identification
Guodong Ding, Salman Khan, Zhenmin Tang, Fatih Porikli

TL;DR
This paper introduces a Feature Mask Network that dynamically emphasizes local details in person re-identification, significantly improving accuracy by leveraging high-level features for attention and multi-task training.
Contribution
The novel FMN architecture uses high-level features to predict masks that reweight low-level features, enhancing local detail focus for better person re-identification.
Findings
Achieved 5.3% mAP improvement on Market-1501
Achieved 9.1% mAP improvement on DukeMTMC-reID
Achieved 10.7% mAP improvement on CUHK03
Abstract
Person re-identification aims at establishing the identity of a pedestrian from a gallery that contains images of multiple people obtained from a multi-camera system. Many challenges such as occlusions, drastic lighting and pose variations across the camera views, indiscriminate visual appearances, cluttered backgrounds, imperfect detections, motion blur, and noise make this task highly challenging. While most approaches focus on learning features and metrics to derive better representations, we hypothesize that both local and global contextual cues are crucial for an accurate identity matching. To this end, we propose a Feature Mask Network (FMN) that takes advantage of ResNet high-level features to predict a feature map mask and then imposes it on the low-level features to dynamically reweight different object parts for a locally aware feature representation. This serves as an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
