TL;DR
This paper introduces SSP-ReID, a framework that combines saliency and semantic parsing maps to enhance person re-identification accuracy using deep neural networks, achieving state-of-the-art results.
Contribution
The novel SSP-ReID framework effectively fuses saliency and semantic parsing clues to improve person re-identification across various network architectures and benchmarks.
Findings
Significant performance improvements over baseline models.
Achieved state-of-the-art results on three benchmarks.
Framework is compatible with multiple neural network backbones.
Abstract
Given a video or an image of a person acquired from a camera, person re-identification is the process of retrieving all instances of the same person from videos or images taken from a different camera with non-overlapping view. This task has applications in various fields, such as surveillance, forensics, robotics, multimedia. In this paper, we present a novel framework, named Saliency-Semantic Parsing Re-Identification (SSP-ReID), for taking advantage of the capabilities of both clues: saliency and semantic parsing maps, to guide a backbone convolutional neural network (CNN) to learn complementary representations that improves the results over the original backbones. The insight of fusing multiple clues is based on specific scenarios in which one response is better than another, thus favoring the combination of them to increase performance. Due to its definition, our framework can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
