ES-Net: Erasing Salient Parts to Learn More in Re-Identification
Dong Shen, Shuai Zhao, Jinming Hu, Hao Feng, Deng Cai, Xiaofei He

TL;DR
ES-Net improves re-identification accuracy by erasing salient regions during training, encouraging the model to learn more comprehensive features, validated by superior results on multiple benchmarks.
Contribution
The paper introduces ES-Net, a novel approach that erases salient areas to enhance feature diversity in re-ID models, with a new trainable pooling layer for better performance.
Findings
Outperforms state-of-the-art on multiple re-ID benchmarks.
Achieves high mAP and Rank-1 rates on Person and Vehicle re-ID datasets.
Provides human-interpretable visual explanations for ranking results.
Abstract
As an instance-level recognition problem, re-identification (re-ID) requires models to capture diverse features. However, with continuous training, re-ID models pay more and more attention to the salient areas. As a result, the model may only focus on few small regions with salient representations and ignore other important information. This phenomenon leads to inferior performance, especially when models are evaluated on small inter-identity variation data. In this paper, we propose a novel network, Erasing-Salient Net (ES-Net), to learn comprehensive features by erasing the salient areas in an image. ES-Net proposes a novel method to locate the salient areas by the confidence of objects and erases them efficiently in a training batch. Meanwhile, to mitigate the over-erasing problem, this paper uses a trainable pooling layer P-pooling that generalizes global max and global average…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
