# Interpretable and Generalizable Person Re-Identification with   Query-Adaptive Convolution and Temporal Lifting

**Authors:** Shengcai Liao, Ling Shao

arXiv: 1904.10424 · 2021-04-13

## TL;DR

This paper introduces QAConv, a query-adaptive convolution approach for person re-identification that enhances interpretability and generalization across unseen scenarios, combined with a temporal lifting method for improved accuracy.

## Contribution

The paper proposes a novel query-adaptive convolution method for interpretable and generalizable person re-identification, along with a temporal lifting technique for state-of-the-art performance.

## Key findings

- QAConv outperforms popular methods by over 10% mAP in cross-dataset tests.
- The approach achieves comparable results to transfer learning methods.
- Temporal lifting further improves re-identification accuracy.

## Abstract

For person re-identification, existing deep networks often focus on representation learning. However, without transfer learning, the learned model is fixed as is, which is not adaptable for handling various unseen scenarios. In this paper, beyond representation learning, we consider how to formulate person image matching directly in deep feature maps. We treat image matching as finding local correspondences in feature maps, and construct query-adaptive convolution kernels on the fly to achieve local matching. In this way, the matching process and results are interpretable, and this explicit matching is more generalizable than representation features to unseen scenarios, such as unknown misalignments, pose or viewpoint changes. To facilitate end-to-end training of this architecture, we further build a class memory module to cache feature maps of the most recent samples of each class, so as to compute image matching losses for metric learning. Through direct cross-dataset evaluation, the proposed Query-Adaptive Convolution (QAConv) method gains large improvements over popular learning methods (about 10%+ mAP), and achieves comparable results to many transfer learning methods. Besides, a model-free temporal cooccurrence based score weighting method called TLift is proposed, which improves the performance to a further extent, achieving state-of-the-art results in cross-dataset person re-identification. Code is available at https://github.com/ShengcaiLiao/QAConv.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.10424/full.md

## Figures

29 figures with captions in the complete paper: https://tomesphere.com/paper/1904.10424/full.md

## References

130 references — full list in the complete paper: https://tomesphere.com/paper/1904.10424/full.md

---
Source: https://tomesphere.com/paper/1904.10424