Channel Recurrent Attention Networks for Video Pedestrian Retrieval

Pengfei Fang; Pan Ji; Jieming Zhou; Lars Petersson; Mehrtash Harandi

arXiv:2010.03108·cs.CV·October 8, 2020

Channel Recurrent Attention Networks for Video Pedestrian Retrieval

Pengfei Fang, Pan Ji, Jieming Zhou, Lars Petersson, Mehrtash Harandi

PDF

Open Access

TL;DR

This paper introduces a novel fully attentional network with channel recurrent attention for video pedestrian retrieval, leveraging recurrent neural networks to identify attention maps and improve global feature representation.

Contribution

It proposes a new channel recurrent attention mechanism that jointly models spatial and channel patterns for enhanced video pedestrian retrieval.

Findings

01

Outperforms state-of-the-art on standard benchmarks

02

Demonstrates effectiveness through ablation studies

03

Builds a global receptive field via recurrent learning

Abstract

Full attention, which generates an attention value per element of the input feature maps, has been successfully demonstrated to be beneficial in visual tasks. In this work, we propose a fully attentional network, termed {\it channel recurrent attention network}, for the task of video pedestrian retrieval. The main attention unit, \textit{channel recurrent attention}, identifies attention maps at the frame level by jointly leveraging spatial and channel patterns via a recurrent neural network. This channel recurrent attention is designed to build a global receptive field by recurrently receiving and learning the spatial vectors. Then, a \textit{set aggregation} cell is employed to generate a compact video representation. Empirical experimental results demonstrate the superior performance of the proposed deep network, outperforming current state-of-the-art results across standard video…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Advanced Neural Network Applications