SGE net: Video object detection with squeezed GRU and information   entropy map

Rui Su; Wenjing Huang; Haoyu Ma; Xiaowei Song; Jinglu Hu

arXiv:2106.07224·cs.CV·June 15, 2021

SGE net: Video object detection with squeezed GRU and information entropy map

Rui Su, Wenjing Huang, Haoyu Ma, Xiaowei Song, Jinglu Hu

PDF

Open Access

TL;DR

This paper introduces SGE-Net, an efficient video object detection method that combines a channel-reduced GRU and an information entropy map, achieving higher accuracy with fewer parameters.

Contribution

The paper proposes a novel Squeezed GRU and entropy-based attention mechanism for improved efficiency and accuracy in video object detection.

Findings

01

mAP increased by 3.7 over baseline

02

Parameters reduced from 6.33M to 0.67M

03

Enhanced classification performance with entropy map

Abstract

Recently, deep learning based video object detection has attracted more and more attention. Compared with object detection of static images, video object detection is more challenging due to the motion of objects, while providing rich temporal information. The RNN-based algorithm is an effective way to enhance detection performance in videos with temporal information. However, most studies in this area only focus on accuracy while ignoring the calculation cost and the number of parameters. In this paper, we propose an efficient method that combines channel-reduced convolutional GRU (Squeezed GRU), and Information Entropy map for video object detection (SGE-Net). The experimental results validate the accuracy improvement, computational savings of the Squeezed GRU, and superiority of the information entropy attention mechanism on the classification performance. The mAP has increased by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Domain Adaptation and Few-Shot Learning

MethodsGated Recurrent Unit