No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding

Yingjie Zhai; Wenshuo Li; Yehui Tang; Xinghao Chen; Yunhe Wang

arXiv:2405.08344·cs.CV·May 15, 2024

TL;DR

This paper introduces SqueezeTime, a lightweight video recognition network that squeezes temporal information into channels for efficient mobile video understanding, achieving high accuracy with reduced computation.

Contribution

It proposes a novel channel-time squeezing approach and a Channel-Time Learning block to enhance temporal modeling in a lightweight network for mobile devices.

Findings

01 Achieves +1.2% accuracy on Kinetics400

02 80% GPU throughput gain over prior methods

03 Effective on multiple video benchmarks

Abstract

Current architectures for video understanding mainly build upon 3D convolutional blocks or 2D convolutions with additional operations for temporal modeling. However, these methods all regard the temporal axis as a separate dimension of the video sequence, which requires large computation and memory budgets and thus limits their usage on mobile devices. In this paper, we propose to squeeze the time axis of a video sequence into the channel dimension and present a lightweight video recognition network, termed SqueezeTime, for mobile video understanding. To enhance the temporal modeling capability of the proposed network, we design a Channel-Time Learning (CTL) Block to capture the temporal dynamics of the sequence. This module has two complementary branches, in which one branch is for temporal importance learning and another branch with temporal position restoring capability is to…
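
The core idea in the abstract, folding the time axis into the channel dimension so that ordinary 2D operations can process a whole clip, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `squeeze_time_into_channels` and `pointwise_conv2d`, the tensor sizes, and the use of a 1x1 convolution as a stand-in for the network's 2D layers are all assumptions made for illustration.

```python
import numpy as np


def squeeze_time_into_channels(video):
    """Fold the time axis into channels: (B, C, T, H, W) -> (B, C*T, H, W).

    Hypothetical helper. With C-order reshaping, frame t of channel c
    lands at squeezed channel index c * T + t.
    """
    b, c, t, h, w = video.shape
    return video.reshape(b, c * t, h, w)


def pointwise_conv2d(x, weight):
    """A 1x1 2D convolution, used here only as a stand-in for 2D layers.

    x: (B, C_in, H, W), weight: (C_out, C_in) -> (B, C_out, H, W).
    Every output channel mixes all C*T squeezed channels, so frames are
    combined without any dedicated temporal dimension.
    """
    return np.einsum("oc,bchw->bohw", weight, x)


rng = np.random.default_rng(0)
video = rng.standard_normal((2, 3, 16, 8, 8))   # assumed: 2 clips, RGB, 16 frames, 8x8
squeezed = squeeze_time_into_channels(video)     # shape (2, 48, 8, 8)
weight = rng.standard_normal((64, 3 * 16))       # assumed: 64 output channels
features = pointwise_conv2d(squeezed, weight)    # shape (2, 64, 8, 8)
```

After the reshape, the tensor no longer carries an explicit frame order; only the channel index encodes which frame a feature came from. That loss of explicit temporal structure is plausibly what the CTL Block's temporal importance and temporal position restoring branches are meant to compensate for, though the truncated abstract above does not spell out the mechanism.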

Taxonomy

Topics: Video Analysis and Summarization