Taylor Videos for Action Recognition

Lei Wang; Xiuyuan Yuan; Tom Gedeon; Liang Zheng

arXiv:2402.03019·cs.CV·May 13, 2024·1 cites

Taylor Videos for Action Recognition

Lei Wang, Xiuyuan Yuan, Tom Gedeon, Liang Zheng

PDF

Open Access 1 Repo

TL;DR

The paper introduces Taylor videos, a novel motion representation for action recognition that emphasizes dominant motions by approximating motion functions through Taylor series expansion, improving recognition accuracy across multiple architectures.

Contribution

We propose the Taylor video format, which captures dominant motions via Taylor series expansion, enhancing action recognition performance over traditional RGB and optical flow inputs.

Findings

01

Taylor videos improve action recognition accuracy.

02

Fusion of Taylor videos with RGB or optical flow boosts performance.

03

Taylor skeleton sequences outperform original skeletons in skeleton-based recognition.

Abstract

Effectively extracting motions from video is a critical and long-standing problem for action recognition. This problem is very challenging because motions (i) do not have an explicit form, (ii) have various concepts such as displacement, velocity, and acceleration, and (iii) often contain noise caused by unstable pixels. Addressing these challenges, we propose the Taylor video, a new video format that highlights the dominate motions (e.g., a waving hand) in each of its frames named the Taylor frame. Taylor video is named after Taylor series, which approximates a function at a given point using important terms. In the scenario of videos, we define an implicit motion-extraction function which aims to extract motions from video temporal block. In this block, using the frames, the difference frames, and higher-order difference frames, we perform Taylor expansion to approximate this function…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leiwangr/video-ar
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition