Real-World Repetition Estimation by Div, Grad and Curl
Tom F.H. Runia, Cees G.M. Snoek, Arnold W.M. Smeulders

TL;DR
This paper introduces a wavelet-based method for estimating repetitions in real-world videos, handling non-static and non-stationary dynamics better than Fourier-based methods, and presents a new dataset for evaluation.
Contribution
It proposes a novel wavelet transform approach using flow differentials for repetition estimation and introduces the QUVA Repetition dataset for realistic evaluation.
Findings
Outperforms deep learning methods in repetition counting
Handles non-static, non-stationary video dynamics effectively
Provides a comprehensive theoretical framework for 2D and 3D periodicity detection
Abstract
We consider the problem of estimating repetition in video, such as performing push-ups, cutting a melon or playing violin. Existing work shows good results under the assumption of static and stationary periodicity. As realistic video is rarely perfectly static and stationary, the often preferred Fourier-based measurements is inapt. Instead, we adopt the wavelet transform to better handle non-static and non-stationary video dynamics. From the flow field and its differentials, we derive three fundamental motion types and three motion continuities of intrinsic periodicity in 3D. On top of this, the 2D perception of 3D periodicity considers two extreme viewpoints. What follows are 18 fundamental cases of recurrent perception in 2D. In practice, to deal with the variety of repetitive appearance, our theory implies measuring time-varying flow and its differentials (gradient, divergence and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
