Spatio-Temporal Video Representation Learning for AI Based Video Playback Style Prediction
Rishubh Parihar, Gaurav Ramola, Ranajit Saha, Ravi Kini, Aniket Rege,, Sudha Velusamy

TL;DR
This paper introduces a novel motion type classification approach to understand object motions in videos, enabling improved video retrieval and a recommendation system for playback styles, especially useful for power-constrained devices.
Contribution
It proposes a new motion type classifier that categorizes object motions into five primitive types, enhancing spatio-temporal video understanding and enabling a video playback style recommendation system.
Findings
Motion type classification generalizes well for video retrieval.
The approach effectively distinguishes different object motion patterns.
The system supports intelligent video playback style recommendations.
Abstract
Ever-increasing smartphone-generated video content demands intelligent techniques to edit and enhance videos on power-constrained devices. Most of the best performing algorithms for video understanding tasks like action recognition, localization, etc., rely heavily on rich spatio-temporal representations to make accurate predictions. For effective learning of the spatio-temporal representation, it is crucial to understand the underlying object motion patterns present in the video. In this paper, we propose a novel approach for understanding object motions via motion type classification. The proposed motion type classifier predicts a motion type for the video based on the trajectories of the objects present. Our classifier assigns a motion type for the given video from the following five primitive motion classes: linear, projectile, oscillatory, local and random. We demonstrate that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Analysis and Summarization · Multimodal Machine Learning Applications
