TL;DR
This paper introduces TinyVIRAT, a new low-resolution video dataset for action recognition, and proposes a novel generative and attention-based method that significantly improves recognition accuracy in challenging surveillance scenarios.
Contribution
The paper presents a new low-resolution video dataset, TinyVIRAT, and a novel recognition method combining generative and attention mechanisms, advancing tiny action recognition in surveillance videos.
Findings
Proposed method outperforms baseline models on TinyVIRAT.
Achieved state-of-the-art results on synthetically resized datasets.
Significant improvement in recognizing tiny actions in low-resolution videos.
Abstract
The existing research in action recognition is mostly focused on high-quality videos where the action is distinctly visible. In real-world surveillance environments, the actions in videos are captured at a wide range of resolutions. Most activities occur at a distance with a small resolution and recognizing such activities is a challenging problem. In this work, we focus on recognizing tiny actions in videos. We introduce a benchmark dataset, TinyVIRAT, which contains natural low-resolution activities. The actions in TinyVIRAT videos have multiple labels and they are extracted from surveillance videos which makes them realistic and more challenging. We propose a novel method for recognizing tiny actions in videos which utilizes a progressive generative approach to improve the quality of low-resolution actions. The proposed method also consists of a weakly trained attention mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
