Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning
Pramod Chunduri, Jaeho Bang, Yao Lu, Joy Arulraj

TL;DR
ZEUS is a reinforcement learning-based system that efficiently localizes actions in videos by adaptively modifying input segments, significantly reducing computation while maintaining accuracy.
Contribution
This paper introduces ZEUS, a novel reinforcement learning approach for adaptive video segment selection to improve action localization efficiency.
Findings
ZEUS outperforms existing filtering techniques by up to 22.1x in efficiency.
ZEUS consistently meets user-specified accuracy targets.
Evaluation on three datasets demonstrates broad applicability.
Abstract
Detection and localization of actions in videos is an important problem in practice. State-of-the-art video analytics systems are unable to efficiently and effectively answer such action queries because actions often involve a complex interaction between objects and are spread across a sequence of frames; detecting and localizing them requires computationally expensive deep neural networks. It is also important to consider the entire sequence of frames to answer the query effectively. In this paper, we present ZEUS, a video analytics system tailored for answering action queries. We present a novel technique for efficiently answering these queries using deep reinforcement learning. ZEUS trains a reinforcement learning agent that learns to adaptively modify the input video segments that are subsequently sent to an action classification network. The agent alters the input segments along…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Generative Adversarial Networks and Image Synthesis
