Zeus: Efficiently Localizing Actions in Videos using Reinforcement   Learning

Pramod Chunduri; Jaeho Bang; Yao Lu; Joy Arulraj

arXiv:2104.06142·cs.CV·September 29, 2022·1 cites

Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning

Pramod Chunduri, Jaeho Bang, Yao Lu, Joy Arulraj

PDF

Open Access

TL;DR

ZEUS is a reinforcement learning-based system that efficiently localizes actions in videos by adaptively modifying input segments, significantly reducing computation while maintaining accuracy.

Contribution

This paper introduces ZEUS, a novel reinforcement learning approach for adaptive video segment selection to improve action localization efficiency.

Findings

01

ZEUS outperforms existing filtering techniques by up to 22.1x in efficiency.

02

ZEUS consistently meets user-specified accuracy targets.

03

Evaluation on three datasets demonstrates broad applicability.

Abstract

Detection and localization of actions in videos is an important problem in practice. State-of-the-art video analytics systems are unable to efficiently and effectively answer such action queries because actions often involve a complex interaction between objects and are spread across a sequence of frames; detecting and localizing them requires computationally expensive deep neural networks. It is also important to consider the entire sequence of frames to answer the query effectively. In this paper, we present ZEUS, a video analytics system tailored for answering action queries. We present a novel technique for efficiently answering these queries using deep reinforcement learning. ZEUS trains a reinforcement learning agent that learns to adaptively modify the input video segments that are subsequently sent to an action classification network. The agent alters the input segments along…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Generative Adversarial Networks and Image Synthesis