AutoVideo: An Automated Video Action Recognition System

Daochen Zha; Zaid Pervaiz Bhat; Yi-Wei Chen; Yicheng Wang; Sirui Ding,; Jiaben Chen; Kwei-Herng Lai; Mohammad Qazim Bhat; Anmoll Kumar Jain; Alfredo; Costilla Reyes; Na Zou; Xia Hu

arXiv:2108.04212·cs.CV·July 19, 2022·1 cites

AutoVideo: An Automated Video Action Recognition System

Daochen Zha, Zaid Pervaiz Bhat, Yi-Wei Chen, Yicheng Wang, Sirui Ding,, Jiaben Chen, Kwei-Herng Lai, Mohammad Qazim Bhat, Anmoll Kumar Jain, Alfredo, Costilla Reyes, Na Zou, Xia Hu

PDF

Open Access 1 Repo

TL;DR

AutoVideo is a Python-based system that automates video action recognition by providing modular pipeline construction, data-driven tuning, and a user-friendly GUI, simplifying the development process.

Contribution

It introduces a highly modular, extendable framework with comprehensive primitives and automated tuning, streamlining the creation of video action recognition solutions.

Findings

01

Automates pipeline construction and tuning for action recognition

02

Provides a user-friendly GUI for easy system interaction

03

Enables efficient development with extensive primitives

Abstract

Action recognition is an important task for video understanding with broad applications. However, developing an effective action recognition solution often requires extensive engineering efforts in building and testing different combinations of the modules and their hyperparameters. In this demo, we present AutoVideo, a Python system for automated video action recognition. AutoVideo is featured for 1) highly modular and extendable infrastructure following the standard pipeline language, 2) an exhaustive list of primitives for pipeline construction, 3) data-driven tuners to save the efforts of pipeline tuning, and 4) easy-to-use Graphical User Interface (GUI). AutoVideo is released under MIT license at https://github.com/datamllab/autovideo

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

datamllab/autovideo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Analysis and Summarization