A Review of Machine Learning Methods Applied to Video Analysis Systems
Marios S. Pattichis, Venkatesh Jatla, Alvaro E. Ullao Cerna

TL;DR
This survey reviews machine learning techniques for video analysis, focusing on deep learning methods, dataset challenges, and approaches that reduce labeled data requirements, including self-supervised, semi-supervised, active, and zero-shot learning.
Contribution
It provides a comprehensive overview of recent machine learning methods tailored for video analysis, emphasizing techniques that operate effectively with limited labeled data and real-world complexities.
Findings
Deep learning methods vary in performance across datasets.
Low-parameter models are effective for single-activity detection.
Modern techniques reduce the need for extensive labeled data.
Abstract
The paper provides a survey of the development of machine-learning techniques for video analysis. The survey provides a summary of the most popular deep learning methods used for human activity recognition. We discuss how popular architectures perform on standard datasets and highlight the differences from real-life datasets dominated by multiple activities performed by multiple participants over long periods. For real-life datasets, we describe the use of low-parameter models (with 200X or 1,000X fewer parameters) that are trained to detect a single activity after the relevant objects have been successfully detected. Our survey then turns to a summary of machine learning methods that are specifically developed for working with a small number of labeled video samples. Our goal here is to describe modern techniques that are specifically designed so as to minimize the amount of ground…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Data Stream Mining Techniques
