An Effective Way to Improve YouTube-8M Classification Accuracy in Google Cloud Platform
Zhenzhen Zhong, Shujiao Huang, Cheng Zhan, Licheng Zhang, Zhiwei Xiao,, Chang-Chun Wang, Pei Yang

TL;DR
This paper explores improving multi-label video classification on the YouTube-8M dataset by leveraging ensemble methods, achieving a significant accuracy increase from 77% to 80.7% in Google Cloud Platform.
Contribution
It introduces ensemble techniques to enhance classification accuracy on YouTube-8M, building upon existing baselines and participating in a Kaggle challenge.
Findings
Global prediction accuracy improved from 77% to 80.7%.
Ensemble methods effectively boost classification performance.
Baseline models serve as a foundation for further improvements.
Abstract
Large-scale datasets have played a significant role in progress of neural network and deep learning areas. YouTube-8M is such a benchmark dataset for general multi-label video classification. It was created from over 7 million YouTube videos (450,000 hours of video) and includes video labels from a vocabulary of 4716 classes (3.4 labels/video on average). It also comes with pre-extracted audio & visual features from every second of video (3.2 billion feature vectors in total). Google cloud recently released the datasets and organized 'Google Cloud & YouTube-8M Video Understanding Challenge' on Kaggle. Competitors are challenged to develop classification algorithms that assign video-level labels using the new and improved Youtube-8M V2 dataset. Inspired by the competition, we started exploration of audio understanding and classification using deep learning algorithms and ensemble…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection
