Approach for Video Classification with Multi-label on YouTube-8M Dataset

Kwangsoo Shin; Junhyeong Jeon; Seungbin Lee; Boyoung Lim; Minsoo Jeong; Jongho Nang

arXiv:1808.08671 · cs.CV · October 16, 2018

TL;DR

This paper presents a method for multi-label video classification on YouTube-8M using NetVLAD and NetFV models with the Huber loss function, reaching a GAP score of 0.8668 after hyperparameter optimization.

Contribution

The paper applies NetVLAD and NetFV models with the Huber loss function to multi-label video classification on the large-scale YouTube-8M dataset.

Findings

01. Achieved a GAP score of 0.8668 on YouTube-8M.

02. Optimized hyperparameters for better performance.

03. Validated effectiveness of NetVLAD and NetFV models.
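
For readers unfamiliar with the metric, GAP (Global Average Precision) can be sketched as follows. This is a minimal illustration, not the paper's or the challenge's evaluation code. It assumes the commonly used YouTube-8M convention of keeping the top 20 predictions per video, pooling them across all videos, and computing average precision over the pooled, score-sorted list. The function name `global_average_precision` and its input layout are hypothetical.

```python
def global_average_precision(predictions, labels, top_k=20):
    """Sketch of GAP under the assumptions stated above.

    predictions: list of dicts mapping label -> confidence, one dict per video.
    labels: list of sets holding the ground-truth labels, one set per video.
    top_k: predictions kept per video (20 is an assumed convention).
    """
    pooled = []
    total_positives = 0
    for scores, truth in zip(predictions, labels):
        total_positives += len(truth)
        # Keep only the top_k highest-scoring labels for this video.
        top = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
        pooled.extend((score, label in truth) for label, score in top)

    # Rank every kept prediction from all videos by confidence.
    pooled.sort(key=lambda pair: pair[0], reverse=True)

    hits = 0
    precision_sum = 0.0
    for rank, (_, is_hit) in enumerate(pooled, start=1):
        if is_hit:
            hits += 1
            # Precision at this rank; recall rises by 1 / total_positives.
            precision_sum += hits / rank
    return precision_sum / total_positives if total_positives else 0.0
```

With two videos whose highest-scoring label is the correct one in each case, this sketch returns 1.0; the score drops as correct labels fall lower in the pooled ranking.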

Abstract

Video traffic is increasing at a considerable rate due to the spread of personal media and advancements in media technology. Accordingly, there is a growing need for techniques that automatically classify video. This paper uses NetVLAD and NetFV models with the Huber loss function for the video classification problem and uses the YouTube-8M dataset to verify the approach experimentally. We tried various approaches suited to the dataset and optimized the hyperparameters, ultimately obtaining a GAP score of 0.8668.
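
The abstract names the Huber loss without defining it. As a hedged illustration, the standard textbook form is quadratic for small errors and linear for large ones, which makes it less sensitive to outliers than squared error. The sketch below assumes that standard form with a threshold `delta`; the abstract does not state which `delta` the authors used or how the loss was wired into their models, so the function name, default value, and mean reduction here are assumptions.

```python
def huber_loss(y_true, y_pred, delta=1.0):
    """Mean Huber loss over paired targets and predictions.

    delta is the switch point between the quadratic and linear regimes;
    1.0 is an illustrative default, not a value taken from the paper.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("inputs must be non-empty")

    total = 0.0
    for target, pred in zip(y_true, y_pred):
        err = abs(target - pred)
        if err <= delta:
            # Small error: behaves like half the squared error.
            total += 0.5 * err * err
        else:
            # Large error: grows linearly, limiting the influence of outliers.
            total += delta * (err - 0.5 * delta)
    return total / len(y_true)
```

For a multi-label setup, `y_true` would hold the 0/1 label indicators for a video and `y_pred` the predicted scores for the same labels.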

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Text and Document Classification Technologies · Video Analysis and Summarization

Methods: Huber loss