Automatic Genre and Show Identification of Broadcast Media

Mortaza Doulaty; Oscar Saz; Raymond W. M. Ng; Thomas Hain

arXiv:1606.03333·cs.MM·June 13, 2016

Automatic Genre and Show Identification of Broadcast Media

Mortaza Doulaty, Oscar Saz, Raymond W. M. Ng, Thomas Hain

PDF

TL;DR

This paper presents a novel multi-modal approach using acoustic, textual, and meta-data features with LDA and SVMs for high-accuracy automatic genre and show identification in broadcast media, tested on BBC data.

Contribution

It introduces a new method combining acoustic, textual, and meta-data features with LDA and SVMs for improved media classification accuracy.

Findings

01

Achieved 98.6% accuracy for genre classification.

02

Achieved 85.7% accuracy for show identification.

03

Meta-data inclusion significantly boosts performance.

Abstract

Huge amounts of digital videos are being produced and broadcast every day, leading to giant media archives. Effective techniques are needed to make such data accessible further. Automatic meta-data labelling of broadcast media is an essential task for multimedia indexing, where it is standard to use multi-modal input for such purposes. This paper describes a novel method for automatic detection of media genre and show identities using acoustic features, textual features or a combination thereof. Furthermore the inclusion of available meta-data, such as time of broadcast, is shown to lead to very high performance. Latent Dirichlet Allocation is used to model both acoustics and text, yielding fixed dimensional representations of media recordings that can then be used in Support Vector Machines based classification. Experiments are conducted on more than 1200 hours of TV broadcasts from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.