A Scalable Video Search Engine Based on Audio Content Indexing and Topic Segmentation

Julien Lawto; Jean-Luc Gauvain (LIMSI); Lori Lamel (LIMSI); Gregory Grefenstete; Guillaume Gravier (INRIA - IRISA); Julien Despres; Camille Guinaudeau (INRIA - IRISA); Pascale Sébillot (INRIA - IRISA)

arXiv:1111.6265 · cs.MM · November 29, 2011 · 1 cites

Open Access

TL;DR

This paper introduces a scalable video search engine that uses audio content indexing and topic segmentation, enabling quick, focused access to relevant news broadcast segments across multiple languages and interfaces.

Contribution

It presents a novel system leveraging speech recognition and NLP for efficient, multilingual, and cross-lingual news broadcast segmentation and search, with a user-friendly interface.

Findings

01 Indexes 50 news sources in multiple languages
02 Enables cross-lingual and map-based search
03 Provides automatic textual clues for segments

Abstract

One important class of online videos is that of news broadcasts. Most news organisations provide near-immediate access to topical news broadcasts over the Internet, through RSS streams or podcasts. Until lately, technology has not made it possible for a user to automatically go to the smaller parts, within a longer broadcast, that might interest them. Recent advances in both speech recognition systems and natural language processing have led to a number of robust tools that allow us to provide users with quicker, more focussed access to relevant segments of one or more news broadcast videos. Here we present our new interface for browsing or searching news broadcasts (video/audio) that exploits these new language processing tools to (i) provide immediate access to topical passages within news broadcasts, (ii) browse news broadcasts by events as well as by people, places and…

Taxonomy

Topics: Video Analysis and Summarization · Music and Audio Processing · Speech Recognition and Synthesis