A quick search method for audio signals based on a piecewise linear   representation of feature trajectories

Akisato Kimura; Kunio Kashino; Takayuki Kurozumi; Hiroshi Murase

arXiv:0710.4180·cs.MM·November 10, 2011

A quick search method for audio signals based on a piecewise linear representation of feature trajectories

Akisato Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase

PDF

TL;DR

This paper introduces a fast audio search method using piecewise linear feature trajectory representation, significantly reducing search time while maintaining accuracy in long unlabeled audio streams.

Contribution

It proposes a novel feature-dimension reduction technique based on piecewise linear trajectories and segment-based KL transform for rapid audio similarity search.

Findings

01

Search time reduced to 1/12 of previous methods

02

Detected queries in approximately 0.3 seconds from 200-hour database

03

Maintains search accuracy despite dimension reduction

Abstract

This paper presents a new method for a quick similarity-based search through long unlabeled audio streams to detect and locate audio clips provided by users. The method involves feature-dimension reduction based on a piecewise linear representation of a sequential feature trajectory extracted from a long audio stream. Two techniques enable us to obtain a piecewise linear representation: the dynamic segmentation of feature trajectories and the segment-based Karhunen-L\'{o}eve (KL) transform. The proposed search method guarantees the same search results as the search method without the proposed feature-dimension reduction method in principle. Experiment results indicate significant improvements in search speed. For example the proposed method reduced the total search time to approximately 1/12 that of previous methods and detected queries in approximately 0.3 seconds from a 200-hour audio…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.