Topological fingerprints for audio identification

Wojciech Reise; Ximena Fern\'andez; Maria Dominguez; Heather A.; Harrington; Mariano Beguerisse-D\'iaz

arXiv:2309.03516·cs.SD·September 8, 2023

Topological fingerprints for audio identification

Wojciech Reise, Ximena Fern\'andez, Maria Dominguez, Heather A., Harrington, Mariano Beguerisse-D\'iaz

PDF

Open Access 1 Repo

TL;DR

This paper introduces a topological audio fingerprinting method using persistent homology on mel-spectrograms, achieving robust and accurate identification of duplicate audio tracks even under distortions.

Contribution

The novel approach applies topological data analysis to audio signals, providing a new robust fingerprinting technique that outperforms existing methods under various distortions.

Findings

01

High accuracy in duplicate audio detection

02

Robustness to time stretching and pitch shifting

03

Outperforms existing fingerprinting methods

Abstract

We present a topological audio fingerprinting approach for robustly identifying duplicate audio tracks. Our method applies persistent homology on local spectral decompositions of audio signals, using filtered cubical complexes computed from mel-spectrograms. By encoding the audio content in terms of local Betti curves, our topological audio fingerprints enable accurate detection of time-aligned audio matchings. Experimental results demonstrate the accuracy of our algorithm in the detection of tracks with the same audio content, even when subjected to various obfuscations. Our approach outperforms existing methods in scenarios involving topological distortions, such as time stretching and pitch shifting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wreise/top_audio_id
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Music and Audio Processing · Image Retrieval and Classification Techniques