SpectroMap: Peak detection algorithm for audio fingerprinting
Aar\'on L\'opez-Garc\'ia

TL;DR
SpectroMap is an open-source Python-based peak detection algorithm designed for audio fingerprinting, utilizing spectrogram topological prominences to improve audio identification accuracy in various sound datasets.
Contribution
The paper introduces SpectroMap, a novel peak search algorithm for audio fingerprinting, with open-source implementation and reproducible case studies demonstrating its effectiveness.
Findings
Effective in handling urban sound datasets
Improves audio fingerprinting accuracy
Open-source tools for reproducibility
Abstract
Audio fingerprinting is a technique used to identify and match audio recordings based on their unique characteristics. It involves creating a condensed representation of an audio signal that can be used to quickly compare and match against other audio recordings. The fingerprinting process involves analyzing the audio signal to extract certain features, such as spectral content, tempo, and rhythm, among other things. In this paper, we present SpectroMap, an open-source GitHub repository for audio fingerprinting written in Python programming language. It is composed of a peak search algorithm that extracts topological prominences from a spectrogram via time-frequency bands. In this paper, we introduce the algorithm functioning with two experimental applications in a high-quality urban sound dataset and environmental audio recordings to describe how it works and how effective it is in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
