Post-Processing Independent Evaluation of Sound Event Detection Systems

Janek Ebbers; Reinhold Haeb-Umbach; Romain Serizel

arXiv:2306.15440·eess.AS·June 28, 2023·1 cites

Post-Processing Independent Evaluation of Sound Event Detection Systems

Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel

PDF

Open Access 1 Repo

TL;DR

This paper introduces piPSDS, a new evaluation metric for sound event detection systems that assesses performance across various post-processing methods, providing a more comprehensive and unbiased comparison of system capabilities.

Contribution

The paper proposes piPSDS, a post-processing independent evaluation metric that generalizes PSDS, enabling fairer system comparisons without bias from post-processing choices.

Findings

01

piPSDS effectively evaluates systems across different post-processings.

02

Median filter independent PSDS (miPSDS) results are presented for DCASE Challenge systems.

03

The approach is implemented in an open-source package for community use.

Abstract

Due to the high variation in the application requirements of sound event detection (SED) systems, it is not sufficient to evaluate systems only in a single operating mode. Therefore, the community recently adopted the polyphonic sound detection score (PSDS) as an evaluation metric, which is the normalized area under the PSD receiver operating characteristic (PSD-ROC). It summarizes the system performance over a range of operating modes resulting from varying the decision threshold that is used to translate the system output scores into a binary detection output. Hence, it provides a more complete picture of the overall system behavior and is less biased by specific threshold tuning. However, besides the decision threshold there is also the post-processing that can be changed to enter another operating mode. In this paper we propose the post-processing independent PSDS (piPSDS) as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fgnt/sed_scores_eval
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Advanced Adaptive Filtering Techniques