Post-Processing Independent Evaluation of Sound Event Detection Systems
Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel

TL;DR
This paper introduces piPSDS, a new evaluation metric for sound event detection systems that assesses performance across various post-processing methods, providing a more comprehensive and unbiased comparison of system capabilities.
Contribution
The paper proposes piPSDS, a post-processing independent evaluation metric that generalizes PSDS, enabling fairer system comparisons without bias from post-processing choices.
Findings
piPSDS effectively evaluates systems across different post-processings.
Median filter independent PSDS (miPSDS) results are presented for DCASE Challenge systems.
The approach is implemented in an open-source package for community use.
Abstract
Due to the high variation in the application requirements of sound event detection (SED) systems, it is not sufficient to evaluate systems only in a single operating mode. Therefore, the community recently adopted the polyphonic sound detection score (PSDS) as an evaluation metric, which is the normalized area under the PSD receiver operating characteristic (PSD-ROC). It summarizes the system performance over a range of operating modes resulting from varying the decision threshold that is used to translate the system output scores into a binary detection output. Hence, it provides a more complete picture of the overall system behavior and is less biased by specific threshold tuning. However, besides the decision threshold there is also the post-processing that can be changed to enter another operating mode. In this paper we propose the post-processing independent PSDS (piPSDS) as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Advanced Adaptive Filtering Techniques
